Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / dirty-cat/dirty_cat issues and pull requests

#556 - The link to the circle_ci artifacts seems broken

Issue - State: open - Opened by GaelVaroquaux over 1 year ago - 1 comment
Labels: bug, CI / Build

#555 - Improving DateTime detection?

Issue - State: open - Opened by GaelVaroquaux over 1 year ago
Labels: enhancement

#554 - Example to do model selection with TableVectorizer

Issue - State: open - Opened by GaelVaroquaux over 1 year ago
Labels: Documentation

#553 - Fix CircleCI artifacts not rendering

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 3 comments
Labels: bug, CI / Build, No Changelog Needed

#552 - FEA Fuzzy joining on datetime

Pull Request - State: open - Opened by jovan-stojanovic over 1 year ago - 1 comment

#551 - Maintenance before migration

Pull Request - State: open - Opened by LilianBoulard over 1 year ago - 6 comments
Labels: enhancement, Documentation, No Changelog Needed

#550 - `pkg_ressources` is deprecated

Issue - State: open - Opened by LilianBoulard over 1 year ago - 3 comments
Labels: bug, dependencies

#549 - get_ken_table_aliases needs to be added to the documented functions

Issue - State: open - Opened by GaelVaroquaux over 1 year ago - 1 comment
Labels: bug

#548 - Don't continue fitting in `GapEncoder.transform`

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago
Labels: bug

#547 - Fuzzy joining on mixed typed columns is with unequal weights

Issue - State: open - Opened by jovan-stojanovic over 1 year ago - 1 comment
Labels: bug

#546 - Examples overhaul

Pull Request - State: open - Opened by LilianBoulard over 1 year ago
Labels: enhancement, Documentation, No Changelog Needed

#544 - Example 3 does not render

Issue - State: closed - Opened by LilianBoulard over 1 year ago - 1 comment
Labels: bug, CI / Build

#543 - Improved datetime format inference

Pull Request - State: closed - Opened by LeoGrin over 1 year ago

#542 - Dynamic stable link

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 1 comment
Labels: enhancement, Documentation

#541 - Fix _set_drop_idx bug for sklearn versions >=1.2.0 and < 1.2.2

Pull Request - State: closed - Opened by LeoGrin over 1 year ago - 1 comment

#540 - TableVectorizer test failing with latest dependencies

Issue - State: closed - Opened by jovan-stojanovic over 1 year ago
Labels: bug

#539 - Add functions for exploring KEN embeddings

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 2 comments
Labels: enhancement

#538 - DOC Configure filename_pattern to run all examples during doc build

Pull Request - State: closed - Opened by lesteve over 1 year ago - 1 comment

#537 - DOC `FeatureAugmenter` typo quick fix

Pull Request - State: closed - Opened by Vincent-Maladiere over 1 year ago - 1 comment

#536 - DOC clean formatting for `01_dirty_categories`

Pull Request - State: closed - Opened by Vincent-Maladiere over 1 year ago - 1 comment

#535 - DOC simplify `print_worst_matches` for `04_fuzzy_joining_and_FeatureAugmenter`

Pull Request - State: closed - Opened by Vincent-Maladiere over 1 year ago - 2 comments

#534 - FIX Error when wrong type of fuzzy_join parameter

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 1 comment
Labels: No Changelog Needed

#533 - Make fuzzy_join's match_score error more explicit

Issue - State: closed - Opened by jovan-stojanovic over 1 year ago
Labels: bug

#532 - FIX _compute_drop_idx AttributeError with scikit-learn>=1.2.0

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 3 comments

#530 - FEA Improving fuzzy_join: numerical columns and multiple keys

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 5 comments

#529 - ENH Add warning fuzzy join on missing values

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 3 comments

#528 - FIX Examples on dev version of website

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 5 comments

#527 - Examples not rendering on the dev website version

Issue - State: closed - Opened by jovan-stojanovic over 1 year ago - 1 comment
Labels: Documentation

#526 - Fix duplicate CI workers

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 1 comment
Labels: bug, CI / Build, No Changelog Needed

#525 - Duplicated workers in CI

Issue - State: closed - Opened by LilianBoulard over 1 year ago
Labels: bug, CI / Build

#524 - Shorten fetching tests

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 6 comments
Labels: enhancement, CI / Build, No Changelog Needed

#523 - Fetching tests are halting CI

Issue - State: closed - Opened by LilianBoulard over 1 year ago
Labels: enhancement, CI / Build

#521 - fuzzy_join should accept joining on missing values

Issue - State: closed - Opened by jovan-stojanovic over 1 year ago
Labels: enhancement

#520 - ENH Exact joins with fuzzy_join equivalent to pandas.merge

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 1 comment

#519 - Exact matching: fuzzy_join columns equivalent to pandas.merge

Issue - State: closed - Opened by jovan-stojanovic over 1 year ago - 1 comment
Labels: enhancement

#518 - Fix `fetch_openml` parser warning

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 6 comments
Labels: bug

#517 - `sklearn.datasets.fetch_openml` parser warning

Issue - State: closed - Opened by LilianBoulard over 1 year ago
Labels: bug

#516 - Fix CI not running tests

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 1 comment
Labels: bug, CI / Build, No Changelog Needed

#515 - CI is not running tests

Issue - State: closed - Opened by LilianBoulard over 1 year ago
Labels: bug

#514 - Clean signatures for cleaner calls

Pull Request - State: open - Opened by LilianBoulard over 1 year ago - 3 comments
Labels: enhancement

#513 - FIX Do not densify sparse matrices in `fuzzy_join`

Pull Request - State: closed - Opened by jjerphan over 1 year ago - 4 comments

#512 - FIX Array memory error of fuzzy_join

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 14 comments

#511 - ArrayMemoryError when doing fuzzy_join on large tables

Issue - State: closed - Opened by jovan-stojanovic over 1 year ago - 1 comment
Labels: bug

#510 - Improve call readability

Issue - State: open - Opened by LilianBoulard over 1 year ago - 2 comments
Labels: enhancement

#509 - FIX fuzzy_join AttributeError

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 3 comments
Labels: No Changelog Needed

#508 - fuzzy_join error when using sparse matrix

Issue - State: closed - Opened by jovan-stojanovic over 1 year ago
Labels: bug

#507 - DOC Fix broken binder

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 8 comments

#506 - DOC Fix broken binder

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 4 comments

#505 - Replace weblinks with openml-hosted ones

Issue - State: open - Opened by jovan-stojanovic over 1 year ago
Labels: Documentation

#504 - DOC Ensures that `SimilarityEncoder` passes numpydoc validation

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 7 comments
Labels: Documentation, No Changelog Needed

#503 - Get rid of the "scalability considerations..." example

Issue - State: closed - Opened by GaelVaroquaux over 1 year ago
Labels: Documentation

#502 - Ci improvements

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago
Labels: CI / Build, No Changelog Needed

#501 - Support paths as strings in public fetching API

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago
Labels: No Changelog Needed

#500 - Show the results of deduplicate quicker in the example

Issue - State: open - Opened by GaelVaroquaux over 1 year ago
Labels: Documentation

#499 - Prepare for 0.4

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago

#498 - Revert "Support paths as strings in public fetching API"

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago

#497 - BLD Depends on `numpy>=1.21.2`

Pull Request - State: closed - Opened by jjerphan over 1 year ago - 3 comments

#496 - DOC: Add a see-also

Pull Request - State: closed - Opened by GaelVaroquaux over 1 year ago - 2 comments

#495 - Help users find relevant ken types

Issue - State: closed - Opened by GaelVaroquaux over 1 year ago
Labels: enhancement

#494 - Fix long CI install

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 8 comments
Labels: bug, CI / Build, No Changelog Needed

#493 - Parallelize parametrized tests

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 2 comments
Labels: CI / Build, No Changelog Needed

#492 - Set default value to None for similarity in SimilarityEncoder

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 1 comment
Labels: bug

#491 - Fix CI issue

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 1 comment
Labels: CI / Build

#490 - CI issue

Issue - State: closed - Opened by LilianBoulard over 1 year ago - 4 comments
Labels: CI / Build

#489 - Refactor fetching tests

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 2 comments
Labels: CI / Build, No Changelog Needed

#488 - Use categorical dtypes for low cardinality variables?

Issue - State: open - Opened by LeoGrin over 1 year ago
Labels: enhancement

#487 - Add example with KEN Wikipedia embeddings

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 6 comments

#486 - Add dynamic type checking

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 1 comment
Labels: enhancement

#485 - DOC add See also to docstring

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 2 comments

#484 - Rename SuperVectorizer to TableVectorizer

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 1 comment

#483 - DOC: css for narrow screens

Pull Request - State: closed - Opened by GaelVaroquaux over 1 year ago - 2 comments

#482 - MAINT Prepare for beta version 2 0.4.0

Pull Request - State: closed - Opened by jovan-stojanovic over 1 year ago - 1 comment

#481 - Rename SuperVectorizer to TableVectorizer

Issue - State: closed - Opened by GaelVaroquaux over 1 year ago
Labels: enhancement

#473 - Use handle_unknown=ignore in SuperVectorizer

Pull Request - State: closed - Opened by LeoGrin over 1 year ago - 7 comments

#472 - Make sure that we have the right sealso information

Issue - State: closed - Opened by GaelVaroquaux over 1 year ago
Labels: Documentation

#470 - Better threshold metric for fuzzy_join

Issue - State: open - Opened by LeoGrin over 1 year ago - 2 comments
Labels: enhancement

#458 - `SuperVectorizer` passes NaNs through, which causes many ML models to produce errors

Issue - State: closed - Opened by lsorber over 1 year ago - 7 comments
Labels: bug

#454 - `SuperVectorizer` raises a pandas `FutureWarning`

Issue - State: closed - Opened by lsorber over 1 year ago - 3 comments
Labels: bug

#453 - Support paths as strings in public fetching API

Pull Request - State: closed - Opened by LilianBoulard over 1 year ago - 3 comments
Labels: enhancement

#446 - Benchmark fuzzy join hash vectorizer

Pull Request - State: closed - Opened by LeoGrin almost 2 years ago - 2 comments

#443 - MAINT Check parameter values at init in SimilarityEncoder

Pull Request - State: closed - Opened by jovan-stojanovic almost 2 years ago - 2 comments
Labels: No Changelog Needed

#442 - Encoders do not raise parameter Value Error at initialisation

Issue - State: closed - Opened by jovan-stojanovic almost 2 years ago - 4 comments
Labels: bug

#434 - ENH Improve coverage of GapEncoder

Pull Request - State: closed - Opened by jovan-stojanovic almost 2 years ago
Labels: CI / Build, No Changelog Needed

#398 - Document attributes in `GapEncoder`

Issue - State: closed - Opened by LilianBoulard almost 2 years ago - 1 comment
Labels: Documentation

#396 - On dev doc, redirect to current page on stable branch

Issue - State: closed - Opened by LilianBoulard almost 2 years ago
Labels: enhancement, Documentation

#381 - Our binder is broken

Issue - State: closed - Opened by GaelVaroquaux almost 2 years ago - 12 comments
Labels: bug, Documentation, CI / Build

#373 - Add simple reproducible examples to features

Issue - State: open - Opened by jovan-stojanovic about 2 years ago
Labels: Documentation, meta-issue

#371 - Improve coverage of main features

Issue - State: open - Opened by jovan-stojanovic about 2 years ago
Labels: enhancement, meta-issue

#349 - Style is off for website title

Issue - State: closed - Opened by LilianBoulard about 2 years ago - 4 comments
Labels: bug

#345 - Ensure that docstrings pass numpydoc validation

Issue - State: open - Opened by LilianBoulard about 2 years ago - 17 comments
Labels: enhancement, good first issue, Documentation, meta-issue

#341 - Example 1: use DatetimeEncoder instead of isolating year

Issue - State: open - Opened by jovan-stojanovic about 2 years ago - 3 comments
Labels: Documentation

#335 - Rename "SuperVectorizer" to "TableVectorizer"

Issue - State: closed - Opened by GaelVaroquaux about 2 years ago
Labels: enhancement

#306 - Deduplication

Issue - State: closed - Opened by LilianBoulard about 2 years ago - 2 comments
Labels: enhancement

#283 - ENH Add k-folded TargetEncoder

Pull Request - State: closed - Opened by jovan-stojanovic over 2 years ago - 1 comment

#53 - improve target encoder

Issue - State: closed - Opened by GaelVaroquaux almost 6 years ago - 1 comment
Labels: enhancement