Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / dedupeio/dedupe issues and pull requests
#1101 - Remove usage of DataModel from core.py and labeler.py
Pull Request -
State: closed - Opened by NickCrews over 2 years ago
- 1 comment
#1100 - Update numpy requirement to >=1.20
Pull Request -
State: closed - Opened by benmanns over 2 years ago
- 1 comment
#1099 - Bump pypa/cibuildwheel from 2.9.0 to 2.10.1
Pull Request -
State: closed - Opened by dependabot[bot] over 2 years ago
- 1 comment
Labels: dependencies, github_actions
#1098 - Reduce responsibilities of DataModel
Pull Request -
State: closed - Opened by NickCrews over 2 years ago
- 3 comments
#1097 - Add_singletons
Pull Request -
State: closed - Opened by NickCrews over 2 years ago
- 1 comment
#1096 - Point out gephi as a debugger
Issue -
State: open - Opened by NickCrews over 2 years ago
- 2 comments
#1095 - Cut a release that includes #1087
Issue -
State: closed - Opened by NickCrews over 2 years ago
- 3 comments
#1094 - Inference time of RecordLink is too slow
Issue -
State: closed - Opened by QQSkill over 2 years ago
- 5 comments
#1093 - Error when installing dedupe on an M1 Mac with macOS 12.5.1
Issue -
State: closed - Opened by leifericf over 2 years ago
- 3 comments
#1092 - Consider HDBSCAN as clustering algorithm
Issue -
State: closed - Opened by NickCrews over 2 years ago
- 2 comments
#1091 - ConvergenceWarning during training
Issue -
State: open - Opened by NickCrews over 2 years ago
- 2 comments
#1090 - Error when reproducing Gazetteer Example
Issue -
State: open - Opened by hlra over 2 years ago
- 1 comment
#1089 - Add random_state everywhere for reproducibility
Issue -
State: closed - Opened by NickCrews over 2 years ago
- 3 comments
#1088 - Split up DataModel into predicates, rename to Featurizer
Issue -
State: open - Opened by NickCrews over 2 years ago
- 5 comments
#1087 - Don't return 0s from scores
Pull Request -
State: closed - Opened by NickCrews over 2 years ago
- 2 comments
#1086 - Auto cleanup memmap scores
Pull Request -
State: open - Opened by NickCrews over 2 years ago
- 4 comments
#1085 - transition to plugins for dedupe variables.
Issue -
State: closed - Opened by fgregg over 2 years ago
- 7 comments
#1084 - Bump pypa/cibuildwheel from 2.8.1 to 2.9.0
Pull Request -
State: closed - Opened by dependabot[bot] over 2 years ago
- 2 comments
Labels: dependencies, github_actions
#1083 - Setuptools-62
Pull Request -
State: closed - Opened by NickCrews over 2 years ago
- 1 comment
#1082 - Setuptools-63
Pull Request -
State: closed - Opened by NickCrews over 2 years ago
- 7 comments
#1081 - Move project metadata to pyproject.toml
Pull Request -
State: closed - Opened by NickCrews over 2 years ago
- 1 comment
#1080 - Remove buggy pyproject.toml from manifest.in
Pull Request -
State: closed - Opened by NickCrews over 2 years ago
#1079 - Extract predicate filtering from data model
Pull Request -
State: closed - Opened by NickCrews over 2 years ago
- 11 comments
#1078 - Documenting the guarantee that fingerprinter won't emit duplicate tokens for the same field.
Issue -
State: open - Opened by fgregg over 2 years ago
- 1 comment
#1077 - Training not providing enough matches
Issue -
State: open - Opened by tigerang22 over 2 years ago
- 26 comments
#1076 - Make cleanup of memmapped scores be more DRY
Pull Request -
State: closed - Opened by NickCrews over 2 years ago
- 1 comment
#1075 - Enforce match when 2 fields are equal
Issue -
State: closed - Opened by the-whopper over 2 years ago
- 1 comment
#1074 - NotADirectoryError: [WinError 267] The directory name is invalid: 'C:\\Users\\username\\AppData\\Local\\Temp\\tmpfb6idzyr\\blocks.db'
Issue -
State: closed - Opened by mbkupfer over 2 years ago
- 1 comment
#1073 - Bump pypa/cibuildwheel from 2.7.0 to 2.8.1
Pull Request -
State: closed - Opened by dependabot[bot] over 2 years ago
Labels: dependencies, github_actions
#1072 - Clustering scores containing 0 fails filtering
Issue -
State: closed - Opened by NickCrews over 2 years ago
- 8 comments
#1071 - Provide datasets of examples / tutorials
Issue -
State: closed - Opened by Phlogi over 2 years ago
- 1 comment
#1070 - Bump pypa/cibuildwheel from 2.7.0 to 2.8.0
Pull Request -
State: closed - Opened by dependabot[bot] over 2 years ago
- 1 comment
Labels: dependencies, github_actions
#1069 - KeyError when removing stop words in canopy_index
Issue -
State: closed - Opened by oreccb over 2 years ago
- 8 comments
#1068 - Dedupe 2.0.16 is not compatible with python 3.6
Issue -
State: closed - Opened by EdAbati over 2 years ago
- 4 comments
#1067 - Bump pypa/cibuildwheel from 2.6.1 to 2.7.0
Pull Request -
State: closed - Opened by dependabot[bot] over 2 years ago
- 1 comment
Labels: dependencies, github_actions
#1065 - Refactor labeler.py
Pull Request -
State: closed - Opened by NickCrews over 2 years ago
- 10 comments
#1054 - Remove unused `sample_size` and `blocked_proportion` from public API
Issue -
State: open - Opened by NickCrews over 2 years ago
#1046 - Inconsistent alphaNumericPredicate behavior
Issue -
State: closed - Opened by Nephirus over 2 years ago
- 3 comments
#1045 - Create __setstate__ to smooth over data model refactor
Issue -
State: closed - Opened by fgregg over 2 years ago
- 8 comments
#1044 - tests for breaking changes to settings file
Issue -
State: closed - Opened by fgregg over 2 years ago
- 2 comments
#1043 - factor linters out into separate step
Issue -
State: closed - Opened by fgregg over 2 years ago
#1032 - Deal with missing values better
Issue -
State: closed - Opened by NickCrews over 2 years ago
- 9 comments
#1029 - CanopyIndex stop word removal non-determinism
Issue -
State: closed - Opened by oreccb over 2 years ago
- 4 comments
#1025 - Record linkage as classification
Issue -
State: open - Opened by fgregg over 2 years ago
- 2 comments
#976 - consider forking levenshtein-search
Issue -
State: open - Opened by fgregg almost 3 years ago
- 1 comment
#965 - disk has reached capacity issue with moderate record size with >500 gb of free disk space
Issue -
State: open - Opened by zwarshavsky almost 3 years ago
- 15 comments
#964 - Consider holding data in sqlite table
Issue -
State: open - Opened by fgregg almost 3 years ago
- 17 comments
#940 - Performance degrades when loading/training with large labeled training file to prepare_train()
Issue -
State: open - Opened by cbhower about 3 years ago
- 12 comments
#937 - partially supervised classification
Issue -
State: open - Opened by fgregg about 3 years ago
- 1 comment
Labels: research
#856 - virtual compound predicate
Issue -
State: open - Opened by fgregg over 4 years ago
- 3 comments
Labels: enhancement