Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / J535D165/recordlinkage issues and pull requests
#207 - How do I perform deduplication with the python record linkage toolkit with large data sets?
Issue -
State: open - Opened by sidhugithub1 7 months ago
#205 - Changing norms of comparison functions
Issue -
State: open - Opened by JosephKuchar 9 months ago
#204 - recordlinkage.NaiveBayesClassifier() fit returns multiindex of all feature pairs
Issue -
State: open - Opened by MWiggins 10 months ago
#203 - Avoid np.log of zero in ECM
Pull Request -
State: open - Opened by emuccino 12 months ago
#202 - Length mismatch at
Issue -
State: open - Opened by TongmengXie about 1 year ago
#201 - automatically check how many components are defined in rl.Compare()
Issue -
State: open - Opened by bergen288 about 1 year ago
#200 - Duplicated matching columns with rl_comparer.compute while looping over zip code
Issue -
State: closed - Opened by bergen288 about 1 year ago
- 2 comments
#199 - Add pre-commit hooks
Pull Request -
State: closed - Opened by J535D165 over 1 year ago
#198 - Update the docs CI pipeline
Pull Request -
State: closed - Opened by J535D165 over 1 year ago
#197 - Update CI docs generation and CI pipeline
Pull Request -
State: closed - Opened by J535D165 over 1 year ago
#196 - Lint with Ruff and format with Black
Pull Request -
State: closed - Opened by J535D165 over 1 year ago
#195 - Replace setup.py by pyproject.toml
Pull Request -
State: closed - Opened by J535D165 over 1 year ago
#194 - Address Matching Conditional on value of another column
Issue -
State: open - Opened by konsbn over 1 year ago
- 1 comment
#193 - `ECMClassifier` returns almost all candidate pairs
Issue -
State: open - Opened by Evnsn over 1 year ago
- 2 comments
#192 - Add support for pandas==2
Pull Request -
State: closed - Opened by J535D165 over 1 year ago
#190 - Fix usage examples
Pull Request -
State: closed - Opened by martinhohoff almost 2 years ago
- 2 comments
#189 - add threshold None and label docstrings for String
Pull Request -
State: closed - Opened by davidggphy almost 2 years ago
#187 - Indexing - performance warning - full index can result in a large number of pairs
Issue -
State: open - Opened by gajghaten about 2 years ago
- 3 comments
#186 - Fix links
Pull Request -
State: closed - Opened by andyjessen about 2 years ago
#185 - update of the introduction
Pull Request -
State: closed - Opened by karpanGit about 2 years ago
#184 - Fix typo
Pull Request -
State: closed - Opened by havardox about 2 years ago
#183 - Candidate pairs issue
Issue -
State: open - Opened by Shivamkumar285 over 2 years ago
#182 - For when support for packages like Dask or Ray (or Modin)?
Issue -
State: open - Opened by ialvata over 2 years ago
#181 - Possible bug with _dedup_index when df has only 1 row.
Issue -
State: open - Opened by IavTavares over 2 years ago
#180 - missing value is not working and it is default to 0 even if we change the value.
Issue -
State: open - Opened by selva221724 over 2 years ago
- 1 comment
#179 - Support for pandas datatypes
Issue -
State: open - Opened by devmcp over 2 years ago
#178 - How to utilize prob-related methods of ECM classifier
Issue -
State: open - Opened by Ramin1368 over 2 years ago
#176 - AttributeError: module 'recordlinkage' has no attribute 'SortedNeighbourhoodIndex'
Issue -
State: open - Opened by naeemahaz over 2 years ago
- 1 comment
#175 - Data Corruptors a la GeCO
Issue -
State: open - Opened by aflaxman almost 3 years ago
#174 - Make use of nbsphinx for documentation and guides
Pull Request -
State: closed - Opened by J535D165 almost 3 years ago
- 1 comment
#173 - Remove deprecated recordlinkage classes
Pull Request -
State: closed - Opened by J535D165 almost 3 years ago
#172 - fastparquet 0.8.1: writing dataframe to parquet file from a table data field with rtf doc content falls with TypeError exception
Issue -
State: open - Opened by PavelD0770 almost 3 years ago
#171 - Bump min Python version to 3.6, ideally 3.8+
Pull Request -
State: closed - Opened by J535D165 almost 3 years ago
#170 - Fix various deprecation warnings and broken docs build
Pull Request -
State: closed - Opened by J535D165 almost 3 years ago
#169 - fixing failed build-docs action
Pull Request -
State: closed - Opened by twalen almost 3 years ago
- 1 comment
#168 - fixing broken build and removed some warnings
Pull Request -
State: closed - Opened by twalen about 3 years ago
- 1 comment
#167 - optimize Performance ?
Issue -
State: open - Opened by jigar-prajapati18 about 3 years ago
#166 - What languages are supported by this toolkit? only English?
Issue -
State: open - Opened by yoeldk about 3 years ago
#165 - compare.date
Issue -
State: open - Opened by yishairasowsky about 3 years ago
#164 - Update ref-compare.rst
Pull Request -
State: closed - Opened by hwong557 about 3 years ago
- 1 comment
#163 - Update ref-compare.rst
Pull Request -
State: closed - Opened by hwong557 about 3 years ago
- 1 comment
#162 - missing values
Issue -
State: open - Opened by yishaistreamline about 3 years ago
- 4 comments
#161 - threshold in at compere is broken
Issue -
State: open - Opened by skuam over 3 years ago
#160 - Option to return intersection of pairs returned from indexers rather than union
Issue -
State: open - Opened by chriskl over 3 years ago
#159 - Compare.compute return real score for each metric, not binary `0`/`1` after threshold.
Issue -
State: closed - Opened by oyeromenko-ebsco almost 4 years ago
- 3 comments
#158 - Fix random indexer
Pull Request -
State: closed - Opened by tteigman almost 4 years ago
- 1 comment
Labels: bug
#157 - Recordlinkage, ValueError: index of DataFrame is not unique
Issue -
State: open - Opened by lsun907 about 4 years ago
- 3 comments
#156 - Network OnetoMany docs
Issue -
State: open - Opened by Davide-Bianchi about 4 years ago
#155 - ECM algorithm on large data sets
Issue -
State: open - Opened by gnatarajanmboard about 4 years ago
- 1 comment
#154 - Update data_deduplication.rst
Pull Request -
State: closed - Opened by hwong557 about 4 years ago
#153 - Update README.rst
Pull Request -
State: closed - Opened by tylerbinski about 4 years ago
#152 - Update base.py
Pull Request -
State: closed - Opened by tylerbinski about 4 years ago
#151 - Update performance.rst
Pull Request -
State: closed - Opened by tylerbinski about 4 years ago
- 1 comment
#150 - Generating Pairs
Issue -
State: open - Opened by thbeh about 4 years ago
- 2 comments
#149 - Addition of String Comparison Method - Jaccard Similarity
Pull Request -
State: open - Opened by debadridtt over 4 years ago
#148 - Progess indicator or verbose output in Recordlinkage Python
Issue -
State: open - Opened by gsunit over 4 years ago
- 1 comment
#147 - Calculate distance in addition to similarity
Issue -
State: open - Opened by AntoineLamer over 4 years ago
#146 - Add textdistance matching algorithms in recordlinkage compare string
Issue -
State: open - Opened by rafmacalaba over 4 years ago
- 4 comments
#145 - A comparison of 2 nan columns returns: empty vocabulary; perhaps the documents only contain stop words
Issue -
State: open - Opened by micheledemeo over 4 years ago
- 1 comment
#144 - ECMClassifier throws ValueError: Unknown label type
Issue -
State: open - Opened by ayn28 over 4 years ago
- 10 comments
#143 - numeric offset vs scale
Issue -
State: open - Opened by s3afroze over 4 years ago
#142 - Using Recordlinkage on a single dataset
Issue -
State: closed - Opened by jennguo9 over 4 years ago
- 8 comments
#141 - Removed unnecessary np.sort from the _get_sorting_key_values in SNI
Pull Request -
State: closed - Opened by twalen over 4 years ago
- 1 comment
#140 - Multiple Core Issues
Issue -
State: open - Opened by logisticregress over 4 years ago
- 1 comment
#139 - Record Linkage Compare - Numeric method
Issue -
State: closed - Opened by s3afroze almost 5 years ago
#138 - More detailed missing value
Issue -
State: open - Opened by rbagd almost 5 years ago
#137 - Hey, can we expect compatability with Python 3.7 version anytime soon?
Issue -
State: open - Opened by echkay96 almost 5 years ago
- 1 comment
#136 - Issues with the geographic classification method
Issue -
State: open - Opened by JosephKuchar almost 5 years ago
- 1 comment
#135 - Fix bugs for cosine and qgram string comparisons
Pull Request -
State: closed - Opened by Rosina9700 almost 5 years ago
- 1 comment
#134 - module 'recordlinkage' has no attribute 'BernoulliEMClassifier'
Issue -
State: open - Opened by roelof89 almost 5 years ago
#133 - cosine metric throws an error
Issue -
State: open - Opened by mastreips almost 5 years ago
- 5 comments
#132 - Replace Travis by Github Actions
Pull Request -
State: closed - Opened by J535D165 almost 5 years ago
#131 - Replace nbsphinx by IPython.sphinxext.ipython_directive
Pull Request -
State: closed - Opened by J535D165 almost 5 years ago
#130 - Fix bug in low memory random sampling
Pull Request -
State: closed - Opened by J535D165 about 5 years ago
Labels: bug
#129 - Question on the numerical similarity measure exp
Issue -
State: closed - Opened by GordonAn about 5 years ago
- 4 comments
#128 - random_pairs_without_replacement_large_frames does not create random_state when it's not given as a parameter
Issue -
State: closed - Opened by pauldg about 5 years ago
- 1 comment
#127 - cannot import name 'SKLearnClassifier' from 'recordlinkage.adapters'
Issue -
State: closed - Opened by soft-kelly about 5 years ago
- 2 comments
#126 - ValueError: The truth value of a DataFrame is ambiguous. Use a.empty, a.bool(), a.item(), a.any() or a.all().
Issue -
State: closed - Opened by Shonexu about 5 years ago
- 3 comments
Labels: help wanted, good-first-issue
#125 - AttributeError: 'list' object has no attribute 'rename'
Issue -
State: closed - Opened by Dragut about 5 years ago
- 1 comment
#124 - Initialize Compare with (a list of) features
Pull Request -
State: closed - Opened by jpweytjens about 5 years ago
- 3 comments
#123 - Date comparison problem
Issue -
State: closed - Opened by Dragut over 5 years ago
#122 - Fix single value columns for ECM classifier
Pull Request -
State: closed - Opened by J535D165 over 5 years ago
- 4 comments
#121 - fixed typo in ref-index.rst
Pull Request -
State: closed - Opened by LuciaBaldassini over 5 years ago
- 2 comments
#120 - K-Fold Cross validation in Record Linkage
Issue -
State: open - Opened by mayerantoine over 5 years ago
- 1 comment
#119 - Testing
Issue -
State: closed - Opened by EdgarSanchez1796 over 5 years ago
#118 - Jellyfish returns wrong Jaro Winkler distance if cjellyfish is not available
Issue -
State: closed - Opened by jpweytjens over 5 years ago
- 1 comment
#117 - Can't initialize Compare with list of features
Issue -
State: closed - Opened by jpweytjens over 5 years ago
#116 - Fixed grammar error
Pull Request -
State: closed - Opened by tknuth over 5 years ago
- 1 comment
#115 - Division by Zero Error in blocking indexer
Issue -
State: closed - Opened by Dragut over 5 years ago
- 1 comment
#114 - Function that returns actual records pairs (not count) of the confusion matrix- feature
Issue -
State: open - Opened by mayerantoine over 5 years ago
#113 - Fuzzy matching
Issue -
State: open - Opened by shreyaspuranik over 5 years ago
- 4 comments
#112 - Obtaining the Jaro-Winkler score
Issue -
State: closed - Opened by SultanOrazbayev over 5 years ago
- 2 comments
#111 - Incremental Training
Issue -
State: open - Opened by ahmed-emam over 5 years ago
- 2 comments
#110 - Double Metaphone - Feature request
Issue -
State: open - Opened by mayerantoine over 5 years ago
#107 - Max weighted matching
Pull Request -
State: open - Opened by jpweytjens over 5 years ago
- 6 comments
#102 - correctly parse n_jobs=-1 (#96)
Pull Request -
State: closed - Opened by jpweytjens over 5 years ago
- 3 comments
#101 - remove non binary columns, fixes #75
Pull Request -
State: closed - Opened by jpweytjens over 5 years ago
- 4 comments
#100 - ECM Classisifer - Wrong True positive - Runtime Warning: divide by zero encountered in log
Issue -
State: closed - Opened by mayerantoine over 5 years ago
- 2 comments
#98 - Help with creating a MultiIndex when training a classifier
Issue -
State: closed - Opened by cacciatc almost 6 years ago
#95 - How to do a "LEFT JOIN", enforce on dataset to be on the comparison?
Issue -
State: open - Opened by ccrvlh almost 6 years ago
- 1 comment