Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / Bergvca/string_grouper issues and pull requests
#98 - Feature/upgrade topn
Pull Request -
State: open - Opened by icfly2 about 2 months ago
#97 - Update string_grouper.py for pandas
Pull Request -
State: open - Opened by crc32 5 months ago
#96 - value too large to convert to int , help, How can just a few tens of thousands of strings not work? TKS ~~
Issue -
State: open - Opened by honglitao 6 months ago
- 2 comments
#95 - Ensure requirements are more explicit to create reproducible builds
Pull Request -
State: closed - Opened by robinknox 8 months ago
#94 - Change applymap to map to remove "FutureWarning: DataFrame.applymap has been deprecated. Use DataFrame.map instead."
Pull Request -
State: closed - Opened by robinknox 10 months ago
#93 - sparse_dot_topn_for_blocks dependency broken with Cython 3.0
Issue -
State: open - Opened by alexiskaldany 12 months ago
- 8 comments
#92 - Ngram re-use
Issue -
State: open - Opened by hyshandler about 1 year ago
- 1 comment
#91 - Module compiled against API version 0x10 but this version of numpy is 0xf
Issue -
State: closed - Opened by jpswagner about 1 year ago
- 1 comment
#90 - Installation not working: Massive error message
Issue -
State: open - Opened by MatthewMishek over 1 year ago
- 3 comments
#89 - Import Error
Issue -
State: closed - Opened by ZhimingMei over 1 year ago
- 4 comments
#88 - Tips for working with large datasets
Issue -
State: open - Opened by ryangdar over 1 year ago
- 1 comment
#87 - How to cite this work?
Issue -
State: open - Opened by skwolvie over 1 year ago
- 1 comment
#86 - Unable to allocate 8.41 GiB for an array with shape (2258174000,) and data type int32
Issue -
State: open - Opened by BudiZhu almost 2 years ago
#85 - Numpy version issue?
Issue -
State: open - Opened by arnicas almost 2 years ago
- 2 comments
#84 - Some general questions about the package
Issue -
State: open - Opened by eneszv almost 2 years ago
#83 - added ability to cache matrix in queries across which `master` is constant
Pull Request -
State: open - Opened by ParticularMiner about 2 years ago
- 8 comments
#82 - updated CHANGELOG.md to contain certain missing notable changes
Pull Request -
State: closed - Opened by ParticularMiner over 2 years ago
- 1 comment
#81 - Different matching behavior across versions
Issue -
State: closed - Opened by probablyfine over 2 years ago
- 9 comments
#80 - Question about version string_grouper group_similar_strings
Issue -
State: open - Opened by dariswan over 2 years ago
- 4 comments
#79 - using string_grouper with a lookup column in orginal source data
Issue -
State: closed - Opened by campbellalex321 over 2 years ago
- 13 comments
#78 - able to change default cosine similarity of .8?
Issue -
State: closed - Opened by campbellalex321 over 2 years ago
- 2 comments
#77 - Could not install string-grouper
Issue -
State: closed - Opened by campbellalex321 over 2 years ago
- 2 comments
#76 - Question / suggestion to use multiple n-grams to get more features
Issue -
State: open - Opened by iibarant over 2 years ago
#75 - Formula for optimal matrix block-size
Issue -
State: open - Opened by ParticularMiner over 2 years ago
#74 - added guesstimate for n_blocks
Pull Request -
State: closed - Opened by ParticularMiner over 2 years ago
- 4 comments
#73 - Error When matching Chinese name
Issue -
State: open - Opened by ZhihaoMa over 2 years ago
- 9 comments
#72 - added blocking capabilities
Pull Request -
State: closed - Opened by ParticularMiner almost 3 years ago
- 18 comments
#71 - Jupyter Notebook installation not working
Issue -
State: open - Opened by junotown almost 3 years ago
- 1 comment
#70 - Do the groups returned by String_Grouper follow the transitive property?
Issue -
State: closed - Opened by bmanczak almost 3 years ago
- 2 comments
#69 - Question: How to have built StringGrouper corpus persist across multiple match_string calls in a programming session
Issue -
State: open - Opened by justasojourner almost 3 years ago
- 5 comments
#68 - Error when importing
Issue -
State: open - Opened by okkyadhi almost 3 years ago
- 4 comments
#67 - raise StringLengthException if vectoriser is applied to strings that …
Pull Request -
State: open - Opened by gw00207 almost 3 years ago
- 6 comments
#66 - how to handle 'ValueError: empty vocabulary; perhaps the documents only contain stop words' in group_similar_strings
Issue -
State: open - Opened by gw00207 almost 3 years ago
- 2 comments
#65 - match_string on small data series
Issue -
State: closed - Opened by berndnoll almost 3 years ago
- 2 comments
#64 - Question: How to achieve matching for multiple fields and priorities
Issue -
State: open - Opened by berndnoll almost 3 years ago
- 81 comments
#63 - Get an error message while running match_string
Issue -
State: open - Opened by iibarant almost 3 years ago
- 7 comments
#62 - included string_grouper_utils package specification in setup.py (bug fix)
Pull Request -
State: closed - Opened by ParticularMiner almost 3 years ago
- 1 comment
#61 - [question] How to import string_grouper_utils?
Issue -
State: open - Opened by MauKruisheer almost 3 years ago
- 5 comments
#60 - set max_n_matches=1 in match_most_similar() for a significant performance boost
Pull Request -
State: closed - Opened by ParticularMiner almost 3 years ago
#59 - Installation problem
Issue -
State: closed - Opened by skwolvie about 3 years ago
- 2 comments
#58 - updated dependency on new version of sparse_dot_topn (v0.3.1)
Pull Request -
State: closed - Opened by ParticularMiner about 3 years ago
- 6 comments
#57 - [question] Partial matching of strings
Issue -
State: open - Opened by andrei-volkau about 3 years ago
- 2 comments
#56 - created a more user-friendly error message when bad data is found
Pull Request -
State: open - Opened by ParticularMiner about 3 years ago
- 2 comments
#55 - integrated external package sparse_dot_topn into string_grouper as dedicated sub-package string_grouper_topn
Pull Request -
State: closed - Opened by ParticularMiner about 3 years ago
- 2 comments
#54 - updated string_grouper_utils.py to quell unittest deprecated warnings
Pull Request -
State: closed - Opened by ParticularMiner about 3 years ago
#53 - optimized code and fixed bug related to input Series of length 1 (discovered by user)
Pull Request -
State: closed - Opened by ParticularMiner about 3 years ago
- 4 comments
#52 - group_similar_strings with series length 1 produces ValueError
Issue -
State: closed - Opened by gw00207 about 3 years ago
- 2 comments
#51 - boosted _symmetrize_matches_list() (5x) and _get_matches_list() (33x)
Pull Request -
State: closed - Opened by ParticularMiner about 3 years ago
- 1 comment
#50 - fixed pypi README.md issues
Pull Request -
State: closed - Opened by ParticularMiner about 3 years ago
- 1 comment
#49 - [question] Does string_grouper is for English only?
Issue -
State: closed - Opened by andrei-volkau about 3 years ago
- 2 comments
#48 - New awesome: _get_true_n_max_matches() boosted
Pull Request -
State: closed - Opened by ParticularMiner about 3 years ago
- 1 comment
#47 - fixed Microsoft Windows 10 issue with numpy int datatype mismatch.
Pull Request -
State: closed - Opened by ParticularMiner about 3 years ago
- 1 comment
#46 - added StringGrouper attribute function _get_true_max_n_matches()
Pull Request -
State: closed - Opened by ParticularMiner about 3 years ago
- 9 comments
#45 - Added changelog with all changes since version 0.3.2
Pull Request -
State: closed - Opened by Bergvca about 3 years ago
- 1 comment
#44 - Add simple test workflow
Pull Request -
State: closed - Opened by StevenMaude about 3 years ago
- 17 comments
#43 - Fixed a bug which occurs when min_similarity=0 (User discovery)
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
- 10 comments
#42 - Return complete similarity matrix with get_matches() - including elements with 0 similarity
Issue -
State: open - Opened by nbcvijanovic over 3 years ago
- 4 comments
#41 - Complete Index Functionality
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
- 2 comments
#40 - Pairwise Similarity Computation (User Request)
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
- 9 comments
#39 - Function of simply calculating pairwise similarity without matching
Issue -
State: closed - Opened by KiraJYQiu over 3 years ago
- 3 comments
#38 - Complete Index Functionality
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
- 2 comments
#37 - Please Review Complete Index Functionality
Issue -
State: closed - Opened by ParticularMiner over 3 years ago
#36 - Group Connectivity Visualization may reveal other possible representatives
Issue -
State: closed - Opened by ParticularMiner over 3 years ago
- 2 comments
#35 - Index mismatch may lead to unintended consequences!
Issue -
State: closed - Opened by ParticularMiner over 3 years ago
- 2 comments
#34 - Modularized group-representative selection into string_grouper_utils.py
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
- 4 comments
#33 - Group representative functionality added
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
- 10 comments
#32 - Updated unit tests in an attempt to reach 100% code coverage.
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
- 1 comment
#31 - Updated README.md to reflect added ID functionality
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
- 5 comments
#30 - Remaining ID functionality added (2)
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
#29 - Updated StringGrouper._deduplicate() for increased performance.
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
- 1 comment
#28 - Used scipy.sparse.csgraph routine connected_components() routine to g…
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
#27 - Added functionality for using IDs in match_strings function, added/up…
Pull Request -
State: closed - Opened by justasojourner over 3 years ago
- 2 comments
#26 - On Group Representatives
Issue -
State: closed - Opened by ParticularMiner over 3 years ago
- 6 comments
#25 - Remaining ID functionality added
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
- 25 comments
#24 - Update documentation for added ID parameter & tutorial written
Pull Request -
State: closed - Opened by justasojourner over 3 years ago
- 4 comments
#23 - Updated routine for increased performance
Pull Request -
State: closed - Opened by ParticularMiner over 3 years ago
- 4 comments
#22 - Regarding ID functionality for match_most_similar() and group_similar_strings()
Issue -
State: closed - Opened by ParticularMiner over 3 years ago
- 4 comments
#21 - Optimizing ._deduplicate()
Issue -
State: closed - Opened by ParticularMiner over 3 years ago
#20 - Update documentation after match_strings code change for ID
Issue -
State: open - Opened by justasojourner over 3 years ago
- 1 comment
#19 - Add functionality to include an ID column to the string matching.
Pull Request -
State: closed - Opened by justasojourner over 3 years ago
- 4 comments
#18 - Including an ID column in String Grouper output
Issue -
State: closed - Opened by justasojourner over 3 years ago
- 6 comments
#17 - Duplicate (but swapped) right and left
Issue -
State: closed - Opened by mustafa0x over 3 years ago
- 4 comments
#16 - Setting min_similarity missing in _init_
Issue -
State: closed - Opened by aksg87 over 3 years ago
- 1 comment
#15 - Need help
Issue -
State: closed - Opened by baarsrj over 3 years ago
#14 - Adding similarity column in the group_similar_strings output
Issue -
State: open - Opened by selfcontrol7WC over 3 years ago
- 2 comments
#13 - Option to add words to list of n-grams
Pull Request -
State: closed - Opened by geokarak over 3 years ago
- 2 comments
#12 - update prior master-dupe pairings
Pull Request -
State: closed - Opened by tsajid88 over 3 years ago
- 3 comments
#11 - pip install string-grouper doesn't work
Issue -
State: closed - Opened by AlmogW almost 4 years ago
- 3 comments
#10 - Issue importing string_grouper
Issue -
State: closed - Opened by KingWiggle about 4 years ago
#9 - Update pip installer
Issue -
State: closed - Opened by cartersolutions about 4 years ago
- 1 comment
#8 - match_strings(): Any way to return additional columns?
Issue -
State: closed - Opened by dbcandle about 4 years ago
- 15 comments
#7 - ModuleNotFoundError: No module named 'string_grouper'
Issue -
State: closed - Opened by neongreen13 about 4 years ago
- 6 comments
#6 - sklearn is the wrong package, the correct one is scikit-learn
Issue -
State: closed - Opened by fjsj about 4 years ago
- 1 comment
#5 - Case is not ignored
Issue -
State: closed - Opened by flindeberg about 4 years ago
- 2 comments
#4 - Added ignore case functionality
Pull Request -
State: closed - Opened by flindeberg about 4 years ago
- 4 comments
#3 - does this work on python3.7?
Issue -
State: closed - Opened by zhihongchensg over 4 years ago
- 4 comments
Labels: question
#2 - Is it possible to swap the tfidfvectorizer with different vectorizer?
Issue -
State: closed - Opened by donnydongchen over 4 years ago
- 2 comments
#1 - time complexity & accuracy
Issue -
State: closed - Opened by HARIHARAN1103 over 4 years ago
- 1 comment