Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / Bergvca/string_grouper issues and pull requests

#98 - Feature/upgrade topn

Pull Request - State: open - Opened by icfly2 about 2 months ago

#97 - Update string_grouper.py for pandas

Pull Request - State: open - Opened by crc32 5 months ago

#93 - sparse_dot_topn_for_blocks dependency broken with Cython 3.0

Issue - State: open - Opened by alexiskaldany 12 months ago - 8 comments

#92 - Ngram re-use

Issue - State: open - Opened by hyshandler about 1 year ago - 1 comment

#91 - Module compiled against API version 0x10 but this version of numpy is 0xf

Issue - State: closed - Opened by jpswagner about 1 year ago - 1 comment

#90 - Installation not working: Massive error message

Issue - State: open - Opened by MatthewMishek over 1 year ago - 3 comments

#89 - Import Error

Issue - State: closed - Opened by ZhimingMei over 1 year ago - 4 comments

#88 - Tips for working with large datasets

Issue - State: open - Opened by ryangdar over 1 year ago - 1 comment

#87 - How to cite this work?

Issue - State: open - Opened by skwolvie over 1 year ago - 1 comment

#85 - Numpy version issue?

Issue - State: open - Opened by arnicas almost 2 years ago - 2 comments

#84 - Some general questions about the package

Issue - State: open - Opened by eneszv almost 2 years ago

#83 - added ability to cache matrix in queries across which `master` is constant

Pull Request - State: open - Opened by ParticularMiner about 2 years ago - 8 comments

#82 - updated CHANGELOG.md to contain certain missing notable changes

Pull Request - State: closed - Opened by ParticularMiner over 2 years ago - 1 comment

#81 - Different matching behavior across versions

Issue - State: closed - Opened by probablyfine over 2 years ago - 9 comments

#80 - Question about version string_grouper group_similar_strings

Issue - State: open - Opened by dariswan over 2 years ago - 4 comments

#79 - using string_grouper with a lookup column in orginal source data

Issue - State: closed - Opened by campbellalex321 over 2 years ago - 13 comments

#78 - able to change default cosine similarity of .8?

Issue - State: closed - Opened by campbellalex321 over 2 years ago - 2 comments

#77 - Could not install string-grouper

Issue - State: closed - Opened by campbellalex321 over 2 years ago - 2 comments

#75 - Formula for optimal matrix block-size

Issue - State: open - Opened by ParticularMiner over 2 years ago

#74 - added guesstimate for n_blocks

Pull Request - State: closed - Opened by ParticularMiner over 2 years ago - 4 comments

#73 - Error When matching Chinese name

Issue - State: open - Opened by ZhihaoMa over 2 years ago - 9 comments

#72 - added blocking capabilities

Pull Request - State: closed - Opened by ParticularMiner almost 3 years ago - 18 comments

#71 - Jupyter Notebook installation not working

Issue - State: open - Opened by junotown almost 3 years ago - 1 comment

#70 - Do the groups returned by String_Grouper follow the transitive property?

Issue - State: closed - Opened by bmanczak almost 3 years ago - 2 comments

#68 - Error when importing

Issue - State: open - Opened by okkyadhi almost 3 years ago - 4 comments

#67 - raise StringLengthException if vectoriser is applied to strings that …

Pull Request - State: open - Opened by gw00207 almost 3 years ago - 6 comments

#65 - match_string on small data series

Issue - State: closed - Opened by berndnoll almost 3 years ago - 2 comments

#64 - Question: How to achieve matching for multiple fields and priorities

Issue - State: open - Opened by berndnoll almost 3 years ago - 81 comments

#63 - Get an error message while running match_string

Issue - State: open - Opened by iibarant almost 3 years ago - 7 comments

#62 - included string_grouper_utils package specification in setup.py (bug fix)

Pull Request - State: closed - Opened by ParticularMiner almost 3 years ago - 1 comment

#61 - [question] How to import string_grouper_utils?

Issue - State: open - Opened by MauKruisheer almost 3 years ago - 5 comments

#59 - Installation problem

Issue - State: closed - Opened by skwolvie about 3 years ago - 2 comments

#58 - updated dependency on new version of sparse_dot_topn (v0.3.1)

Pull Request - State: closed - Opened by ParticularMiner about 3 years ago - 6 comments

#57 - [question] Partial matching of strings

Issue - State: open - Opened by andrei-volkau about 3 years ago - 2 comments

#56 - created a more user-friendly error message when bad data is found

Pull Request - State: open - Opened by ParticularMiner about 3 years ago - 2 comments

#53 - optimized code and fixed bug related to input Series of length 1 (discovered by user)

Pull Request - State: closed - Opened by ParticularMiner about 3 years ago - 4 comments

#52 - group_similar_strings with series length 1 produces ValueError

Issue - State: closed - Opened by gw00207 about 3 years ago - 2 comments

#51 - boosted _symmetrize_matches_list() (5x) and _get_matches_list() (33x)

Pull Request - State: closed - Opened by ParticularMiner about 3 years ago - 1 comment

#50 - fixed pypi README.md issues

Pull Request - State: closed - Opened by ParticularMiner about 3 years ago - 1 comment

#49 - [question] Does string_grouper is for English only?

Issue - State: closed - Opened by andrei-volkau about 3 years ago - 2 comments

#48 - New awesome: _get_true_n_max_matches() boosted

Pull Request - State: closed - Opened by ParticularMiner about 3 years ago - 1 comment

#47 - fixed Microsoft Windows 10 issue with numpy int datatype mismatch.

Pull Request - State: closed - Opened by ParticularMiner about 3 years ago - 1 comment

#46 - added StringGrouper attribute function _get_true_max_n_matches()

Pull Request - State: closed - Opened by ParticularMiner about 3 years ago - 9 comments

#45 - Added changelog with all changes since version 0.3.2

Pull Request - State: closed - Opened by Bergvca about 3 years ago - 1 comment

#44 - Add simple test workflow

Pull Request - State: closed - Opened by StevenMaude about 3 years ago - 17 comments

#43 - Fixed a bug which occurs when min_similarity=0 (User discovery)

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago - 10 comments

#41 - Complete Index Functionality

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago - 2 comments

#40 - Pairwise Similarity Computation (User Request)

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago - 9 comments

#39 - Function of simply calculating pairwise similarity without matching

Issue - State: closed - Opened by KiraJYQiu over 3 years ago - 3 comments

#38 - Complete Index Functionality

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago - 2 comments

#37 - Please Review Complete Index Functionality

Issue - State: closed - Opened by ParticularMiner over 3 years ago

#36 - Group Connectivity Visualization may reveal other possible representatives

Issue - State: closed - Opened by ParticularMiner over 3 years ago - 2 comments

#35 - Index mismatch may lead to unintended consequences!

Issue - State: closed - Opened by ParticularMiner over 3 years ago - 2 comments

#34 - Modularized group-representative selection into string_grouper_utils.py

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago - 4 comments

#33 - Group representative functionality added

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago - 10 comments

#32 - Updated unit tests in an attempt to reach 100% code coverage.

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago - 1 comment

#31 - Updated README.md to reflect added ID functionality

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago - 5 comments

#30 - Remaining ID functionality added (2)

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago

#29 - Updated StringGrouper._deduplicate() for increased performance.

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago - 1 comment

#27 - Added functionality for using IDs in match_strings function, added/up…

Pull Request - State: closed - Opened by justasojourner over 3 years ago - 2 comments

#26 - On Group Representatives

Issue - State: closed - Opened by ParticularMiner over 3 years ago - 6 comments

#25 - Remaining ID functionality added

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago - 25 comments

#24 - Update documentation for added ID parameter & tutorial written

Pull Request - State: closed - Opened by justasojourner over 3 years ago - 4 comments

#23 - Updated routine for increased performance

Pull Request - State: closed - Opened by ParticularMiner over 3 years ago - 4 comments

#21 - Optimizing ._deduplicate()

Issue - State: closed - Opened by ParticularMiner over 3 years ago

#20 - Update documentation after match_strings code change for ID

Issue - State: open - Opened by justasojourner over 3 years ago - 1 comment

#19 - Add functionality to include an ID column to the string matching.

Pull Request - State: closed - Opened by justasojourner over 3 years ago - 4 comments

#18 - Including an ID column in String Grouper output

Issue - State: closed - Opened by justasojourner over 3 years ago - 6 comments

#17 - Duplicate (but swapped) right and left

Issue - State: closed - Opened by mustafa0x over 3 years ago - 4 comments

#16 - Setting min_similarity missing in _init_

Issue - State: closed - Opened by aksg87 over 3 years ago - 1 comment

#15 - Need help

Issue - State: closed - Opened by baarsrj over 3 years ago

#14 - Adding similarity column in the group_similar_strings output

Issue - State: open - Opened by selfcontrol7WC over 3 years ago - 2 comments

#13 - Option to add words to list of n-grams

Pull Request - State: closed - Opened by geokarak over 3 years ago - 2 comments

#12 - update prior master-dupe pairings

Pull Request - State: closed - Opened by tsajid88 over 3 years ago - 3 comments

#11 - pip install string-grouper doesn't work

Issue - State: closed - Opened by AlmogW almost 4 years ago - 3 comments

#10 - Issue importing string_grouper

Issue - State: closed - Opened by KingWiggle about 4 years ago

#9 - Update pip installer

Issue - State: closed - Opened by cartersolutions about 4 years ago - 1 comment

#8 - match_strings(): Any way to return additional columns?

Issue - State: closed - Opened by dbcandle about 4 years ago - 15 comments

#7 - ModuleNotFoundError: No module named 'string_grouper'

Issue - State: closed - Opened by neongreen13 about 4 years ago - 6 comments

#6 - sklearn is the wrong package, the correct one is scikit-learn

Issue - State: closed - Opened by fjsj about 4 years ago - 1 comment

#5 - Case is not ignored

Issue - State: closed - Opened by flindeberg about 4 years ago - 2 comments

#4 - Added ignore case functionality

Pull Request - State: closed - Opened by flindeberg about 4 years ago - 4 comments

#3 - does this work on python3.7?

Issue - State: closed - Opened by zhihongchensg over 4 years ago - 4 comments
Labels: question

#2 - Is it possible to swap the tfidfvectorizer with different vectorizer?

Issue - State: closed - Opened by donnydongchen over 4 years ago - 2 comments

#1 - time complexity & accuracy

Issue - State: closed - Opened by HARIHARAN1103 over 4 years ago - 1 comment