Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / fnl/segtok issues and pull requests

#26 - bug: split_contractions fails for certain patterns

Issue - State: open - Opened by MattGPT-ai 3 months ago

#25 - Remove license file from data_files

Pull Request - State: closed - Opened by PrimozGodec almost 3 years ago - 5 comments

#24 - Fix deprecation warnings due to invalid escape sequences.

Pull Request - State: closed - Opened by tirkarthi over 4 years ago - 2 comments

#23 - Deprecation warning due to invalid escape sequences

Issue - State: closed - Opened by tirkarthi over 4 years ago - 1 comment

#22 - package license

Issue - State: closed - Opened by oblute over 4 years ago - 2 comments

#21 - text to sentences segmentation

Issue - State: open - Opened by Tortoise17 about 5 years ago - 3 comments

#20 - Word tokenizer does not split apostrophe and apostrophe s

Issue - State: open - Opened by pwichmann over 5 years ago - 2 comments

#19 - Date gets segmented

Issue - State: closed - Opened by mnishant2 over 5 years ago - 4 comments
Labels: enhancement

#18 - Extensibility through custom regex or abbreviation lists?

Issue - State: closed - Opened by RyanMcCarl almost 6 years ago - 7 comments
Labels: enhancement

#17 - Issue with sentence separator

Issue - State: closed - Opened by arcticOak2 about 6 years ago - 4 comments

#16 - Over-splitting on quotes with names.

Issue - State: open - Opened by jakepoz over 6 years ago - 1 comment
Labels: enhancement

#15 - Broken sentence terminal splice at token boundary in some cases.

Pull Request - State: closed - Opened by gkucsko over 6 years ago - 2 comments

#14 - Boundary without a space

Issue - State: closed - Opened by ml-pickle about 7 years ago - 1 comment
Labels: enhancement, wontfix

#13 - Sentence over-splitting on consecutive first-name abbreviations

Issue - State: closed - Opened by fnl about 7 years ago - 2 comments

#12 - Improper split at a first name abbreviation

Issue - State: closed - Opened by fnl about 7 years ago - 1 comment
Labels: enhancement

#11 - Single-line splitting mode is joining sentence across lines

Issue - State: closed - Opened by yucongo over 7 years ago - 3 comments

#10 - Improper segmentation with proper names where middle initial is abbreviated

Issue - State: closed - Opened by christian-storm over 7 years ago - 5 comments
Labels: enhancement

#9 - Failure to split on abberviations

Issue - State: closed - Opened by Klim314 over 9 years ago - 10 comments
Labels: enhancement

#8 - `web_tokenizer`: Unknown error on ",;" sequence.

Issue - State: closed - Opened by geovedi over 9 years ago - 3 comments
Labels: bug

#7 - tokenizer usage

Issue - State: closed - Opened by dineshbvadhia over 9 years ago - 1 comment
Labels: invalid

#6 - segtok in programs

Issue - State: closed - Opened by dineshbvadhia over 9 years ago - 2 comments
Labels: invalid

#5 - Currency and percentages

Issue - State: closed - Opened by dineshbvadhia over 9 years ago - 2 comments
Labels: enhancement

#4 - An issue about segment

Issue - State: closed - Opened by lixiangnlp over 9 years ago - 1 comment
Labels: question

#3 - Kmike py2 with stdio encoding fix

Pull Request - State: closed - Opened by fnl over 9 years ago

#2 - Python 3.3 and 2.7 support

Pull Request - State: closed - Opened by kmike over 9 years ago - 4 comments

#1 - Support Python 2.7

Issue - State: closed - Opened by wvengen almost 10 years ago - 5 comments
Labels: enhancement