Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / emorynlp/nlp4j-tokenization issues and pull requests

#12 - tokenization splitting terms with & in them

Issue - State: closed - Opened by ggiavelli over 5 years ago - 1 comment

#11 - Twitter users and hashtags with leading numbers

Issue - State: open - Opened by cakelly about 8 years ago

#10 - Malformed contractions not being split

Issue - State: open - Opened by cakelly about 8 years ago

#9 - Tokenization of html UTF-8 chars

Issue - State: closed - Opened by cakelly about 8 years ago - 2 comments

#8 - Tokens with fancy quotes are being merged

Issue - State: open - Opened by cakelly about 8 years ago

#7 - Tokenizer java.lang.StringIndexOutOfBoundsException

Issue - State: closed - Opened by nartz over 8 years ago - 4 comments

#6 - Tokenizing dates ranges

Issue - State: closed - Opened by mzhai2 over 8 years ago - 2 comments

#5 - Symbol offset and minor bugfixes.

Pull Request - State: closed - Opened by spraynasal over 8 years ago - 1 comment

#4 - Fixed offsets in addSymbol tokenization method

Pull Request - State: closed - Opened by spraynasal over 8 years ago - 3 comments

#3 - Handle final "y" in english word tokenization

Pull Request - State: closed - Opened by spraynasal over 8 years ago - 1 comment

#2 - Local

Pull Request - State: closed - Opened by amit-deshmane over 8 years ago - 1 comment

#1 - Original text preservation

Issue - State: closed - Opened by capdevc almost 9 years ago - 1 comment