Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / nltk/nltk issues and pull requests

#3325 - Output is "Stag" instead of "Stage"

Issue - State: open - Opened by ravishankar-cloud 10 days ago - 2 comments

#3324 - Unable to use word_tokenize function

Issue - State: closed - Opened by beingEniola 16 days ago - 4 comments

#3323 - Adressing issue "nltk.metrics.aline feature map missing IPA symbols #3285"

Pull Request - State: closed - Opened by WilliamPLaCroix 17 days ago - 4 comments
Labels: tagger, parsing, GUI, metrics

#3322 - Mention major Morphy change in ChangeLog

Pull Request - State: closed - Opened by ekaf 20 days ago

#3321 - Synsets method giving different results between nltk 3.8.1 and nltk 3.9

Issue - State: closed - Opened by ruhayat 20 days ago - 2 comments

#3320 - NLTK word tokeniser splits archaic "[verb]'d" into "[verb]" and "'d"

Issue - State: closed - Opened by Crissium 22 days ago - 3 comments

#3319 - PunktTokenizer not visible

Issue - State: closed - Opened by pricoptudor 24 days ago

#3318 - remove ci workaround for issue in python 3.12.4

Pull Request - State: closed - Opened by purificant 26 days ago
Labels: CI

#3316 - punkt model for Arabic needed

Issue - State: open - Opened by snoucair about 1 month ago - 1 comment

#3315 - Chicken-and-egg problem with downloading wordnet in GitHub Action

Issue - State: closed - Opened by peterbe about 1 month ago - 1 comment

#3314 - NLTK Tokenizer not working in Latest Tag [ 3.9.1 ]

Issue - State: closed - Opened by anuraged51a about 1 month ago - 1 comment

#3313 - Document breaking change in ChangeLog

Pull Request - State: closed - Opened by ekaf about 1 month ago

#3312 - Lookup error issue in nltk even with new version 3.9.1, similar to PR #3308

Issue - State: closed - Opened by kirupang-code about 1 month ago - 5 comments

#3311 - Load Punkt collocations as a set

Pull Request - State: closed - Opened by ekaf about 1 month ago
Labels: tokenizer

#3309 - Fix bug in WordNetLemmatizer

Pull Request - State: closed - Opened by ekaf about 1 month ago - 1 comment
Labels: stem/lemma

#3308 - nltk 3.9.0: Lookup error on import "Resource wordnet not found"

Issue - State: closed - Opened by danjac about 1 month ago - 16 comments
Labels: critical

#3307 - Ensure https download

Pull Request - State: closed - Opened by ekaf about 1 month ago

#3306 - Don't add PY3 in data path

Pull Request - State: closed - Opened by ekaf about 1 month ago - 2 comments
Labels: corpus

#3304 - Add doctests in data.py

Pull Request - State: closed - Opened by ekaf about 1 month ago - 1 comment

#3303 - Use LRU cache when getting POS tagger

Pull Request - State: closed - Opened by strangetom about 2 months ago - 1 comment
Labels: tagger

#3302 - Don't break old pickle requests

Pull Request - State: closed - Opened by ekaf about 2 months ago

#3301 - nltk version 3.8.2 is no longer available on PyPi

Issue - State: closed - Opened by tisnik about 2 months ago - 46 comments

#3300 - Use a lru cache when instantiating PunktTokenizer

Pull Request - State: closed - Opened by antoniomika about 2 months ago - 3 comments
Labels: tokenizer

#3299 - Performance regression of `word_tokenize` in NLTK 3.8.2

Issue - State: closed - Opened by juhoinkinen about 2 months ago - 7 comments

#3297 - [BUG] punkt_tab downloading is taking a lot of time.

Issue - State: closed - Opened by payaljain2003 about 2 months ago - 10 comments

#3296 - Fix for https://github.com/nltk/nltk/issues/3294

Pull Request - State: closed - Opened by soras about 2 months ago - 3 comments
Labels: tokenizer

#3295 - Ensure https download

Pull Request - State: closed - Opened by ekaf about 2 months ago - 2 comments

#3293 - [BUG] punkt_tab breaking change

Issue - State: closed - Opened by robcaulk about 2 months ago - 23 comments

#3292 - fix whitespace

Pull Request - State: closed - Opened by purificant about 2 months ago

#3291 - BLEU Score Exceeds 1 for Certain Test Cases

Issue - State: closed - Opened by TheBruh141 2 months ago - 1 comment

#3290 - Prevent data.load from unpickling classes or functions

Pull Request - State: closed - Opened by ekaf 2 months ago - 3 comments

#3289 - ⚡️ Speed up windowdiff() by 122% in nltk/metrics/segmentation.py

Pull Request - State: open - Opened by ihitamandal 2 months ago - 1 comment
Labels: metrics

#3288 - Incomplete Language Support in NLTK for Open Multilingual WordNet

Issue - State: closed - Opened by zarkua 2 months ago - 1 comment

#3287 - NLTK Tokenizer Crashes

Issue - State: closed - Opened by SesaYash 3 months ago - 25 comments

#3286 - Pickle-free maxent chunkers

Pull Request - State: closed - Opened by ekaf 3 months ago - 5 comments
Labels: classifier

#3285 - nltk.metrics.aline feature map missing IPA symbols

Issue - State: closed - Opened by TomerYS 3 months ago - 2 comments

#3284 - nltk.metrics.aline delta error

Issue - State: open - Opened by TomerYS 3 months ago - 2 comments

#3283 - Load PunktParameters from tab files

Pull Request - State: closed - Opened by ekaf 3 months ago - 13 comments
Labels: corpus, tokenizer, metrics, sentiment

#3282 - Replaced black with ruff in pre-commit hook

Pull Request - State: open - Opened by devesh-2002 3 months ago - 2 comments

#3281 - CI retest Github short cut no longer working

Issue - State: open - Opened by alvations 3 months ago
Labels: CI, internals

#3280 - fix python version for pre-commit as a workaround

Pull Request - State: closed - Opened by purificant 3 months ago - 1 comment
Labels: CI

#3279 - Put pyupgrade back when Github default precommit hook uses v3.14.5

Issue - State: closed - Opened by alvations 3 months ago - 1 comment
Labels: pythonic, CI, internals

#3278 - Replacing black with ruff in CI/CD precommit hook

Issue - State: open - Opened by alvations 3 months ago - 3 comments
Labels: good first issue, nice idea, CI

#3277 - Towards pickle-free Punkt

Pull Request - State: closed - Opened by ekaf 3 months ago - 4 comments
Labels: tokenizer, critical

#3276 - ntlk unsafe deserialization vulnerability

Issue - State: closed - Opened by JohnJyong 3 months ago - 2 comments
Labels: critical

#3275 - Summing Ngram LM probabilities requires math.fsum

Issue - State: open - Opened by alvations 3 months ago
Labels: language-model, bug

#3274 - Fixing CI/CD

Pull Request - State: closed - Opened by alvations 3 months ago - 5 comments
Labels: corpus, tagger, parsing, stem/lemma, metrics

#3273 - Pyupgrade failing on latest commit

Issue - State: closed - Opened by alvations 3 months ago - 5 comments

#3272 - Pickless-less tagsets helper

Pull Request - State: closed - Opened by alvations 3 months ago - 4 comments
Labels: critical

#3271 - Removed PickleCorpusView

Pull Request - State: closed - Opened by alvations 3 months ago - 1 comment
Labels: corpus, critical

#3270 - Pickle-less average perceptron tagger

Pull Request - State: closed - Opened by alvations 3 months ago - 5 comments
Labels: tagger, critical

#3269 - Typo in nltk/tbl/demo.py?

Issue - State: open - Opened by mcepl 3 months ago

#3268 - fix miscellaneous minor misspelled words

Pull Request - State: closed - Opened by timoteostewart 3 months ago
Labels: tokenizer

#3267 - Juan1

Pull Request - State: closed - Opened by Jus1311 3 months ago

#3266 - Remote code execution vulnerability in NLTK

Issue - State: closed - Opened by Dunedan 3 months ago - 93 comments
Labels: critical

#3265 - Fix k-alpha for expected disagreement equals 0

Pull Request - State: closed - Opened by vera-bernhard 4 months ago - 1 comment
Labels: metrics

#3263 - missing delimiter for 'u' glob qualifier

Issue - State: closed - Opened by bijubjs 4 months ago - 2 comments
Labels: invalid

#3263 - missing delimiter for 'u' glob qualifier

Issue - State: open - Opened by bijubjs 4 months ago - 1 comment

#3261 - Unable to get local issuer certificate CentOS_7

Issue - State: open - Opened by francoiscap 4 months ago

#3261 - Unable to get local issuer certificate CentOS_7

Issue - State: open - Opened by francoiscap 4 months ago

#3260 - TreebankWordDetokenizer inverting order of tokens

Issue - State: open - Opened by jmccrae 4 months ago - 1 comment

#3260 - TreebankWordDetokenizer inverting order of tokens

Issue - State: open - Opened by jmccrae 4 months ago - 1 comment

#3259 - add gensim@da2f388 for support py3.12

Pull Request - State: open - Opened by tavallaie 4 months ago

#3259 - add gensim@da2f388 for support py3.12

Pull Request - State: open - Opened by tavallaie 4 months ago

#3258 - Multiprocessing of NLTK ngrams?

Issue - State: open - Opened by km5ar 4 months ago - 1 comment

#3258 - Multiprocessing of NLTK ngrams?

Issue - State: open - Opened by km5ar 4 months ago - 1 comment

#3257 - Develop text lemmatize function

Pull Request - State: open - Opened by Sion1225 4 months ago - 18 comments
Labels: corpus, stem/lemma

#3257 - Develop text lemmatize function

Pull Request - State: open - Opened by Sion1225 4 months ago - 18 comments
Labels: corpus, stem/lemma

#3256 - Add functionality to return the lemmas of words used in a corpus.

Issue - State: open - Opened by Sion1225 4 months ago - 1 comment

#3256 - Add functionality to return the lemmas of words used in a corpus.

Issue - State: open - Opened by Sion1225 4 months ago - 1 comment

#3255 - Not able to install nltk on termux, getting the following error:

Issue - State: closed - Opened by divyanshluthra 4 months ago - 1 comment

#3253 - Corpora used to train Punkt Segmenter in German

Issue - State: open - Opened by AugustinErnoult 5 months ago - 1 comment

#3253 - Corpora used to train Punkt Segmenter in German

Issue - State: open - Opened by AugustinErnoult 5 months ago - 1 comment

#3252 - Inconsistent output between Windows and Mac

Issue - State: open - Opened by Hongao0611 5 months ago

#3252 - Inconsistent output between Windows and Mac

Issue - State: open - Opened by Hongao0611 5 months ago

#3251 - MS Visual Studio C and C++ build during pip install fails.

Issue - State: open - Opened by oldspammer 5 months ago - 2 comments

#3251 - MS Visual Studio C and C++ build during pip install fails.

Issue - State: open - Opened by oldspammer 5 months ago - 2 comments

#3250 - Normalize function in sentence_bleu

Issue - State: open - Opened by Razbolt 5 months ago - 4 comments

#3250 - Normalize function in sentence_bleu

Issue - State: open - Opened by Razbolt 5 months ago - 10 comments

#3249 - SnowballStemmer: how to avoid transliteration?

Issue - State: open - Opened by satyrmipt 5 months ago - 1 comment

#3249 - SnowballStemmer: how to avoid transliteration?

Issue - State: open - Opened by satyrmipt 5 months ago - 1 comment

#3248 - Downloader race condition with multiple processes

Issue - State: open - Opened by naktinis 5 months ago - 5 comments

#3248 - Downloader race condition with multiple processes

Issue - State: open - Opened by naktinis 5 months ago - 6 comments
Labels: cluster, nltk_data, critical, needs review

#3247 - Write downloaded model files atomically

Pull Request - State: open - Opened by naktinis 5 months ago - 7 comments

#3246 - Failed to run post install script for guardrails/toxic_language

Issue - State: closed - Opened by xinzaifeixiang1992 6 months ago - 2 comments

#3245 - Avoid duplicate output in acyclic_breadth_first

Pull Request - State: closed - Opened by ekaf 6 months ago - 4 comments

#3244 - Duplicates in wordnet hypernyms closure

Issue - State: closed - Opened by ekaf 6 months ago