Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / nltk/nltk issues and pull requests
#3326 - Added Table of Contents and new section on POS Tagging in README
Pull Request -
State: open - Opened by SamchenUF 4 days ago
#3325 - Output is "Stag" instead of "Stage"
Issue -
State: open - Opened by ravishankar-cloud 10 days ago
- 2 comments
#3324 - Unable to use word_tokenize function
Issue -
State: closed - Opened by beingEniola 16 days ago
- 4 comments
#3323 - Adressing issue "nltk.metrics.aline feature map missing IPA symbols #3285"
Pull Request -
State: closed - Opened by WilliamPLaCroix 17 days ago
- 4 comments
Labels: tagger, parsing, GUI, metrics
#3322 - Mention major Morphy change in ChangeLog
Pull Request -
State: closed - Opened by ekaf 20 days ago
#3321 - Synsets method giving different results between nltk 3.8.1 and nltk 3.9
Issue -
State: closed - Opened by ruhayat 20 days ago
- 2 comments
#3320 - NLTK word tokeniser splits archaic "[verb]'d" into "[verb]" and "'d"
Issue -
State: closed - Opened by Crissium 22 days ago
- 3 comments
#3319 - PunktTokenizer not visible
Issue -
State: closed - Opened by pricoptudor 24 days ago
#3318 - remove ci workaround for issue in python 3.12.4
Pull Request -
State: closed - Opened by purificant 26 days ago
Labels: CI
#3317 - KneserNeyInterpolated taking an unreasonable amount of time to generate text
Issue -
State: open - Opened by owo 26 days ago
- 3 comments
#3316 - punkt model for Arabic needed
Issue -
State: open - Opened by snoucair about 1 month ago
- 1 comment
#3315 - Chicken-and-egg problem with downloading wordnet in GitHub Action
Issue -
State: closed - Opened by peterbe about 1 month ago
- 1 comment
#3314 - NLTK Tokenizer not working in Latest Tag [ 3.9.1 ]
Issue -
State: closed - Opened by anuraged51a about 1 month ago
- 1 comment
#3313 - Document breaking change in ChangeLog
Pull Request -
State: closed - Opened by ekaf about 1 month ago
#3312 - Lookup error issue in nltk even with new version 3.9.1, similar to PR #3308
Issue -
State: closed - Opened by kirupang-code about 1 month ago
- 5 comments
#3311 - Load Punkt collocations as a set
Pull Request -
State: closed - Opened by ekaf about 1 month ago
Labels: tokenizer
#3310 - `load_punkt_params()` loads `PunktParameters.collocations` as list instead of set
Issue -
State: closed - Opened by ryanamannion about 1 month ago
- 1 comment
#3309 - Fix bug in WordNetLemmatizer
Pull Request -
State: closed - Opened by ekaf about 1 month ago
- 1 comment
Labels: stem/lemma
#3308 - nltk 3.9.0: Lookup error on import "Resource wordnet not found"
Issue -
State: closed - Opened by danjac about 1 month ago
- 16 comments
Labels: critical
#3307 - Ensure https download
Pull Request -
State: closed - Opened by ekaf about 1 month ago
#3306 - Don't add PY3 in data path
Pull Request -
State: closed - Opened by ekaf about 1 month ago
- 2 comments
Labels: corpus
#3305 - nltk.find:No such file or directory: '/root/nltk_data/tokenizers/punkt/PY3_tab'
Issue -
State: closed - Opened by hpx502766238 about 1 month ago
- 9 comments
#3304 - Add doctests in data.py
Pull Request -
State: closed - Opened by ekaf about 1 month ago
- 1 comment
#3303 - Use LRU cache when getting POS tagger
Pull Request -
State: closed - Opened by strangetom about 2 months ago
- 1 comment
Labels: tagger
#3302 - Don't break old pickle requests
Pull Request -
State: closed - Opened by ekaf about 2 months ago
#3301 - nltk version 3.8.2 is no longer available on PyPi
Issue -
State: closed - Opened by tisnik about 2 months ago
- 46 comments
#3300 - Use a lru cache when instantiating PunktTokenizer
Pull Request -
State: closed - Opened by antoniomika about 2 months ago
- 3 comments
Labels: tokenizer
#3299 - Performance regression of `word_tokenize` in NLTK 3.8.2
Issue -
State: closed - Opened by juhoinkinen about 2 months ago
- 7 comments
#3298 - [BUG] _pickle.UnpicklingError: global 'copy_reg._reconstructor' is forbidden
Issue -
State: closed - Opened by shrey-gupta-2809 about 2 months ago
- 1 comment
#3297 - [BUG] punkt_tab downloading is taking a lot of time.
Issue -
State: closed - Opened by payaljain2003 about 2 months ago
- 10 comments
#3296 - Fix for https://github.com/nltk/nltk/issues/3294
Pull Request -
State: closed - Opened by soras about 2 months ago
- 3 comments
Labels: tokenizer
#3295 - Ensure https download
Pull Request -
State: closed - Opened by ekaf about 2 months ago
- 2 comments
#3294 - [BUG] NLTK's PunktTokenizer fails to initialize due to UnicodeDecodeError [Windows specific]
Issue -
State: closed - Opened by soras about 2 months ago
- 2 comments
#3293 - [BUG] punkt_tab breaking change
Issue -
State: closed - Opened by robcaulk about 2 months ago
- 23 comments
#3292 - fix whitespace
Pull Request -
State: closed - Opened by purificant about 2 months ago
#3291 - BLEU Score Exceeds 1 for Certain Test Cases
Issue -
State: closed - Opened by TheBruh141 2 months ago
- 1 comment
#3290 - Prevent data.load from unpickling classes or functions
Pull Request -
State: closed - Opened by ekaf 2 months ago
- 3 comments
#3289 - ⚡️ Speed up windowdiff() by 122% in nltk/metrics/segmentation.py
Pull Request -
State: open - Opened by ihitamandal 2 months ago
- 1 comment
Labels: metrics
#3288 - Incomplete Language Support in NLTK for Open Multilingual WordNet
Issue -
State: closed - Opened by zarkua 2 months ago
- 1 comment
#3287 - NLTK Tokenizer Crashes
Issue -
State: closed - Opened by SesaYash 3 months ago
- 25 comments
#3286 - Pickle-free maxent chunkers
Pull Request -
State: closed - Opened by ekaf 3 months ago
- 5 comments
Labels: classifier
#3285 - nltk.metrics.aline feature map missing IPA symbols
Issue -
State: closed - Opened by TomerYS 3 months ago
- 2 comments
#3284 - nltk.metrics.aline delta error
Issue -
State: open - Opened by TomerYS 3 months ago
- 2 comments
#3283 - Load PunktParameters from tab files
Pull Request -
State: closed - Opened by ekaf 3 months ago
- 13 comments
Labels: corpus, tokenizer, metrics, sentiment
#3282 - Replaced black with ruff in pre-commit hook
Pull Request -
State: open - Opened by devesh-2002 3 months ago
- 2 comments
#3281 - CI retest Github short cut no longer working
Issue -
State: open - Opened by alvations 3 months ago
Labels: CI, internals
#3280 - fix python version for pre-commit as a workaround
Pull Request -
State: closed - Opened by purificant 3 months ago
- 1 comment
Labels: CI
#3279 - Put pyupgrade back when Github default precommit hook uses v3.14.5
Issue -
State: closed - Opened by alvations 3 months ago
- 1 comment
Labels: pythonic, CI, internals
#3278 - Replacing black with ruff in CI/CD precommit hook
Issue -
State: open - Opened by alvations 3 months ago
- 3 comments
Labels: good first issue, nice idea, CI
#3277 - Towards pickle-free Punkt
Pull Request -
State: closed - Opened by ekaf 3 months ago
- 4 comments
Labels: tokenizer, critical
#3276 - ntlk unsafe deserialization vulnerability
Issue -
State: closed - Opened by JohnJyong 3 months ago
- 2 comments
Labels: critical
#3275 - Summing Ngram LM probabilities requires math.fsum
Issue -
State: open - Opened by alvations 3 months ago
Labels: language-model, bug
#3274 - Fixing CI/CD
Pull Request -
State: closed - Opened by alvations 3 months ago
- 5 comments
Labels: corpus, tagger, parsing, stem/lemma, metrics
#3273 - Pyupgrade failing on latest commit
Issue -
State: closed - Opened by alvations 3 months ago
- 5 comments
#3272 - Pickless-less tagsets helper
Pull Request -
State: closed - Opened by alvations 3 months ago
- 4 comments
Labels: critical
#3271 - Removed PickleCorpusView
Pull Request -
State: closed - Opened by alvations 3 months ago
- 1 comment
Labels: corpus, critical
#3270 - Pickle-less average perceptron tagger
Pull Request -
State: closed - Opened by alvations 3 months ago
- 5 comments
Labels: tagger, critical
#3269 - Typo in nltk/tbl/demo.py?
Issue -
State: open - Opened by mcepl 3 months ago
#3268 - fix miscellaneous minor misspelled words
Pull Request -
State: closed - Opened by timoteostewart 3 months ago
Labels: tokenizer
#3267 - Juan1
Pull Request -
State: closed - Opened by Jus1311 3 months ago
#3266 - Remote code execution vulnerability in NLTK
Issue -
State: closed - Opened by Dunedan 3 months ago
- 93 comments
Labels: critical
#3265 - Fix k-alpha for expected disagreement equals 0
Pull Request -
State: closed - Opened by vera-bernhard 4 months ago
- 1 comment
Labels: metrics
#3264 - ZeroDivisionError when computing Krippendorff's alpha
Issue -
State: closed - Opened by vera-bernhard 4 months ago
#3263 - missing delimiter for 'u' glob qualifier
Issue -
State: closed - Opened by bijubjs 4 months ago
- 2 comments
Labels: invalid
#3262 - StanfordNERTagger couldn't process words contain space in.
Issue -
State: open - Opened by Sion1225 4 months ago
#3261 - Unable to get local issuer certificate CentOS_7
Issue -
State: open - Opened by francoiscap 4 months ago
#3260 - TreebankWordDetokenizer inverting order of tokens
Issue -
State: open - Opened by jmccrae 4 months ago
- 1 comment
#3259 - add gensim@da2f388 for support py3.12
Pull Request -
State: open - Opened by tavallaie 4 months ago
#3258 - Multiprocessing of NLTK ngrams?
Issue -
State: open - Opened by km5ar 4 months ago
- 1 comment
#3257 - Develop text lemmatize function
Pull Request -
State: open - Opened by Sion1225 4 months ago
- 18 comments
Labels: corpus, stem/lemma
#3256 - Add functionality to return the lemmas of words used in a corpus.
Issue -
State: open - Opened by Sion1225 4 months ago
- 1 comment
#3255 - Not able to install nltk on termux, getting the following error:
Issue -
State: closed - Opened by divyanshluthra 4 months ago
- 1 comment
#3254 - Error loading punkt: <urlopen error [SSL: [nltk_data] CERTIFICATE_VERIFY_FAILED] certificate verify failed: [nltk_data] unable to get local issuer certificate (_ssl.c:1000)>
Issue -
State: closed - Opened by slpminn 4 months ago
- 2 comments
#3253 - Corpora used to train Punkt Segmenter in German
Issue -
State: open - Opened by AugustinErnoult 5 months ago
- 1 comment
#3252 - Inconsistent output between Windows and Mac
Issue -
State: open - Opened by Hongao0611 5 months ago
#3251 - MS Visual Studio C and C++ build during pip install fails.
Issue -
State: open - Opened by oldspammer 5 months ago
- 2 comments
#3250 - Normalize function in sentence_bleu
Issue -
State: open - Opened by Razbolt 5 months ago
- 10 comments
#3249 - SnowballStemmer: how to avoid transliteration?
Issue -
State: open - Opened by satyrmipt 5 months ago
- 1 comment
#3248 - Downloader race condition with multiple processes
Issue -
State: open - Opened by naktinis 5 months ago
- 6 comments
Labels: cluster, nltk_data, critical, needs review
#3247 - Write downloaded model files atomically
Pull Request -
State: open - Opened by naktinis 5 months ago
- 7 comments
#3246 - Failed to run post install script for guardrails/toxic_language
Issue -
State: closed - Opened by xinzaifeixiang1992 6 months ago
- 2 comments
#3245 - Avoid duplicate output in acyclic_breadth_first
Pull Request -
State: closed - Opened by ekaf 6 months ago
- 4 comments
#3244 - Duplicates in wordnet hypernyms closure
Issue -
State: closed - Opened by ekaf 6 months ago
#3243 - Questions about Copilot + Open Source Software Hierarchy
Issue -
State: closed - Opened by liaochris 6 months ago