Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / nipunsadvilkar/pySBD issues and pull requests

#127 - Bump nltk from 3.5 to 3.9

Pull Request - State: open - Opened by dependabot[bot] about 1 month ago
Labels: dependencies

#126 - Specific string causes segment function to return empty array

Issue - State: open - Opened by NiftyliuS 4 months ago - 1 comment

#125 - `--` breaks segmentation

Issue - State: open - Opened by micycle1 8 months ago

#124 - How to separate sentences when there is no punctuation?

Issue - State: closed - Opened by asasas234 about 1 year ago - 2 comments

#121 - Control characters break German segmentation

Issue - State: open - Opened by edloginova over 1 year ago

#120 - Bug in German splitting with parenthesis

Issue - State: open - Opened by kongyurui over 1 year ago

#119 - pysbd_as_spacy_component.py -- fails to find pysbd module

Issue - State: closed - Opened by hiwaveSupport over 1 year ago - 2 comments

#118 - Does not properly segment within quotations

Issue - State: open - Opened by Hgherzog over 1 year ago - 1 comment

#117 - How is accuracy on OPUS-100 computed?

Issue - State: open - Opened by bminixhofer about 2 years ago - 1 comment

#116 - Shashank

Pull Request - State: closed - Opened by g-nerix about 2 years ago

#115 - Added decorator as required by latest SpaCy

Pull Request - State: closed - Opened by soldni over 2 years ago

#114 - Update PySBD component to support spaCy v3

Pull Request - State: open - Opened by nipunsadvilkar over 2 years ago - 4 comments
Labels: enhancement

#113 - Arabic sentence split on the Arabic comma

Issue - State: open - Opened by ymoslem over 2 years ago

#112 - Does pysbd delete sentences after detection ?

Issue - State: open - Opened by StephennFernandes over 2 years ago

#111 - Update pysbd_as_spacy_component.py

Pull Request - State: open - Opened by guebeln0 over 2 years ago

#108 - Examples of modifying sentence segmentation rules.

Issue - State: closed - Opened by delvinso over 2 years ago - 2 comments

#107 - Question: PHP port feasibility

Issue - State: closed - Opened by Stamenov over 2 years ago - 1 comment

#106 - Does pyBSD correctly handle i.e. ?

Issue - State: open - Opened by LifeIsStrange over 2 years ago

#105 - Support Ukrainian language

Pull Request - State: open - Opened by asivokon over 2 years ago

#104 - English segmenter fails if no space between 2 sentences

Issue - State: closed - Opened by GokulNC almost 3 years ago - 1 comment

#101 - Example not working with Spacy version 3.1 and 3.0.6

Issue - State: open - Opened by Atul997 about 3 years ago - 3 comments

#100 - make spaCy requirement more explicit

Issue - State: open - Opened by thomas-onesourceregulatory about 3 years ago

#99 - Chinese segmenter's unexpected output

Issue - State: open - Opened by zealfory over 3 years ago

#98 - PyBSD vs PolyGlot

Issue - State: open - Opened by GokulNC over 3 years ago

#97 - Fix spacy component example (issue #96)

Pull Request - State: open - Opened by iibrahimli over 3 years ago - 3 comments

#96 - Spacy integration example is broken

Issue - State: open - Opened by iibrahimli over 3 years ago

#95 - Trim sentences

Issue - State: open - Opened by GabrielLin over 3 years ago

#94 - node.js port ?

Issue - State: open - Opened by chopinml over 3 years ago - 3 comments

#93 - sre_constants.error: missing ), unterminated subpattern at position 0

Issue - State: closed - Opened by GabrielLin over 3 years ago - 1 comment

#92 - Catastrophic backtracking in HTMLTagRule

Issue - State: open - Opened by balazik over 3 years ago

#91 - Exception when clean=True in search_for_connected_sentences

Issue - State: open - Opened by balazik over 3 years ago - 1 comment

#90 - How to modify segmentation rules by hand?

Issue - State: closed - Opened by anferico over 3 years ago - 3 comments
Labels: question

#89 - 🐛 Fix trailing period/ellipses with spaces

Pull Request - State: closed - Opened by nipunsadvilkar over 3 years ago
Labels: edge-cases

#87 - ERROR when proceesing this paragraphy

Issue - State: closed - Opened by GabrielLin almost 4 years ago - 2 comments
Labels: bug

#86 - XXXX et al. [2004] error

Issue - State: closed - Opened by GabrielLin almost 4 years ago - 2 comments
Labels: wontfix

#85 - 💚 👷 Update codecov github action

Pull Request - State: closed - Opened by nipunsadvilkar almost 4 years ago

#84 - Slovak lang support

Pull Request - State: closed - Opened by misotrnka almost 4 years ago - 6 comments
Labels: language

#83 - destructive behaviour in edge-cases

Issue - State: closed - Opened by aflueckiger almost 4 years ago - 5 comments
Labels: bug, edge-cases

#82 - 🎨 ✨ Add highlight : NLP-OSS Workshop, EMNLP 2020

Pull Request - State: closed - Opened by nipunsadvilkar almost 4 years ago

#81 - Escape abbreviation before substitution

Pull Request - State: closed - Opened by matthen almost 4 years ago - 4 comments

#80 - re.error: unbalanced parenthesis at position 10

Issue - State: closed - Opened by matthen almost 4 years ago

#79 - Infinite loop?

Issue - State: open - Opened by matthen almost 4 years ago - 2 comments
Labels: bug, edge-cases

#78 - ✨ Better handling consecutive periods and reserved special symbols

Pull Request - State: closed - Opened by nipunsadvilkar about 4 years ago - 2 comments
Labels: bug, enhancement, edge-cases

#77 - 🐛 ✅ Enforce clean=True when doc_type="pdf

Pull Request - State: closed - Opened by nipunsadvilkar about 4 years ago - 1 comment
Labels: bug

#76 - Pysbd just hangs🐛

Issue - State: closed - Opened by kariato about 4 years ago - 8 comments
Labels: help wanted

#75 - 🐛 doc_type='pdf' no longer works

Issue - State: closed - Opened by matthewmcintire about 4 years ago - 1 comment

#74 - 🚑 ✅ Handle Newline character & update tests

Pull Request - State: closed - Opened by nipunsadvilkar about 4 years ago - 1 comment

#73 - Typo in example code on spacy universe

Issue - State: closed - Opened by AtanuCSE about 4 years ago - 1 comment
Labels: duplicate, docs

#72 - Example on website is broken

Issue - State: closed - Opened by cranedroesch about 4 years ago - 1 comment

#71 - ⚡️ ♻️ Abbreviation replacer refactor - approach II

Pull Request - State: closed - Opened by nipunsadvilkar about 4 years ago - 2 comments
Labels: enhancement

#70 - Performance improvements

Pull Request - State: closed - Opened by nipunsadvilkar about 4 years ago - 2 comments
Labels: enhancement, ⚡️ performance

#67 - ✨ 💫 Support Multiple languages

Pull Request - State: closed - Opened by nipunsadvilkar over 4 years ago - 4 comments
Labels: enhancement

#66 - Cleaning the text before segmentation

Issue - State: closed - Opened by databill86 over 4 years ago - 1 comment
Labels: bug, docs

#65 - Update issue templates

Pull Request - State: closed - Opened by nipunsadvilkar over 4 years ago

#64 - Use GitHub actions for CI

Pull Request - State: closed - Opened by nipunsadvilkar over 4 years ago
Labels: enhancement, 👷‍♂️ci & cd

#63 - ✨ 💫 sent `char_span` through with spaCy & regex & ♻️ Refactoring for more languages support

Pull Request - State: closed - Opened by nipunsadvilkar over 4 years ago - 1 comment
Labels: enhancement

#62 - Bump bleach from 3.1.0 to 3.1.4

Pull Request - State: closed - Opened by dependabot[bot] over 4 years ago - 1 comment
Labels: dependencies

#61 - Bump bleach from 3.1.0 to 3.1.2

Pull Request - State: closed - Opened by dependabot[bot] over 4 years ago - 1 comment
Labels: dependencies

#60 - Bump bleach from 3.1.0 to 3.1.1

Pull Request - State: closed - Opened by dependabot[bot] over 4 years ago - 1 comment
Labels: dependencies

#59 - Handle irregularities between pySBD & pySBD + spaCy sentence output

Issue - State: closed - Opened by nipunsadvilkar over 4 years ago - 1 comment
Labels: bug, enhancement

#58 - Long number stalls process.

Issue - State: closed - Opened by mollerhoj over 4 years ago - 1 comment
Labels: bug

#57 - Looses text when breaking into sentences

Issue - State: closed - Opened by ghost over 4 years ago - 1 comment
Labels: duplicate

#56 - Regexp issues

Issue - State: closed - Opened by mollerhoj almost 5 years ago - 4 comments

#55 - Different segmentation with Spacy and when using pySBD directly

Issue - State: closed - Opened by nmstoker almost 5 years ago - 6 comments
Labels: bug, help wanted

#54 - Carriage return fix

Pull Request - State: closed - Opened by dakinggg almost 5 years ago - 3 comments
Labels: bug, enhancement

#53 - Incorrect text span start and end returned

Issue - State: closed - Opened by dakinggg almost 5 years ago - 1 comment
Labels: bug

#52 - Apply pdf rules

Pull Request - State: closed - Opened by neogyzhu almost 5 years ago - 1 comment
Labels: invalid, wontfix

#51 - Extend normal text rules

Pull Request - State: closed - Opened by neogyzhu almost 5 years ago - 1 comment

#50 - Reduce some calls to re.sub

Pull Request - State: closed - Opened by dakinggg almost 5 years ago - 4 comments
Labels: enhancement

#49 - Incorrect text span start and end returned

Issue - State: closed - Opened by dakinggg almost 5 years ago - 7 comments
Labels: bug

#48 - 🐛 Fix unbalanced parenthesis

Pull Request - State: closed - Opened by nipunsadvilkar almost 5 years ago
Labels: bug

#47 - "unbalanced parenthesis" error

Issue - State: closed - Opened by dumitrescustefan almost 5 years ago
Labels: bug

#46 - fixed typo in README example

Pull Request - State: closed - Opened by Huffon almost 5 years ago - 1 comment

#45 - Text surrounded by quotes seems to not get segmented?

Issue - State: closed - Opened by RuABraun almost 5 years ago - 1 comment
Labels: question

#44 - Shouldn't colons cause a sentence split?

Issue - State: closed - Opened by RuABraun almost 5 years ago - 2 comments
Labels: question

#43 - fixed typo in readme

Pull Request - State: closed - Opened by spate141 almost 5 years ago - 1 comment

#42 - ✨ `pysbd` as a spacy component through entrypoints

Pull Request - State: closed - Opened by nipunsadvilkar almost 5 years ago

#41 - Performance improvement?

Issue - State: closed - Opened by dakinggg almost 5 years ago - 7 comments
Labels: question

#40 - ✨Add `char_span` functionality

Pull Request - State: closed - Opened by nipunsadvilkar almost 5 years ago
Labels: enhancement

#39 - Question marks at the end swallowed

Issue - State: closed - Opened by dakinggg almost 5 years ago - 11 comments
Labels: bug, edge-cases

#38 - 🐛 Handle text with only punctuations & ! at EOL

Pull Request - State: closed - Opened by nipunsadvilkar almost 5 years ago

#37 - Last sentence disappearing from input

Issue - State: closed - Opened by dakinggg almost 5 years ago
Labels: bug, edge-cases

#36 - Question marks swallowed from input

Issue - State: closed - Opened by dakinggg almost 5 years ago
Labels: bug, edge-cases

#35 - ✨ ✅ Handle intermittent punctuations

Pull Request - State: closed - Opened by nipunsadvilkar almost 5 years ago
Labels: enhancement

#34 - Starting period edge case

Issue - State: closed - Opened by dakinggg almost 5 years ago
Labels: bug, enhancement

#33 - Debugging print statements left in

Issue - State: closed - Opened by dakinggg almost 5 years ago - 1 comment
Labels: bug, edge-cases

#32 - 🐛 BugFixes on abbreviation, list_item_replacer

Pull Request - State: closed - Opened by nipunsadvilkar almost 5 years ago
Labels: bug, enhancement

#31 - crashing on input

Issue - State: closed - Opened by dakinggg almost 5 years ago - 2 comments
Labels: bug, edge-cases

#30 - periods replacing characters

Issue - State: closed - Opened by dakinggg almost 5 years ago - 1 comment
Labels: bug, edge-cases

#29 - character getting swallowed

Issue - State: closed - Opened by dakinggg almost 5 years ago
Labels: bug, edge-cases

#28 - 🐛 Fix scanlists

Pull Request - State: closed - Opened by nipunsadvilkar almost 5 years ago
Labels: bug

#27 - IndexError: list index out of range

Issue - State: closed - Opened by isabelcachola almost 5 years ago - 1 comment
Labels: bug, edge-cases

#26 - 🚚 Renaming package & ♻️Refactor scripts

Pull Request - State: closed - Opened by nipunsadvilkar almost 5 years ago
Labels: enhancement

#25 - Add README.md, setup file

Pull Request - State: closed - Opened by nipunsadvilkar over 5 years ago

#24 - Add SBD tests from Pragmatic Segmenter

Pull Request - State: closed - Opened by nipunsadvilkar over 5 years ago - 1 comment
Labels: enhancement

#10 - Integrating Inline formatting rule

Issue - State: closed - Opened by nipunsadvilkar over 7 years ago

#8 - Removing HTML tags rule

Issue - State: closed - Opened by nipunsadvilkar over 7 years ago