Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / Unstructured-IO/unstructured-inference issues and pull requests

#299 - Refactor: remove image extraction related code

Pull Request - State: closed - Opened by christinestraub 12 months ago

#298 - chore(deps): Bump ruff from 0.1.5 to 0.1.6 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] 12 months ago - 2 comments
Labels: dependencies, python

#297 - chore(deps): Bump mypy from 1.7.0 to 1.7.1 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] 12 months ago - 1 comment
Labels: dependencies, python

#296 - chore(deps): Bump httpx from 0.25.1 to 0.25.2 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] 12 months ago - 1 comment
Labels: dependencies, python

#295 - Feat/chipper repetitions

Pull Request - State: closed - Opened by ajjimeno 12 months ago - 4 comments

#294 - Refactor: remove `pdfminer` related code

Pull Request - State: closed - Opened by christinestraub 12 months ago

#292 - Feat/improve chipper bounding boxes

Pull Request - State: closed - Opened by ajjimeno 12 months ago - 3 comments

#291 - enhancement: get model name and initialization params externally

Pull Request - State: closed - Opened by qued almost 1 year ago

#290 - Chore: nit of table init logger to show up the log info

Pull Request - State: closed - Opened by yuming-long about 1 year ago

#284 - enhancement: bring ruff params in line with unstructured

Pull Request - State: closed - Opened by qued about 1 year ago

#282 - Jj/warnings

Pull Request - State: closed - Opened by Coniferish about 1 year ago - 1 comment

#280 - ci: update ingest script to match new folder structure

Pull Request - State: closed - Opened by badGarnet about 1 year ago

#279 - fix: reformat chipper table element to match standard format

Pull Request - State: closed - Opened by badGarnet about 1 year ago

#278 - chore: remove logger info for chipper since its private

Pull Request - State: closed - Opened by yuming-long about 1 year ago

#277 - chore: rolling slack invite link in chipper logger info

Pull Request - State: closed - Opened by yuming-long about 1 year ago

#276 - enhancement: better error message on image/page extraction mismatch

Pull Request - State: closed - Opened by qued about 1 year ago

#275 - chore: change the default model to yolox

Pull Request - State: closed - Opened by awalker4 about 1 year ago

#274 - chore(deps): Bump mypy from 1.6.0 to 1.6.1 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies, python

#273 - chore(deps): Bump black from 23.9.1 to 23.10.1 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies, python

#272 - chore(deps): Bump pytest-mock from 3.11.1 to 3.12.0 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies, python

#271 - chore(deps): Bump ruff from 0.0.292 to 0.1.3 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies, python

#270 - partition_groups_from_regions misses sources

Issue - State: open - Opened by benjats07 about 1 year ago

#269 - build: update ingest installation invocation

Pull Request - State: closed - Opened by qued about 1 year ago

#268 - Feat/field to store inner elements

Pull Request - State: closed - Opened by benjats07 about 1 year ago - 2 comments

#267 - Feat/chipper gpu float16

Pull Request - State: closed - Opened by ajjimeno about 1 year ago

#266 - fix:excess transfer parameters are not processed

Pull Request - State: closed - Opened by 2710932616 about 1 year ago

#264 - chore: streamline kwarg handling

Pull Request - State: closed - Opened by qued about 1 year ago

#263 - Feat: add more output format for table inference

Pull Request - State: closed - Opened by badGarnet about 1 year ago

#262 - fix: annotate image is fixed after refactor

Pull Request - State: closed - Opened by benjats07 about 1 year ago

#261 - fix: chipper memory problem with long documents on Intel CPUs

Pull Request - State: closed - Opened by ajjimeno about 1 year ago - 4 comments

#260 - Fix/no order chipper elements

Pull Request - State: closed - Opened by benjats07 about 1 year ago

#259 - bug: Fix layout sorting when bbox is None (ChipperV1)

Issue - State: closed - Opened by 0-hero about 1 year ago - 3 comments

#258 - fix: memory leak on chipper processor, beam search parameters, and bbox bug

Pull Request - State: closed - Opened by ajjimeno about 1 year ago - 1 comment

#257 - fix: invalid zoom lead to cv2 error

Pull Request - State: closed - Opened by badGarnet about 1 year ago

#256 - Chore: allow table model to accept optional OCR data

Pull Request - State: closed - Opened by yuming-long about 1 year ago

#255 - fix(deps): add onnx as base requirement

Pull Request - State: closed - Opened by cragwolfe about 1 year ago

#254 - chipperv2 unusable due to private model

Issue - State: closed - Opened by 0-hero about 1 year ago - 2 comments

#253 - Fix PDFMiner bug

Pull Request - State: closed - Opened by ajjimeno about 1 year ago - 2 comments

#252 - Faster version of Chipper

Pull Request - State: closed - Opened by ajjimeno about 1 year ago - 2 comments

#251 - Chore: remove layout parser dependency and detectron2

Pull Request - State: closed - Opened by yuming-long about 1 year ago - 1 comment

#250 - chore: chipper model name should point to latest chipper version

Pull Request - State: closed - Opened by qued about 1 year ago

#249 - build: improve packaging

Pull Request - State: closed - Opened by qued about 1 year ago

#248 - Bug when loading Chipper model

Pull Request - State: closed - Opened by ajjimeno about 1 year ago - 1 comment

#247 - test issue from inference

Issue - State: closed - Opened by qued about 1 year ago

#246 - Core 1941/super gradients integration

Pull Request - State: closed - Opened by pravin-unstructured about 1 year ago

#244 - Fix: sort elements extracted by `pdfminer`

Pull Request - State: closed - Opened by christinestraub about 1 year ago

#243 - chore(deps-dev): Bump ipython from 8.12.3 to 8.16.1 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies, python

#242 - Feat/download quantized model

Pull Request - State: closed - Opened by benjats07 about 1 year ago

#241 - faster chipper

Pull Request - State: closed - Opened by ajjimeno about 1 year ago

#240 - Add parent bbox from children location

Pull Request - State: closed - Opened by ajjimeno about 1 year ago

#239 - feat: Apple Silicon support for Chipper Model

Issue - State: open - Opened by dsanmart about 1 year ago - 4 comments

#238 - chore(deps): Bump actions/checkout from 3 to 4

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago
Labels: dependencies, github_actions

#237 - chore(deps): Bump huggingface-hub from 0.17.2 to 0.17.3 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 3 comments
Labels: dependencies, python

#236 - chore(deps): Bump opencv-python from 4.8.0.76 to 4.8.1.78 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies, python

#235 - chore(deps): Bump ruff from 0.0.290 to 0.0.291 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies, python

#234 - chore(deps): Bump rapidfuzz from 3.3.0 to 3.3.1 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies, python

#233 - chore(deps-dev): Bump ipython from 8.12.2 to 8.16.0 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies, python

#232 - Feat/chipper v2

Pull Request - State: closed - Opened by benjats07 about 1 year ago - 7 comments

#231 - Refactor: Remove OCR related code for entire page OCR

Pull Request - State: closed - Opened by yuming-long about 1 year ago

#230 - chore: changelog fix, cut release 0.6.5

Pull Request - State: closed - Opened by christinestraub about 1 year ago

#229 - fix: padded boxes are not rescaled/shifted correctly

Pull Request - State: closed - Opened by badGarnet about 1 year ago

#228 - Fix/pdf miner source property

Pull Request - State: closed - Opened by benjats07 about 1 year ago - 1 comment

#227 - Bug: pdf miner elements don't contain source property correctly filled

Issue - State: closed - Opened by benjats07 about 1 year ago
Labels: bug

#226 - chore: stop passing language code from tesseract mapping to paddle

Pull Request - State: open - Opened by yuming-long about 1 year ago - 3 comments

#225 - Feat/219 keep extracted image elements

Pull Request - State: closed - Opened by christinestraub about 1 year ago

#224 - feat: make table transformer parameters configurable

Pull Request - State: closed - Opened by badGarnet about 1 year ago

#223 - fix: update default background padding value to pass ingest test

Pull Request - State: closed - Opened by badGarnet about 1 year ago

#221 - chore: changelog repair

Pull Request - State: closed - Opened by cragwolfe about 1 year ago

#220 - feat: add pre commit hook

Pull Request - State: closed - Opened by badGarnet about 1 year ago

#218 - feat: add config class

Pull Request - State: closed - Opened by badGarnet about 1 year ago

#217 - ERROR Image size, could be decompression bomb DOS attack

Issue - State: open - Opened by undernightcore about 1 year ago - 3 comments

#216 - feat: add evaluation metric for table extraction

Pull Request - State: closed - Opened by badGarnet about 1 year ago - 1 comment

#215 - enhancement: Get only "true" embedded images when extracting elements from PDF pages

Issue - State: closed - Opened by christinestraub about 1 year ago - 1 comment
Labels: enhancement

#214 - chore: skip paddle unittests local for mac

Pull Request - State: closed - Opened by yuming-long about 1 year ago - 3 comments

#213 - Unset env var after test

Pull Request - State: closed - Opened by tabossert about 1 year ago

#212 - Enhance/duplicated bboxes all document

Pull Request - State: closed - Opened by benjats07 about 1 year ago

#211 - Fix enhance/duplicated bboxes

Pull Request - State: closed - Opened by benjats07 about 1 year ago

#210 - feat: add autoscaling for table images

Pull Request - State: closed - Opened by badGarnet about 1 year ago

#209 - Chore: add env `ENTIRE_PAGE_OCR` to specify paddle/tesseract for entire page ocr

Pull Request - State: closed - Opened by yuming-long about 1 year ago - 2 comments

#208 - Feat/save embedded images in pdf

Pull Request - State: closed - Opened by christinestraub about 1 year ago - 3 comments

#207 - chore: support paddle with both cpu and gpu if it is installed

Pull Request - State: closed - Opened by yuming-long about 1 year ago - 1 comment

#205 - add padding before structure detection

Pull Request - State: closed - Opened by badGarnet about 1 year ago - 2 comments

#204 - remove cv2 preprocessing

Pull Request - State: closed - Opened by badGarnet about 1 year ago

#203 - fix table to html bug

Pull Request - State: closed - Opened by badGarnet about 1 year ago - 2 comments

#202 - use tesseract info to assist table structure

Pull Request - State: closed - Opened by badGarnet about 1 year ago - 2 comments

#201 - Fix/overlapping of bboxes

Pull Request - State: closed - Opened by benjats07 about 1 year ago - 1 comment