Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / DS4SD/docling issues and pull requests

#268 - cli and PDF: wrong table output

Issue - State: open - Opened by aborruso 11 days ago - 1 comment
Labels: bug, table structure

#267 - Can't find a way to use `ImageRefMode.EMBEDDED` in `generate_multimodal_pages`

Issue - State: open - Opened by sunwoongc 11 days ago - 1 comment
Labels: question

#262 - Unable to run.

Issue - State: closed - Opened by ashunaveed 12 days ago - 4 comments
Labels: bug

#261 - Handle vector-image-converted text in PDFs

Issue - State: open - Opened by maxmnemonic 12 days ago - 5 comments
Labels: enhancement, priority:high

#258 - Support Excel files

Issue - State: open - Opened by ImadSaddik 12 days ago - 11 comments
Labels: enhancement

#257 - Add docling support to bee agent framework

Issue - State: open - Opened by PeterStaar-IBM 12 days ago
Labels: help wanted, question

#256 - How can I annotate/caption the image and display it when exporting it to markdown or text file?

Issue - State: closed - Opened by sunwoongc 12 days ago - 6 comments
Labels: question

#255 - Specific language for easyOCR

Issue - State: open - Opened by jonaskahn 12 days ago - 5 comments
Labels: documentation, question, ocr

#244 - OCR Extracted Information

Issue - State: open - Opened by maliktalha370 13 days ago - 2 comments
Labels: question, ocr

#240 - Dev/update html parser with h1

Pull Request - State: open - Opened by PeterStaar-IBM 13 days ago

#225 - Convert pdf to md simplified Chinese character issue

Issue - State: open - Opened by JerryXu2023 14 days ago - 7 comments

#222 - Unable to extract code block in HTML page

Issue - State: closed - Opened by twelveand0 14 days ago - 1 comment
Labels: bug

#210 - The results of the table recognition for the example PDF are incorrect.

Issue - State: closed - Opened by pJahad 15 days ago - 1 comment
Labels: table structure

#207 - Issue with Extracting Tables with Merged Rows

Issue - State: open - Opened by MahmoudAtef999 16 days ago - 3 comments
Labels: table structure

#206 - Javascript alternative?

Issue - State: closed - Opened by vtempest 16 days ago - 7 comments

#198 - GPU support on Windows?

Issue - State: closed - Opened by wbste 17 days ago - 3 comments
Labels: enhancement

#197 - Documentation for metadata extraction

Issue - State: open - Opened by jcoyne 17 days ago - 6 comments
Labels: documentation, enhancement, icebox

#194 - reuse existing chunk/meta types, fix minor issues, lint

Pull Request - State: closed - Opened by vagenas 17 days ago - 1 comment

#165 - Conversion error?

Issue - State: closed - Opened by maurogatti 28 days ago - 2 comments

#133 - State of GPU support

Issue - State: open - Opened by ViktorooReps about 1 month ago - 4 comments

#126 - Categorise table of contents as a new category

Issue - State: closed - Opened by tskvivekmani about 2 months ago - 1 comment

#124 - Investigate how to remove dependency of OpenCV

Issue - State: open - Opened by PeterStaar-IBM about 2 months ago - 3 comments
Labels: help wanted

#107 - Support Html as native input document type

Issue - State: closed - Opened by dolfim-ibm about 2 months ago
Labels: enhancement

#106 - Support Docx via PDF conversion

Issue - State: open - Opened by dolfim-ibm about 2 months ago - 1 comment
Labels: enhancement

#105 - Support Docx as native input document type

Issue - State: closed - Opened by dolfim-ibm about 2 months ago
Labels: enhancement

#104 - Enable native support for Windows

Issue - State: closed - Opened by dolfim-ibm about 2 months ago - 1 comment
Labels: enhancement

#103 - chore: move examples extras to respective group

Pull Request - State: closed - Opened by vagenas about 2 months ago - 1 comment

#102 - fix: fix OCR setting for pypdfium, minor refactor

Pull Request - State: closed - Opened by vagenas about 2 months ago

#101 - chore: add RAG notebook titles

Pull Request - State: closed - Opened by vagenas about 2 months ago

#100 - docs: document CLI, minor README revamp

Pull Request - State: closed - Opened by vagenas about 2 months ago

#99 - feat: add URL support to CLI

Pull Request - State: closed - Opened by vagenas about 2 months ago

#98 - feat: add figure in markdown

Pull Request - State: closed - Opened by dolfim-ibm about 2 months ago

#95 - experimental: introduce img understand pipeline

Pull Request - State: closed - Opened by dolfim-ibm about 2 months ago - 3 comments

#94 - Add figures in markdown output

Issue - State: closed - Opened by dolfim-ibm about 2 months ago
Labels: enhancement

#93 - fix: updated the render_as_doctags with the new arguments from docling-core

Pull Request - State: closed - Opened by PeterStaar-IBM about 2 months ago - 1 comment

#92 - chore: switch to gh apps user

Pull Request - State: closed - Opened by dolfim-ibm about 2 months ago

#91 - feat: Establish DoclingDocument format (experimental)

Pull Request - State: closed - Opened by cau-git about 2 months ago - 1 comment

#90 - feat: Support tableformer model choice

Pull Request - State: closed - Opened by cau-git about 2 months ago

#89 - UnboundLocalError and Loss of Data from Multiple Documents

Issue - State: open - Opened by imene-swaan 2 months ago - 1 comment

#88 - docs: Updated Docling logo.png with transparent background

Pull Request - State: closed - Opened by maxmnemonic 2 months ago

#86 - feat: add table exports

Pull Request - State: closed - Opened by dolfim-ibm 2 months ago

#85 - Advanced Integration with LlamaIndex

Issue - State: closed - Opened by oedemis 2 months ago - 2 comments

#84 - feat: working on adding HF models for figure analysis

Pull Request - State: closed - Opened by PeterStaar-IBM 2 months ago - 1 comment

#83 - fix: bumped the glm version and adjusted the tests

Pull Request - State: closed - Opened by PeterStaar-IBM 2 months ago

#81 - chore: Add PR template

Pull Request - State: closed - Opened by dolfim-ibm 2 months ago

#80 - fix: Initialize docling PDF parser on module level

Pull Request - State: closed - Opened by cau-git 2 months ago - 2 comments

#79 - fix: CLI compatibility with python 3.10 and 3.11

Pull Request - State: closed - Opened by dolfim-ibm 2 months ago

#78 - Is there an Option to specify where extracted images are saved

Issue - State: closed - Opened by yogeesh-agarwal 2 months ago - 1 comment

#77 - Hierarchical Topic Parsing

Issue - State: closed - Opened by thusithaC 2 months ago - 4 comments