Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / DS4SD/docling issues and pull requests

#353 - How do I use the downloaded ds4sd/docling-models?

Issue - State: closed - Opened by Runningwater2357 2 days ago - 3 comments
Labels: bug

#352 - How to ignore equation ?

Issue - State: closed - Opened by kh4n9373 2 days ago - 1 comment
Labels: question

#350 - ci: fix mergify

Pull Request - State: closed - Opened by dolfim-ibm 3 days ago - 1 comment

#349 - feat: Extracting picture data for raster images found in PPTX

Pull Request - State: open - Opened by maxmnemonic 3 days ago - 1 comment

#348 - What are doctags

Issue - State: closed - Opened by pwright 3 days ago
Labels: question

#346 - Analyzing PDf files is too slow

Issue - State: open - Opened by langzichai 3 days ago - 1 comment
Labels: question

#343 - Add LaTex and mathpix-markdown-it as outputs

Issue - State: open - Opened by sirus20x6 4 days ago - 1 comment
Labels: enhancement

#342 - Add Markdown-based table serialization in chunking

Issue - State: open - Opened by vagenas 4 days ago
Labels: enhancement

#341 - docs: add architecture outline

Pull Request - State: closed - Opened by vagenas 4 days ago

#340 - chore: Broken ci

Pull Request - State: closed - Opened by dolfim-ibm 4 days ago

#339 - ci(Mergify): configuration update

Pull Request - State: closed - Opened by dolfim-ibm 4 days ago - 1 comment

#337 - How do I use the downloaded model?

Issue - State: closed - Opened by Zhengyu-Ju 4 days ago - 5 comments
Labels: question

#336 - Fix documentation: DocumentStream gets parameter 'name' and not 'filename'

Issue - State: closed - Opened by tsurelad 4 days ago - 1 comment
Labels: bug

#334 - feat: added excel backend

Pull Request - State: open - Opened by PeterStaar-IBM 4 days ago - 1 comment

#332 - docs: fix parameter in usage.md

Pull Request - State: closed - Opened by capsenz 5 days ago - 2 comments

#331 - Fix documentation for DocumentStream in usage.md

Issue - State: closed - Opened by capsenz 5 days ago - 1 comment
Labels: bug

#330 - fix: Fixing images in the input Word files

Pull Request - State: closed - Opened by maxmnemonic 5 days ago

#328 - SSL error on "Downloading detection model "

Issue - State: closed - Opened by JensGM 5 days ago - 2 comments
Labels: bug

#327 - Standardized Access to Common Email and Calendar Formats

Issue - State: open - Opened by ByteMeFree 5 days ago - 5 comments
Labels: enhancement

#326 - Intranet usage requirements

Issue - State: closed - Opened by paul-yangmy 5 days ago - 5 comments
Labels: enhancement

#325 - docs: add automatic generation of CLI reference

Pull Request - State: closed - Opened by dolfim-ibm 5 days ago - 2 comments

#323 - fix: reduce logging by keeping option for more verbose

Pull Request - State: closed - Opened by dolfim-ibm 5 days ago

#322 - fix: skip glm model downloads

Pull Request - State: closed - Opened by dolfim-ibm 5 days ago

#321 - Can you integrate with new alternative OCR such as Surya OCR, Please

Issue - State: closed - Opened by Teera21 5 days ago - 1 comment
Labels: enhancement

#320 - enhancement: Add timeout limit to document parsing job. #270

Pull Request - State: open - Opened by ab-shrek 5 days ago - 1 comment

#319 - chore: fix Qdrant notebook Colab link

Pull Request - State: closed - Opened by vagenas 6 days ago

#318 - Docling crashes when using EasyOCR on Windows 11

Issue - State: open - Opened by cau-git 6 days ago - 2 comments
Labels: bug, help wanted

#317 - Enable control over log verbosity on the docling CLI

Issue - State: closed - Opened by cau-git 6 days ago
Labels: enhancement

#316 - docs: add Data Prep Kit integration

Pull Request - State: closed - Opened by vagenas 6 days ago

#315 - fix: Configure env prefix for docling settings

Pull Request - State: closed - Opened by cau-git 6 days ago

#314 - fix: Handling of single-cell tables in DOCX backend

Pull Request - State: closed - Opened by maxmnemonic 6 days ago

#312 - docs: Hybrid RAG with Qdrant

Pull Request - State: closed - Opened by Anush008 6 days ago - 1 comment

#310 - consolidate advanced chunker notebook

Pull Request - State: open - Opened by vagenas 7 days ago

#309 - Add option to export_to_markdown to mark page breaks

Issue - State: open - Opened by cau-git 7 days ago - 5 comments
Labels: enhancement

#308 - Convert model weights to safetensors format

Issue - State: open - Opened by cau-git 7 days ago
Labels: enhancement

#307 - fix: Added handling of grouped elements in pptx backend

Pull Request - State: closed - Opened by maxmnemonic 7 days ago

#305 - docs: add navigation indices

Pull Request - State: closed - Opened by vagenas 7 days ago

#304 - Unable to run inference on GPU

Issue - State: open - Opened by poojitha0892 7 days ago - 1 comment
Labels: bug

#303 - Deployment of docling using Docker

Issue - State: open - Opened by Desmond-Fon 7 days ago - 1 comment
Labels: question

#302 - fix: Added handling of code blocks in html with <pre> tag

Pull Request - State: closed - Opened by maxmnemonic 7 days ago

#300 - Support export of DoclingDocument to HTML

Issue - State: open - Opened by cau-git 7 days ago - 2 comments
Labels: enhancement

#299 - Allow extraction of formula images similar to tables and pages

Issue - State: open - Opened by cau-git 7 days ago
Labels: enhancement

#298 - Support Google Docs

Issue - State: open - Opened by vtempest 7 days ago - 7 comments
Labels: enhancement

#295 - EasyOCR does not extract text properly

Issue - State: open - Opened by simonschoe 7 days ago - 2 comments
Labels: bug, ocr

#293 - Is there a Docker deployment solution or a FastAPI server setup available for docling?

Issue - State: closed - Opened by ShedrachJonah11 7 days ago - 1 comment
Labels: question

#292 - In a specific PowerPoint, an issue with missing text occurred during parsing.

Issue - State: closed - Opened by Crespo522 7 days ago - 4 comments
Labels: bug, pptx

#291 - Missing Text Inside Tables When Converting from DOCX to Markdown

Issue - State: closed - Opened by VitoFe 8 days ago - 3 comments
Labels: bug, docx

#288 - FastAPI server for docling

Issue - State: closed - Opened by Gyde04 8 days ago - 1 comment
Labels: question

#287 - Chunking Hierarchy Identification

Issue - State: open - Opened by Shubhamkumar782 9 days ago - 11 comments
Labels: question, PDF parsing

#286 - fix: allow mps usage for easyocr

Pull Request - State: closed - Opened by dolfim-ibm 9 days ago - 2 comments

#285 - Leverage word bbox from pdf-parser-v2 in the layout- and table-model

Issue - State: open - Opened by PeterStaar-IBM 9 days ago - 3 comments
Labels: enhancement, PDF parsing

#280 - Enhanced Table Extraction for Complex Formats

Issue - State: open - Opened by AdBaWa 10 days ago - 4 comments
Labels: enhancement, table structure, icebox

#278 - For long tables, fields are being truncated

Issue - State: open - Opened by PrathamGupta06 10 days ago - 2 comments
Labels: bug, table structure

#277 - result viewer web app

Issue - State: open - Opened by pJahad 10 days ago - 3 comments
Labels: enhancement

#274 - Complex Table Conversion Issue (Wrong order, key-value regions)

Issue - State: open - Opened by DAVIDCRUZ0202 11 days ago - 3 comments
Labels: bug

#273 - Identification of images in docx

Issue - State: closed - Opened by jkindahood 11 days ago - 6 comments
Labels: enhancement

#270 - Add timeout limit to document parsing job.

Issue - State: open - Opened by PeterStaar-IBM 11 days ago - 5 comments
Labels: enhancement, priority:high

#268 - cli and PDF: wrong table output

Issue - State: open - Opened by aborruso 11 days ago - 1 comment
Labels: bug, table structure