Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / ocrmypdf/OCRmyPDF issues and pull requests

#1477 - [Bug]:

Issue - State: closed - Opened by Silgrond 8 days ago - 1 comment

#1476 - [3rdparty]: paperless-ngx - ocrmypdf fails with AttributeError

Issue - State: closed - Opened by winnieXY 12 days ago - 1 comment

#1472 - handwritten recognition

Pull Request - State: closed - Opened by MufeezQadri-main 13 days ago - 1 comment

#1471 - [Bug]: ModuleNotFoundError: No module named 'img2pdf'

Issue - State: closed - Opened by wsshin 13 days ago - 3 comments

#1468 - Process hOCR textangle attribute

Pull Request - State: open - Opened by 0dinD 18 days ago - 3 comments

#1467 - [Feature]: Process hOCR textangle attribute in hOCR to PDF transform

Issue - State: open - Opened by 0dinD 18 days ago
Labels: enhancement, triage

#1466 - Process ocr_caption lines

Pull Request - State: closed - Opened by 0dinD 18 days ago - 1 comment

#1465 - [Bug]: pdf remains skewed

Issue - State: closed - Opened by sadden3194 20 days ago - 1 comment

#1463 - [Feature]: OCR only if there is no text

Issue - State: closed - Opened by electro-logic 24 days ago - 1 comment
Labels: enhancement

#1462 - Add appstream metainfo file + screenshot

Pull Request - State: open - Opened by PunkPangolin about 1 month ago

#1461 - [Feature]: Flatpak support + Flathub

Issue - State: open - Opened by PunkPangolin about 1 month ago - 7 comments
Labels: enhancement

#1460 - [Bug]: Issue in the paramater/argument in ocrmypdf

Issue - State: open - Opened by Sasank2635 about 1 month ago
Labels: triage

#1458 - [Issue]: issue installing ocrmypdf > 16.5.0

Issue - State: closed - Opened by hiilmiee about 1 month ago - 2 comments

#1456 - Improve ProgressBar docstring to clarify usage of increments and completion

Pull Request - State: closed - Opened by QuentinFuxa about 1 month ago - 1 comment

#1455 - [Bug]: Traceback Error with --skip-text option

Issue - State: closed - Opened by FelixKrickl about 1 month ago - 1 comment

#1454 - [Bug]: Replace pngquant with something not requiring Rust

Issue - State: closed - Opened by barracuda156 about 1 month ago - 2 comments

#1453 - [Bug]: produced pdf is empty

Issue - State: closed - Opened by leosenko about 1 month ago - 1 comment

#1453 - [Bug]: produced pdf is empty

Issue - State: closed - Opened by leosenko about 1 month ago - 1 comment

#1452 - [Bug]: unable to install jbig2enc (incl. solution)

Issue - State: closed - Opened by Johan446 about 2 months ago - 2 comments

#1451 - [Feature]: progress callback for ocrmypdf (background usage)

Issue - State: closed - Opened by QuentinFuxa about 2 months ago - 5 comments
Labels: enhancement

#1449 - Bump astral-sh/setup-uv from 4 to 5

Pull Request - State: closed - Opened by dependabot[bot] about 2 months ago
Labels: dependencies

#1448 - graft: fix invisible text appearing after strip_invisible_text

Pull Request - State: closed - Opened by pajowu 2 months ago - 2 comments

#1447 - [Feature]: Aggressive image optimization without color quantization

Issue - State: open - Opened by user1823 2 months ago - 1 comment
Labels: enhancement

#1446 - hocr: only add space if boxwidth is positive

Pull Request - State: closed - Opened by pajowu 2 months ago - 6 comments

#1445 - [Bug]: scanned pdf containig electronics schematic

Issue - State: closed - Opened by saadb 2 months ago - 2 comments
Labels: triage

#1443 - Update intersphinx mapping to current format

Pull Request - State: closed - Opened by QuLogic 2 months ago - 2 comments

#1441 - Fix "Scanning contents" progress bar with --redo-ocr

Pull Request - State: open - Opened by aliemjay 2 months ago - 1 comment

#1440 - fix minor grammar mistake

Pull Request - State: closed - Opened by joskezelensky 3 months ago

#1439 - [Bug]: OCR Output Quality Regression on Ubuntu 24.04

Issue - State: open - Opened by guilhermebferreira 3 months ago - 2 comments
Labels: triage

#1438 - [Bug]: deskew results in "empty" output file

Issue - State: open - Opened by hatl 3 months ago - 1 comment
Labels: bug

#1437 - Documentation for ''ocrmypdf.ocr()" not found

Issue - State: closed - Opened by fatsciock 3 months ago - 2 comments

#1436 - Bump astral-sh/setup-uv from 3 to 4

Pull Request - State: closed - Opened by dependabot[bot] 3 months ago
Labels: dependencies

#1435 - [Feature]: Option to remove OCR

Issue - State: open - Opened by user1823 3 months ago - 2 comments
Labels: enhancement, triage

#1434 - [Feature]: Feature Request - Use Google Document AI or VIsion AI instead of Tesseract

Issue - State: open - Opened by epatels 3 months ago
Labels: enhancement, triage

#1433 - Bump codecov/codecov-action from 4 to 5

Pull Request - State: closed - Opened by dependabot[bot] 3 months ago
Labels: dependencies

#1432 - [Bug]: pikepdf PdfMatrix module unavailale

Issue - State: closed - Opened by IsaacSugden 3 months ago - 2 comments
Labels: triage

#1430 - [Feature]: Add drop caps support

Issue - State: closed - Opened by 4F2E4A2E 3 months ago - 2 comments
Labels: enhancement, triage

#1428 - ocrmypdf isn't installing on termux

Issue - State: closed - Opened by eelalzep 3 months ago - 2 comments

#1427 - [Bug]: HOCRResult.from_json() not unpickling correctly

Issue - State: closed - Opened by hoblins 3 months ago
Labels: triage

#1426 - [Bug]: Docker container entry point

Issue - State: closed - Opened by sneakpodbob 3 months ago - 1 comment
Labels: triage

#1425 - [3rdparty]: paperless-ngx

Issue - State: closed - Opened by Checole 3 months ago - 3 comments
Labels: triage

#1423 - [Bug]: test_malformed_docinfo fails with spectacular INTERNALERROR

Issue - State: open - Opened by mcepl 3 months ago - 3 comments
Labels: third party issue

#1422 - [Feature]: Show page numbers when detecting rotation

Issue - State: closed - Opened by tsoernes 3 months ago - 1 comment
Labels: enhancement, triage

#1421 - [Feature]: Show page number in PriorOcrFoundError

Issue - State: closed - Opened by tsoernes 3 months ago - 1 comment
Labels: enhancement, triage

#1420 - [Bug]: '_idat' object has no attribute 'fileno' // No space left on device

Issue - State: closed - Opened by kkduke 4 months ago - 5 comments
Labels: user config

#1415 - [Bug]: Example docker-compose.yml not working anymore

Issue - State: closed - Opened by ckagerer 4 months ago - 2 comments
Labels: triage

#1413 - [3rdparty]: paperless-ngx PDF Fails to Process with InputFileError: PDF content stream is corrupt

Issue - State: open - Opened by singlatushar07 4 months ago - 1 comment
Labels: bug, need test file

#1412 - [Bug]: "remove-background is temporarily not implemented" error on linux

Issue - State: closed - Opened by dimyself 4 months ago - 2 comments
Labels: triage

#1411 - [Bug]: Unable to proceed with a custom language lacking a dictionary

Issue - State: closed - Opened by vchgan 4 months ago - 1 comment
Labels: triage

#1409 - [Bug]: Unpaper Not Found: "Warning: using insecure memory!"

Issue - State: closed - Opened by vfilby 4 months ago - 2 comments
Labels: triage

#1407 - Data privacy when using OCRmyPDF

Issue - State: closed - Opened by etroci 4 months ago - 2 comments

#1406 - [Bug]: cannot import name 'PdfMatrix' from 'pikepdf'

Issue - State: closed - Opened by kdbreck 4 months ago - 1 comment
Labels: triage

#1405 - [Feature]: support for Apple vision framework

Issue - State: closed - Opened by santiagozky 4 months ago - 2 comments
Labels: enhancement, triage

#1404 - Doc: new infix for temp files; snap temp files folder

Pull Request - State: closed - Opened by mayeulk 4 months ago

#1403 - [Bug]: Refuses to process old book with existing OCR

Issue - State: closed - Opened by themaster567 4 months ago - 1 comment
Labels: triage

#1400 - [Bug]: File generated by OCRmyPDF doesn't open in all PDF editors

Issue - State: open - Opened by sklart 5 months ago - 4 comments
Labels: need test file

#1399 - [Bug]: Highlights/annotations repeated on all pages

Issue - State: open - Opened by Jmuccigr 5 months ago - 1 comment
Labels: triage

#1398 - [Bug]: pikepdf cropbox/mediabox/trimbox as list can return strings in the list

Issue - State: open - Opened by jozuas 5 months ago - 2 comments
Labels: triage

#1396 - [Bug]: Cannot create a file when that file already exists

Issue - State: closed - Opened by user1823 5 months ago - 1 comment
Labels: triage

#1395 - [Bug]: Tesseract fails on Alpine 3.20.3

Issue - State: closed - Opened by pschichtel 5 months ago - 2 comments

#1394 - [Feature]: Align pages to text baseline

Issue - State: closed - Opened by swxxii 5 months ago - 2 comments
Labels: enhancement, triage

#1393 - How to remove the image-with-text from the PDF

Issue - State: open - Opened by SurinameClubcard 5 months ago - 1 comment

#1392 - Bump sigstore/gh-action-sigstore-python from 2.1.1 to 3.0.0

Pull Request - State: closed - Opened by dependabot[bot] 6 months ago
Labels: dependencies

#1391 - 当使用ocrmypdf输入 PDF 为中文时,结果 复制PDF 中有额外的空格

Issue - State: open - Opened by deict 6 months ago - 1 comment
Labels: triage

#1390 - [3rdparty]: 当使用ocrmypdf输入 PDF 为中文时,结果 复制PDF 中有额外的空格

Issue - State: closed - Opened by deict 6 months ago - 1 comment
Labels: triage

#1389 - [Feature]: Add a flag to enable ocrmypdf to write "last-modified attribute" to the OCR'ed file.

Issue - State: closed - Opened by ashrockd 6 months ago - 2 comments
Labels: enhancement

#1384 - Add mdate preservation

Pull Request - State: closed - Opened by ferdiga 6 months ago - 1 comment

#1382 - Fix broken test_rotate_page_level

Pull Request - State: closed - Opened by QuLogic 6 months ago - 1 comment

#1380 - [Bug]: Scan time regression in 16.4.3 with `--redo-ocr`

Issue - State: closed - Opened by aliemjay 6 months ago - 16 comments

#1379 - [Bug/Feature]: a way to disable Ghostscript requirement & broken plugin_manager option

Issue - State: open - Opened by nikitar 6 months ago - 13 comments
Labels: triage

#1378 - [Bug]: Scan time increases quadratically with page count

Issue - State: closed - Opened by aliemjay 6 months ago - 9 comments

#1377 - [Bug]: Regression in 16.4

Issue - State: closed - Opened by gringus 6 months ago - 7 comments
Labels: triage

#1376 - [Bug]: NotImplementedError in colorspace

Issue - State: closed - Opened by macdeport 6 months ago - 6 comments

#1375 - [Bug]: ocrmypdf: error: unrecognized arguments: input.pdf output.pdf

Issue - State: closed - Opened by KNDaniel 6 months ago - 3 comments
Labels: triage

#1374 - [Feature]: Result Improvement with OpenCV + Pillow Preprocessing

Issue - State: closed - Opened by vishaldwdi 6 months ago - 3 comments
Labels: enhancement, triage

#1373 - does not ocr 90° rotated texts

Issue - State: closed - Opened by stfnx 6 months ago - 1 comment

#1372 - [Bug]: Output file is okay but is not PDF/A

Issue - State: closed - Opened by tcurdt 7 months ago - 3 comments
Labels: triage

#1370 - [Query]: docker watched folder environment variables, optimize how?

Issue - State: closed - Opened by jaxjexjox 7 months ago - 2 comments
Labels: triage

#1369 - [Bug]: Large file size increases due to PDF/A font substitution

Issue - State: open - Opened by ferdiga 7 months ago - 9 comments
Labels: bug

#1368 - [Bug]: maximum recursion depth exceeded

Issue - State: closed - Opened by you-healthtap 7 months ago - 2 comments
Labels: triage

#1367 - [Bug]: The generated PDF is INVALID

Issue - State: closed - Opened by user1823 7 months ago - 4 comments
Labels: triage

#1366 - [Bug]: Output PDF is too large

Issue - State: open - Opened by user1823 7 months ago
Labels: triage

#1365 - [Bug]: The width is not correct for detected words

Issue - State: closed - Opened by you-healthtap 7 months ago - 4 comments
Labels: triage

#1364 - [Bug]: cannot add non-opaque RGBA color to RGB palette

Issue - State: closed - Opened by jozuas 7 months ago - 2 comments
Labels: third party issue

#1362 - [Bug]: Ghostscript rasterizing failed

Issue - State: closed - Opened by user1823 7 months ago
Labels: triage

#1361 - [Bug]: pdfminer.pdfexceptions.PDFTypeError: invalid length: 6

Issue - State: closed - Opened by user1823 7 months ago - 9 comments
Labels: need test file

#1360 - ocrmypdf produces wrong page size

Issue - State: open - Opened by femifrak 7 months ago - 3 comments