Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / ocrmypdf/OCRmyPDF issues and pull requests

#1356 - [Bug]: FileNotFoundError: [Errno 2] No such file or directory: 'gs'

Issue - State: closed - Opened by 459737087 7 months ago - 2 comments
Labels: user config

#1355 - Update installation.rst "python -m venv .venv"

Pull Request - State: closed - Opened by JoKalliauer 7 months ago

#1354 - Add '--needed' flag to arch base-devel install command

Pull Request - State: closed - Opened by mersenne-twister 7 months ago

#1353 - --sidecar writes text content and messages to file

Issue - State: closed - Opened by gerritgriebel 7 months ago - 2 comments

#1352 - [Bug]: files signed with a-trust are not recognised as digitally signed and hence processed

Issue - State: closed - Opened by ferdiga 8 months ago - 1 comment
Labels: old version

#1351 - [Bug]: Ghostscript rasterizing failed

Issue - State: closed - Opened by JoKalliauer 8 months ago - 3 comments
Labels: bug

#1350 - [Bug]: KeyError: '/Subtype'

Issue - State: closed - Opened by user1823 8 months ago
Labels: triage

#1348 - [Bug]: problem with tif "DPI is not credible". Estimate dpi

Issue - State: closed - Opened by drnicolas 8 months ago - 3 comments
Labels: triage

#1346 - [Bug]: OSError: [Errno 28] No space left on device

Issue - State: closed - Opened by Salvodif 8 months ago - 4 comments
Labels: triage

#1345 - Output file images are corrupted

Issue - State: closed - Opened by robmclear 8 months ago - 1 comment
Labels: third party issue

#1344 - [Bug]: doesn't always parse Latin with diacritics

Issue - State: closed - Opened by arsinclair 8 months ago - 3 comments
Labels: third party issue

#1343 - [Feature]: Enable execution on GPU

Issue - State: closed - Opened by danielfcastro 8 months ago - 1 comment
Labels: enhancement, triage

#1342 - [Request]: Please make rich logging library an optional dependency

Issue - State: open - Opened by lucasgadams 8 months ago - 1 comment
Labels: enhancement

#1337 - [Bug]: Existing text is completely replaced with other characters

Issue - State: open - Opened by david-sledge 8 months ago - 3 comments
Labels: third party issue

#1336 - [Bug]: ocrmypdf (16.3.1) and Tesseract 5.4.1

Issue - State: closed - Opened by Johnnie390 8 months ago - 3 comments
Labels: bug, triage

#1335 - [Bug]: `lots of diacritics - possibly poor OCR` but using standalone tesseract works perfectly

Issue - State: closed - Opened by KAGEYAM4 8 months ago - 1 comment
Labels: bug, triage

#1334 - [Bug]: No errors and no output for large DPI files

Issue - State: closed - Opened by dan-ryan 8 months ago - 2 comments
Labels: bug, triage

#1332 - [Bug]: MetadataProgress does not respect progress_bar=False argument

Issue - State: closed - Opened by DavidMChan 8 months ago
Labels: bug, triage

#1331 - [Bug]: Paperless-ngx Release 2.9.0 Ghostscript rasterizing failed

Issue - State: closed - Opened by Johnnie390 8 months ago - 1 comment
Labels: bug

#1329 - [Bug]: ocrmypdf 16.3.1 fails on a file on Arch that 13.4.0 on Ubuntu handles well

Issue - State: closed - Opened by Fifis 8 months ago - 1 comment
Labels: bug

#1328 - [Bug]: crashes with tesseract 5.4.0

Issue - State: closed - Opened by mplx 9 months ago - 8 comments
Labels: bug

#1327 - Update docker.rst

Pull Request - State: closed - Opened by omidraha 9 months ago

#1326 - Incorrect behavior of text color setting in hocrtransform

Issue - State: closed - Opened by ep0p 9 months ago - 2 comments

#1325 - [Bug]: --tesseract-pagesegmode is not sufficiently documented

Issue - State: closed - Opened by thomas2net 9 months ago - 1 comment
Labels: bug

#1323 - [Bug]: OCR not complete. Parts of all pages are ignored

Issue - State: closed - Opened by 0lm 9 months ago - 1 comment
Labels: bug

#1322 - [Bug]: multiple spaces not supported for delimitation of bbox parameters

Issue - State: closed - Opened by Tehgg 9 months ago - 1 comment
Labels: bug

#1321 - [Bug]: Flood of "Recursion depth exceeded in _find_image_xrefs_page"

Issue - State: closed - Opened by user1584 9 months ago - 6 comments
Labels: bug

#1318 - [Bug]:

Issue - State: closed - Opened by Firestar-Reimu 9 months ago - 4 comments
Labels: bug

#1317 - Pushed docker image is always Ubuntu instead of alpine

Issue - State: closed - Opened by vihtap 9 months ago - 1 comment

#1316 - [Bug]: test_semfree fails with ghostscript 10.03.0+

Issue - State: closed - Opened by gringus 9 months ago
Labels: bug

#1315 - [Bug]: NotImplementedError: not sure how to get colorspace

Issue - State: open - Opened by macdeport 9 months ago - 2 comments
Labels: bug

#1314 - [Feature]: If page has text, force OCR and rasterize page

Issue - State: open - Opened by mikejokic 9 months ago - 1 comment
Labels: enhancement

#1313 - Show progress during postprocessing

Issue - State: open - Opened by user1823 9 months ago - 5 comments
Labels: enhancement

#1312 - [Bug]: Crash on multiple .pdf files

Issue - State: closed - Opened by olafure 9 months ago - 5 comments
Labels: bug

#1311 - Indian Numbers on Arabic text

Issue - State: closed - Opened by MedoHamdani 9 months ago - 1 comment

#1309 - Make usage of --rotate-pages-threshold clearer

Issue - State: closed - Opened by stegl83 9 months ago

#1308 - [Bug]: cannot import name 'PDFTextSeq' from 'pdfminer.pdfdevice'

Issue - State: closed - Opened by user1823 10 months ago - 3 comments
Labels: bug

#1307 - [Bug]: No longer works - macos-11.7 x86_64 Python 3.10

Issue - State: closed - Opened by atanasj 10 months ago - 11 comments
Labels: bug, user config

#1306 - [Bug]: File size increased

Issue - State: closed - Opened by user1823 10 months ago - 7 comments
Labels: bug

#1304 - [Bug]: conda installation

Issue - State: closed - Opened by kevinkaw 10 months ago - 2 comments
Labels: bug

#1303 - [Bug]: ValueError: ObjectList must have 6 elements

Issue - State: closed - Opened by macdeport 10 months ago - 3 comments
Labels: bug

#1302 - not user friendly

Issue - State: closed - Opened by abood-az 10 months ago - 1 comment
Labels: bug

#1301 - [Feature]: JPEG XL support

Issue - State: closed - Opened by Lyapsus 10 months ago - 3 comments
Labels: enhancement

#1300 - Fix wrong env var for GS path in Snap

Pull Request - State: closed - Opened by helkaluin 10 months ago - 1 comment

#1299 - [Feature]: Change demo format to VHS

Issue - State: open - Opened by jbarlow83 10 months ago
Labels: enhancement

#1297 - [Bug]: real text replaced by � � (visually unchanged, only by copying)

Issue - State: open - Opened by JoKalliauer 10 months ago - 1 comment
Labels: bug

#1296 - Adding language install docs for archlinux

Pull Request - State: closed - Opened by ahmedsbytes 10 months ago

#1295 - Release notes don't include the latest versions

Issue - State: closed - Opened by user1823 10 months ago - 1 comment

#1293 - [Bug]: Warning: "xref 473: While extracting this image, an error occurred"

Issue - State: closed - Opened by macdeport 10 months ago - 1 comment
Labels: bug

#1290 - [Bug]: Memory Error

Issue - State: open - Opened by user1823 11 months ago
Labels: bug

#1289 - [Bug]: DecompressionBombWarning

Issue - State: closed - Opened by user1823 11 months ago - 1 comment
Labels: bug

#1287 - Update the typer[all] dependency to typer-slim[standard]

Pull Request - State: closed - Opened by musicinmybrain 11 months ago - 2 comments

#1286 - added Macports install information

Pull Request - State: closed - Opened by akierig 11 months ago

#1283 - max_workers must be greater than 0

Issue - State: closed - Opened by nope999 11 months ago - 2 comments
Labels: need test file

#1282 - [Feature]: Choose between NFKC and NFC normalization for Unicode characters so copy-pasting works

Issue - State: open - Opened by sfllaw 11 months ago - 5 comments
Labels: enhancement

#1281 - [Bug] SubprocessOutputError

Issue - State: closed - Opened by user1823 11 months ago - 4 comments
Labels: bug

#1279 - Allow resuming OCR after DecompressionBombError

Issue - State: closed - Opened by user1823 11 months ago - 3 comments
Labels: enhancement

#1278 - [Bug]: The file size increases significantly by OCR even without image recompression

Issue - State: open - Opened by ybeltukov 11 months ago - 2 comments
Labels: bug

#1277 - batch example: added archive, small corrections and optimizations

Pull Request - State: closed - Opened by NilsRo 11 months ago - 1 comment

#1275 - Fix Broken Documentation Links

Pull Request - State: closed - Opened by danloveg 11 months ago

#1274 - Recommended settings for dealing with text superimposed on clipart?

Issue - State: closed - Opened by MBYlt 11 months ago - 1 comment

#1272 - [Bug]: Missing support for certain unicode characters

Issue - State: open - Opened by vera-bernhard 12 months ago - 3 comments
Labels: bug

#1271 - [Bug]: AttributeError: 'NoneType' object has no attribute 'get'

Issue - State: closed - Opened by nikitar 12 months ago
Labels: bug

#1269 - [Bug]: "Corrupt JPEG data: premature end of data segment" with some files

Issue - State: closed - Opened by macdeport 12 months ago - 3 comments
Labels: user config

#1268 - Update Dockerfile.alpine

Pull Request - State: closed - Opened by emielmolenaar 12 months ago

#1267 - [Bug]: Ghostscript PDF/A rendering failed

Issue - State: closed - Opened by davide125 12 months ago - 1 comment
Labels: bug

#1264 - [Bug]: dpi-problem with rasterizing text

Issue - State: closed - Opened by JoKalliauer 12 months ago - 5 comments
Labels: bug

#1262 - Error: jbig2 not found on path, even though installed

Issue - State: closed - Opened by anaxonda 12 months ago - 4 comments
Labels: user config, third party issue

#1261 - [Bug]: OCRmyPDF succeeded with warning(s): InputFileError: pdfminer could not process page 0

Issue - State: closed - Opened by Markoise 12 months ago - 1 comment
Labels: invalid, need test file, third party issue

#1260 - Fix entrypoint for docker commands

Pull Request - State: closed - Opened by SirRegion 12 months ago - 1 comment

#1259 - [Bug]: version confusion

Issue - State: closed - Opened by branko623 about 1 year ago - 1 comment
Labels: bug

#1258 - [Bug]: Watcher doesnt notice changes after update

Issue - State: closed - Opened by Major2828 about 1 year ago
Labels: bug

#1257 - Handle PermissionError when finding tools

Pull Request - State: closed - Opened by grembo about 1 year ago - 4 comments

#1256 - Trying to debug OCR_ON_SUCCESS_DELETE flag not being executed - add exit code to watcher.py?

Issue - State: closed - Opened by wabarkley about 1 year ago - 2 comments
Labels: bug

#1255 - PDF-A produces lossy result

Issue - State: closed - Opened by YutMarma about 1 year ago - 5 comments

#1253 - [Feature]: Support RapidOCR engine

Issue - State: closed - Opened by saccohuo about 1 year ago - 1 comment
Labels: enhancement

#1252 - [Feature]: sidecar Support Text Output to io.StringIO()

Issue - State: closed - Opened by MAbdElRaouf about 1 year ago
Labels: enhancement

#1251 - [Bug]: OCRmyPDF not adding any text to document v 1.4

Issue - State: closed - Opened by maxi07 about 1 year ago - 1 comment
Labels: bug

#1250 - [Feature]: Integrations with other backends via hOcr (naive implementation of easyOcr backend inside)

Issue - State: open - Opened by coffepowered about 1 year ago - 4 comments
Labels: enhancement

#1249 - [Documentation]: Upgrade via pip after system install needs a different command

Issue - State: closed - Opened by dajare about 1 year ago - 1 comment
Labels: enhancement

#1248 - Update README.md

Pull Request - State: closed - Opened by rudolphos about 1 year ago - 1 comment

#1247 - Bump codecov/codecov-action from 3 to 4

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago
Labels: dependencies

#1246 - [Feature]: convert grayscale PDF to jbig monochrome while doing OCR

Issue - State: closed - Opened by callegar about 1 year ago - 1 comment
Labels: enhancement

#1243 - [Feature]: Add support for docTR as alternate OCR backend?

Issue - State: closed - Opened by victorhooi about 1 year ago - 4 comments
Labels: enhancement

#1241 - [Bug]: Unknown tesseract error, returns non-zero

Issue - State: closed - Opened by nepomuc about 1 year ago - 1 comment
Labels: bug

#1240 - [Bug]: Memory access error if using a German terminal

Issue - State: closed - Opened by Pete1976 about 1 year ago - 2 comments
Labels: bug

#1237 - Doc suggestion: also great for just removing the text layer!

Issue - State: closed - Opened by hmijail about 1 year ago - 1 comment
Labels: enhancement

#1236 - [Feature]: More Accessible Via Consistently connecting words to form sentences.

Issue - State: closed - Opened by PiggiesGoSqueal about 1 year ago - 2 comments
Labels: enhancement

#1235 - [Feature]: Explain on the docs how to change the language of OCR on watcher.py

Issue - State: closed - Opened by iohann95 about 1 year ago - 1 comment
Labels: enhancement

#1232 - [Bug]: Conda - pikepdf is unavailable

Issue - State: closed - Opened by kielbowicz about 1 year ago - 1 comment
Labels: bug

#1232 - [Bug]: Conda - pikepdf is unavailable

Issue - State: closed - Opened by kielbowicz about 1 year ago - 1 comment
Labels: bug