Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / ocrmypdf/OCRmyPDF issues and pull requests
#1359 - [Bug]: with the latest version of Ghostscript 10.03.1, ocrmypdf is passing file names to Ghostscript in the wrong order
Issue -
State: closed - Opened by alan-sandollar 7 months ago
Labels: triage
#1356 - [Bug]: FileNotFoundError: [Errno 2] No such file or directory: 'gs'
Issue -
State: closed - Opened by 459737087 7 months ago
- 2 comments
Labels: user config
#1355 - Update installation.rst "python -m venv .venv"
Pull Request -
State: closed - Opened by JoKalliauer 7 months ago
#1354 - Add '--needed' flag to arch base-devel install command
Pull Request -
State: closed - Opened by mersenne-twister 7 months ago
#1353 - --sidecar writes text content and messages to file
Issue -
State: closed - Opened by gerritgriebel 7 months ago
- 2 comments
#1352 - [Bug]: files signed with a-trust are not recognised as digitally signed and hence processed
Issue -
State: closed - Opened by ferdiga 8 months ago
- 1 comment
Labels: old version
#1351 - [Bug]: Ghostscript rasterizing failed
Issue -
State: closed - Opened by JoKalliauer 8 months ago
- 3 comments
Labels: bug
#1350 - [Bug]: KeyError: '/Subtype'
Issue -
State: closed - Opened by user1823 8 months ago
Labels: triage
#1349 - [Bug]: Ghostscript can't create a PDF/A-file (Page object was reserved for an Annotation destination)
Issue -
State: closed - Opened by JoKalliauer 8 months ago
- 3 comments
Labels: triage
#1348 - [Bug]: problem with tif "DPI is not credible". Estimate dpi
Issue -
State: closed - Opened by drnicolas 8 months ago
- 3 comments
Labels: triage
#1346 - [Bug]: OSError: [Errno 28] No space left on device
Issue -
State: closed - Opened by Salvodif 8 months ago
- 4 comments
Labels: triage
#1345 - Output file images are corrupted
Issue -
State: closed - Opened by robmclear 8 months ago
- 1 comment
Labels: third party issue
#1344 - [Bug]: doesn't always parse Latin with diacritics
Issue -
State: closed - Opened by arsinclair 8 months ago
- 3 comments
Labels: third party issue
#1343 - [Feature]: Enable execution on GPU
Issue -
State: closed - Opened by danielfcastro 8 months ago
- 1 comment
Labels: enhancement, triage
#1342 - [Request]: Please make rich logging library an optional dependency
Issue -
State: open - Opened by lucasgadams 8 months ago
- 1 comment
Labels: enhancement
#1337 - [Bug]: Existing text is completely replaced with other characters
Issue -
State: open - Opened by david-sledge 8 months ago
- 3 comments
Labels: third party issue
#1336 - [Bug]: ocrmypdf (16.3.1) and Tesseract 5.4.1
Issue -
State: closed - Opened by Johnnie390 8 months ago
- 3 comments
Labels: bug, triage
#1335 - [Bug]: `lots of diacritics - possibly poor OCR` but using standalone tesseract works perfectly
Issue -
State: closed - Opened by KAGEYAM4 8 months ago
- 1 comment
Labels: bug, triage
#1334 - [Bug]: No errors and no output for large DPI files
Issue -
State: closed - Opened by dan-ryan 8 months ago
- 2 comments
Labels: bug, triage
#1332 - [Bug]: MetadataProgress does not respect progress_bar=False argument
Issue -
State: closed - Opened by DavidMChan 8 months ago
Labels: bug, triage
#1331 - [Bug]: Paperless-ngx Release 2.9.0 Ghostscript rasterizing failed
Issue -
State: closed - Opened by Johnnie390 8 months ago
- 1 comment
Labels: bug
#1330 - [Feature]: Alternative AI OCR "surya" as opposed to EasyOCR, Just found it today and it dominated the accuracy and speed of Tesseract & EasyOCR
Issue -
State: closed - Opened by abclution 8 months ago
- 6 comments
Labels: enhancement
#1329 - [Bug]: ocrmypdf 16.3.1 fails on a file on Arch that 13.4.0 on Ubuntu handles well
Issue -
State: closed - Opened by Fifis 8 months ago
- 1 comment
Labels: bug
#1328 - [Bug]: crashes with tesseract 5.4.0
Issue -
State: closed - Opened by mplx 9 months ago
- 8 comments
Labels: bug
#1327 - Update docker.rst
Pull Request -
State: closed - Opened by omidraha 9 months ago
#1326 - Incorrect behavior of text color setting in hocrtransform
Issue -
State: closed - Opened by ep0p 9 months ago
- 2 comments
#1325 - [Bug]: --tesseract-pagesegmode is not sufficiently documented
Issue -
State: closed - Opened by thomas2net 9 months ago
- 1 comment
Labels: bug
#1324 - Error occurred while consuming document out1.pdf: SubprocessOutputError: Ghostscript rasterizing failed.
Issue -
State: closed - Opened by dekoenpi 9 months ago
- 1 comment
Labels: bug
#1323 - [Bug]: OCR not complete. Parts of all pages are ignored
Issue -
State: closed - Opened by 0lm 9 months ago
- 1 comment
Labels: bug
#1322 - [Bug]: multiple spaces not supported for delimitation of bbox parameters
Issue -
State: closed - Opened by Tehgg 9 months ago
- 1 comment
Labels: bug
#1321 - [Bug]: Flood of "Recursion depth exceeded in _find_image_xrefs_page"
Issue -
State: closed - Opened by user1584 9 months ago
- 6 comments
Labels: bug
#1318 - [Bug]:
Issue -
State: closed - Opened by Firestar-Reimu 9 months ago
- 4 comments
Labels: bug
#1317 - Pushed docker image is always Ubuntu instead of alpine
Issue -
State: closed - Opened by vihtap 9 months ago
- 1 comment
#1316 - [Bug]: test_semfree fails with ghostscript 10.03.0+
Issue -
State: closed - Opened by gringus 9 months ago
Labels: bug
#1315 - [Bug]: NotImplementedError: not sure how to get colorspace
Issue -
State: open - Opened by macdeport 9 months ago
- 2 comments
Labels: bug
#1314 - [Feature]: If page has text, force OCR and rasterize page
Issue -
State: open - Opened by mikejokic 9 months ago
- 1 comment
Labels: enhancement
#1313 - Show progress during postprocessing
Issue -
State: open - Opened by user1823 9 months ago
- 5 comments
Labels: enhancement
#1312 - [Bug]: Crash on multiple .pdf files
Issue -
State: closed - Opened by olafure 9 months ago
- 5 comments
Labels: bug
#1311 - Indian Numbers on Arabic text
Issue -
State: closed - Opened by MedoHamdani 9 months ago
- 1 comment
#1309 - Make usage of --rotate-pages-threshold clearer
Issue -
State: closed - Opened by stegl83 9 months ago
#1308 - [Bug]: cannot import name 'PDFTextSeq' from 'pdfminer.pdfdevice'
Issue -
State: closed - Opened by user1823 10 months ago
- 3 comments
Labels: bug
#1307 - [Bug]: No longer works - macos-11.7 x86_64 Python 3.10
Issue -
State: closed - Opened by atanasj 10 months ago
- 11 comments
Labels: bug, user config
#1306 - [Bug]: File size increased
Issue -
State: closed - Opened by user1823 10 months ago
- 7 comments
Labels: bug
#1304 - [Bug]: conda installation
Issue -
State: closed - Opened by kevinkaw 10 months ago
- 2 comments
Labels: bug
#1303 - [Bug]: ValueError: ObjectList must have 6 elements
Issue -
State: closed - Opened by macdeport 10 months ago
- 3 comments
Labels: bug
#1302 - not user friendly
Issue -
State: closed - Opened by abood-az 10 months ago
- 1 comment
Labels: bug
#1301 - [Feature]: JPEG XL support
Issue -
State: closed - Opened by Lyapsus 10 months ago
- 3 comments
Labels: enhancement
#1300 - Fix wrong env var for GS path in Snap
Pull Request -
State: closed - Opened by helkaluin 10 months ago
- 1 comment
#1299 - [Feature]: Change demo format to VHS
Issue -
State: open - Opened by jbarlow83 10 months ago
Labels: enhancement
#1297 - [Bug]: real text replaced by � � (visually unchanged, only by copying)
Issue -
State: open - Opened by JoKalliauer 10 months ago
- 1 comment
Labels: bug
#1296 - Adding language install docs for archlinux
Pull Request -
State: closed - Opened by ahmedsbytes 10 months ago
#1295 - Release notes don't include the latest versions
Issue -
State: closed - Opened by user1823 10 months ago
- 1 comment
#1294 - [Bug]: watcher.py requires the "ARCHIVE" folder to be assigned, even if the option is disabled
Issue -
State: closed - Opened by clodobox 10 months ago
- 1 comment
#1293 - [Bug]: Warning: "xref 473: While extracting this image, an error occurred"
Issue -
State: closed - Opened by macdeport 10 months ago
- 1 comment
Labels: bug
#1290 - [Bug]: Memory Error
Issue -
State: open - Opened by user1823 11 months ago
Labels: bug
#1289 - [Bug]: DecompressionBombWarning
Issue -
State: closed - Opened by user1823 11 months ago
- 1 comment
Labels: bug
#1287 - Update the typer[all] dependency to typer-slim[standard]
Pull Request -
State: closed - Opened by musicinmybrain 11 months ago
- 2 comments
#1286 - added Macports install information
Pull Request -
State: closed - Opened by akierig 11 months ago
#1284 - [Feature]: Could watcher.py be enhanced to support the conversion of single or multi TIF and JPG files to PDF?
Issue -
State: closed - Opened by EvilQoo 11 months ago
- 1 comment
Labels: enhancement
#1283 - max_workers must be greater than 0
Issue -
State: closed - Opened by nope999 11 months ago
- 2 comments
Labels: need test file
#1282 - [Feature]: Choose between NFKC and NFC normalization for Unicode characters so copy-pasting works
Issue -
State: open - Opened by sfllaw 11 months ago
- 5 comments
Labels: enhancement
#1281 - [Bug] SubprocessOutputError
Issue -
State: closed - Opened by user1823 11 months ago
- 4 comments
Labels: bug
#1279 - Allow resuming OCR after DecompressionBombError
Issue -
State: closed - Opened by user1823 11 months ago
- 3 comments
Labels: enhancement
#1278 - [Bug]: The file size increases significantly by OCR even without image recompression
Issue -
State: open - Opened by ybeltukov 11 months ago
- 2 comments
Labels: bug
#1277 - batch example: added archive, small corrections and optimizations
Pull Request -
State: closed - Opened by NilsRo 11 months ago
- 1 comment
#1275 - Fix Broken Documentation Links
Pull Request -
State: closed - Opened by danloveg 11 months ago
#1274 - Recommended settings for dealing with text superimposed on clipart?
Issue -
State: closed - Opened by MBYlt 11 months ago
- 1 comment
#1272 - [Bug]: Missing support for certain unicode characters
Issue -
State: open - Opened by vera-bernhard 12 months ago
- 3 comments
Labels: bug
#1271 - [Bug]: AttributeError: 'NoneType' object has no attribute 'get'
Issue -
State: closed - Opened by nikitar 12 months ago
Labels: bug
#1269 - [Bug]: "Corrupt JPEG data: premature end of data segment" with some files
Issue -
State: closed - Opened by macdeport 12 months ago
- 3 comments
Labels: user config
#1268 - Update Dockerfile.alpine
Pull Request -
State: closed - Opened by emielmolenaar 12 months ago
#1267 - [Bug]: Ghostscript PDF/A rendering failed
Issue -
State: closed - Opened by davide125 12 months ago
- 1 comment
Labels: bug
#1264 - [Bug]: dpi-problem with rasterizing text
Issue -
State: closed - Opened by JoKalliauer 12 months ago
- 5 comments
Labels: bug
#1263 - [Bug]: OCRmyPDF Docker Hot Folder Option OCR_ON_SUCCESS_ARCHIVE OCR_ON_SUCCESS_DELETE doesnt work
Issue -
State: open - Opened by mazi19 12 months ago
Labels: bug
#1262 - Error: jbig2 not found on path, even though installed
Issue -
State: closed - Opened by anaxonda 12 months ago
- 4 comments
Labels: user config, third party issue
#1261 - [Bug]: OCRmyPDF succeeded with warning(s): InputFileError: pdfminer could not process page 0
Issue -
State: closed - Opened by Markoise 12 months ago
- 1 comment
Labels: invalid, need test file, third party issue
#1260 - Fix entrypoint for docker commands
Pull Request -
State: closed - Opened by SirRegion 12 months ago
- 1 comment
#1259 - [Bug]: version confusion
Issue -
State: closed - Opened by branko623 about 1 year ago
- 1 comment
Labels: bug
#1258 - [Bug]: Watcher doesnt notice changes after update
Issue -
State: closed - Opened by Major2828 about 1 year ago
Labels: bug
#1257 - Handle PermissionError when finding tools
Pull Request -
State: closed - Opened by grembo about 1 year ago
- 4 comments
#1256 - Trying to debug OCR_ON_SUCCESS_DELETE flag not being executed - add exit code to watcher.py?
Issue -
State: closed - Opened by wabarkley about 1 year ago
- 2 comments
Labels: bug
#1255 - PDF-A produces lossy result
Issue -
State: closed - Opened by YutMarma about 1 year ago
- 5 comments
#1253 - [Feature]: Support RapidOCR engine
Issue -
State: closed - Opened by saccohuo about 1 year ago
- 1 comment
Labels: enhancement
#1252 - [Feature]: sidecar Support Text Output to io.StringIO()
Issue -
State: closed - Opened by MAbdElRaouf about 1 year ago
Labels: enhancement
#1251 - [Bug]: OCRmyPDF not adding any text to document v 1.4
Issue -
State: closed - Opened by maxi07 about 1 year ago
- 1 comment
Labels: bug
#1250 - [Feature]: Integrations with other backends via hOcr (naive implementation of easyOcr backend inside)
Issue -
State: open - Opened by coffepowered about 1 year ago
- 4 comments
Labels: enhancement
#1249 - [Documentation]: Upgrade via pip after system install needs a different command
Issue -
State: closed - Opened by dajare about 1 year ago
- 1 comment
Labels: enhancement
#1248 - Update README.md
Pull Request -
State: closed - Opened by rudolphos about 1 year ago
- 1 comment
#1247 - Bump codecov/codecov-action from 3 to 4
Pull Request -
State: closed - Opened by dependabot[bot] about 1 year ago
Labels: dependencies
#1246 - [Feature]: convert grayscale PDF to jbig monochrome while doing OCR
Issue -
State: closed - Opened by callegar about 1 year ago
- 1 comment
Labels: enhancement
#1245 - [Bug]: installation failed due to ghostcript in-compatible version and can not upgraded ghostscript in Ubuntu 20.04
Issue -
State: closed - Opened by rohan-paul about 1 year ago
- 1 comment
Labels: bug
#1244 - [Bug]: OCR on .pdf isn't the same as tesseract but the format is correct on .txt file
Issue -
State: open - Opened by matsumurae about 1 year ago
Labels: bug
#1243 - [Feature]: Add support for docTR as alternate OCR backend?
Issue -
State: closed - Opened by victorhooi about 1 year ago
- 4 comments
Labels: enhancement
#1241 - [Bug]: Unknown tesseract error, returns non-zero
Issue -
State: closed - Opened by nepomuc about 1 year ago
- 1 comment
Labels: bug
#1240 - [Bug]: Memory access error if using a German terminal
Issue -
State: closed - Opened by Pete1976 about 1 year ago
- 2 comments
Labels: bug
#1237 - Doc suggestion: also great for just removing the text layer!
Issue -
State: closed - Opened by hmijail about 1 year ago
- 1 comment
Labels: enhancement
#1236 - [Feature]: More Accessible Via Consistently connecting words to form sentences.
Issue -
State: closed - Opened by PiggiesGoSqueal about 1 year ago
- 2 comments
Labels: enhancement
#1235 - [Feature]: Explain on the docs how to change the language of OCR on watcher.py
Issue -
State: closed - Opened by iohann95 about 1 year ago
- 1 comment
Labels: enhancement
#1232 - [Bug]: Conda - pikepdf is unavailable
Issue -
State: closed - Opened by kielbowicz about 1 year ago
- 1 comment
Labels: bug