Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / ocrmypdf/OCRmyPDF issues and pull requests
#1399 - [Bug]: Highlights/annotations repeated on all pages
Issue -
State: open - Opened by Jmuccigr 5 days ago
Labels: triage
#1398 - [Bug]: pikepdf cropbox/mediabox/trimbox as list can return strings in the list
Issue -
State: open - Opened by jozuas 6 days ago
- 1 comment
Labels: triage
#1396 - [Bug]: Cannot create a file when that file already exists
Issue -
State: open - Opened by user1823 11 days ago
Labels: triage
#1395 - [Bug]: Tesseract fails on Alpine 3.20.3
Issue -
State: closed - Opened by pschichtel 13 days ago
- 1 comment
Labels: triage
#1394 - [Feature]: Align pages to text baseline
Issue -
State: closed - Opened by swxxii 19 days ago
- 2 comments
Labels: enhancement, triage
#1393 - How to remove the image-with-text from the PDF
Issue -
State: open - Opened by SurinameClubcard 21 days ago
#1392 - Bump sigstore/gh-action-sigstore-python from 2.1.1 to 3.0.0
Pull Request -
State: closed - Opened by dependabot[bot] 27 days ago
Labels: dependencies
#1391 - 当使用ocrmypdf输入 PDF 为中文时,结果 复制PDF 中有额外的空格
Issue -
State: open - Opened by deict 30 days ago
- 1 comment
Labels: triage
#1390 - [3rdparty]: 当使用ocrmypdf输入 PDF 为中文时,结果 复制PDF 中有额外的空格
Issue -
State: closed - Opened by deict about 1 month ago
- 1 comment
Labels: triage
#1389 - [Feature]: Add a flag to enable ocrmypdf to write "last-modified attribute" to the OCR'ed file.
Issue -
State: closed - Opened by ashrockd about 1 month ago
- 2 comments
Labels: enhancement
#1388 - [Feature]: decrypt file if qpdf is installed (EncryptedPdfError: Input PDF is encrypted. The encryption must be removed to perform OCR.)
Issue -
State: closed - Opened by JoKalliauer about 1 month ago
- 1 comment
Labels: enhancement, triage
#1387 - [Bug]: "AttributeError: module 'numpy.typing' has no attribute 'NDArray'" after Homebrew installation
Issue -
State: closed - Opened by tillboehringer about 1 month ago
- 6 comments
Labels: triage
#1385 - Recommended way of running ocrmypdf with memory limits
Issue -
State: closed - Opened by andersfylling about 1 month ago
#1384 - Add mdate preservation
Pull Request -
State: closed - Opened by ferdiga about 1 month ago
- 1 comment
#1382 - Fix broken test_rotate_page_level
Pull Request -
State: closed - Opened by QuLogic about 1 month ago
- 1 comment
#1380 - [Bug]: Scan time regression in 16.4.3 with `--redo-ocr`
Issue -
State: open - Opened by aliemjay about 1 month ago
- 14 comments
#1379 - [Bug/Feature]: a way to disable Ghostscript requirement & broken plugin_manager option
Issue -
State: open - Opened by nikitar about 1 month ago
- 12 comments
Labels: triage
#1378 - [Bug]: Scan time increases quadratically with page count
Issue -
State: open - Opened by aliemjay about 1 month ago
- 8 comments
#1377 - [Bug]: Regression in 16.4
Issue -
State: closed - Opened by gringus about 2 months ago
- 7 comments
Labels: triage
#1376 - [Bug]: NotImplementedError in colorspace
Issue -
State: closed - Opened by macdeport about 2 months ago
- 6 comments
#1375 - [Bug]: ocrmypdf: error: unrecognized arguments: input.pdf output.pdf
Issue -
State: closed - Opened by KNDaniel about 2 months ago
- 3 comments
Labels: triage
#1374 - [Feature]: Result Improvement with OpenCV + Pillow Preprocessing
Issue -
State: closed - Opened by vishaldwdi about 2 months ago
- 3 comments
Labels: enhancement, triage
#1373 - does not ocr 90° rotated texts
Issue -
State: closed - Opened by stfnx about 2 months ago
- 1 comment
#1372 - [Bug]: Output file is okay but is not PDF/A
Issue -
State: closed - Opened by tcurdt about 2 months ago
- 3 comments
Labels: triage
#1370 - [Query]: docker watched folder environment variables, optimize how?
Issue -
State: closed - Opened by jaxjexjox about 2 months ago
- 2 comments
Labels: triage
#1369 - [Bug]: Large file size increases due to PDF/A font substitution
Issue -
State: open - Opened by ferdiga about 2 months ago
- 9 comments
Labels: bug
#1368 - [Bug]: maximum recursion depth exceeded
Issue -
State: closed - Opened by you-healthtap about 2 months ago
- 2 comments
Labels: triage
#1367 - [Bug]: The generated PDF is INVALID
Issue -
State: closed - Opened by user1823 about 2 months ago
- 4 comments
Labels: triage
#1366 - [Bug]: Output PDF is too large
Issue -
State: open - Opened by user1823 about 2 months ago
Labels: triage
#1365 - [Bug]: The width is not correct for detected words
Issue -
State: closed - Opened by you-healthtap about 2 months ago
- 4 comments
Labels: triage
#1364 - [Bug]: cannot add non-opaque RGBA color to RGB palette
Issue -
State: open - Opened by jozuas 2 months ago
- 2 comments
Labels: third party issue
#1363 - [Bug]: subprocess.CalledProcessError: Command '['D:\\latex\\texlive\\2020\\bin\\win32\\jbig2.EXE', '--version']' returned non-zero exit status 3.
Issue -
State: closed - Opened by 459737087 2 months ago
- 1 comment
Labels: bug
#1362 - [Bug]: Ghostscript rasterizing failed
Issue -
State: closed - Opened by user1823 2 months ago
Labels: triage
#1361 - [Bug]: pdfminer.pdfexceptions.PDFTypeError: invalid length: 6
Issue -
State: closed - Opened by user1823 2 months ago
- 9 comments
Labels: need test file
#1360 - ocrmypdf produces wrong page size
Issue -
State: open - Opened by femifrak 2 months ago
- 3 comments
#1359 - [Bug]: with the latest version of Ghostscript 10.03.1, ocrmypdf is passing file names to Ghostscript in the wrong order
Issue -
State: closed - Opened by alan-sandollar 2 months ago
Labels: triage
#1356 - [Bug]: FileNotFoundError: [Errno 2] No such file or directory: 'gs'
Issue -
State: closed - Opened by 459737087 2 months ago
- 2 comments
Labels: user config
#1355 - Update installation.rst "python -m venv .venv"
Pull Request -
State: closed - Opened by JoKalliauer 2 months ago
#1354 - Add '--needed' flag to arch base-devel install command
Pull Request -
State: closed - Opened by mersenne-twister 3 months ago
#1353 - --sidecar writes text content and messages to file
Issue -
State: closed - Opened by gerritgriebel 3 months ago
- 2 comments
#1352 - [Bug]: files signed with a-trust are not recognised as digitally signed and hence processed
Issue -
State: closed - Opened by ferdiga 3 months ago
- 1 comment
Labels: old version
#1351 - [Bug]: Ghostscript rasterizing failed
Issue -
State: closed - Opened by JoKalliauer 3 months ago
- 3 comments
Labels: bug
#1350 - [Bug]: KeyError: '/Subtype'
Issue -
State: closed - Opened by user1823 3 months ago
Labels: triage
#1349 - [Bug]: Ghostscript can't create a PDF/A-file (Page object was reserved for an Annotation destination)
Issue -
State: closed - Opened by JoKalliauer 3 months ago
- 3 comments
Labels: triage
#1348 - [Bug]: problem with tif "DPI is not credible". Estimate dpi
Issue -
State: closed - Opened by drnicolas 3 months ago
- 3 comments
Labels: triage
#1346 - [Bug]: OSError: [Errno 28] No space left on device
Issue -
State: closed - Opened by Salvodif 3 months ago
- 4 comments
Labels: triage
#1345 - Output file images are corrupted
Issue -
State: closed - Opened by robmclear 3 months ago
- 1 comment
Labels: third party issue
#1344 - [Bug]: doesn't always parse Latin with diacritics
Issue -
State: closed - Opened by arsinclair 3 months ago
- 3 comments
Labels: third party issue
#1343 - [Feature]: Enable execution on GPU
Issue -
State: closed - Opened by danielfcastro 3 months ago
- 1 comment
Labels: enhancement, triage
#1342 - [Request]: Please make rich logging library an optional dependency
Issue -
State: open - Opened by lucasgadams 3 months ago
- 1 comment
Labels: enhancement
#1337 - [Bug]: Existing text is completely replaced with other characters
Issue -
State: open - Opened by david-sledge 3 months ago
- 3 comments
Labels: third party issue
#1336 - [Bug]: ocrmypdf (16.3.1) and Tesseract 5.4.1
Issue -
State: closed - Opened by Johnnie390 3 months ago
- 3 comments
Labels: bug, triage
#1335 - [Bug]: `lots of diacritics - possibly poor OCR` but using standalone tesseract works perfectly
Issue -
State: closed - Opened by KAGEYAM4 3 months ago
- 1 comment
Labels: bug, triage
#1334 - [Bug]: No errors and no output for large DPI files
Issue -
State: closed - Opened by dan-ryan 3 months ago
- 2 comments
Labels: bug, triage
#1332 - [Bug]: MetadataProgress does not respect progress_bar=False argument
Issue -
State: closed - Opened by DavidMChan 4 months ago
Labels: bug, triage
#1331 - [Bug]: Paperless-ngx Release 2.9.0 Ghostscript rasterizing failed
Issue -
State: closed - Opened by Johnnie390 4 months ago
- 1 comment
Labels: bug
#1330 - [Feature]: Alternative AI OCR "surya" as opposed to EasyOCR, Just found it today and it dominated the accuracy and speed of Tesseract & EasyOCR
Issue -
State: open - Opened by abclution 4 months ago
- 3 comments
Labels: enhancement
#1329 - [Bug]: ocrmypdf 16.3.1 fails on a file on Arch that 13.4.0 on Ubuntu handles well
Issue -
State: closed - Opened by Fifis 4 months ago
- 1 comment
Labels: bug
#1328 - [Bug]: crashes with tesseract 5.4.0
Issue -
State: closed - Opened by mplx 4 months ago
- 8 comments
Labels: bug
#1327 - Update docker.rst
Pull Request -
State: closed - Opened by omidraha 4 months ago
#1326 - Incorrect behavior of text color setting in hocrtransform
Issue -
State: closed - Opened by ep0p 4 months ago
- 2 comments
#1325 - [Bug]: --tesseract-pagesegmode is not sufficiently documented
Issue -
State: closed - Opened by thomas2net 4 months ago
- 1 comment
Labels: bug
#1324 - Error occurred while consuming document out1.pdf: SubprocessOutputError: Ghostscript rasterizing failed.
Issue -
State: closed - Opened by dekoenpi 4 months ago
- 1 comment
Labels: bug
#1323 - [Bug]: OCR not complete. Parts of all pages are ignored
Issue -
State: closed - Opened by 0lm 4 months ago
- 1 comment
Labels: bug
#1322 - [Bug]: multiple spaces not supported for delimitation of bbox parameters
Issue -
State: closed - Opened by Tehgg 4 months ago
- 1 comment
Labels: bug
#1321 - [Bug]: Flood of "Recursion depth exceeded in _find_image_xrefs_page"
Issue -
State: closed - Opened by user1584 4 months ago
- 5 comments
Labels: bug
#1318 - [Bug]:
Issue -
State: closed - Opened by Firestar-Reimu 4 months ago
- 4 comments
Labels: bug
#1317 - Pushed docker image is always Ubuntu instead of alpine
Issue -
State: closed - Opened by vihtap 4 months ago
- 1 comment
#1316 - [Bug]: test_semfree fails with ghostscript 10.03.0+
Issue -
State: closed - Opened by gringus 4 months ago
Labels: bug
#1315 - [Bug]: NotImplementedError: not sure how to get colorspace
Issue -
State: open - Opened by macdeport 4 months ago
- 2 comments
Labels: bug
#1314 - [Feature]: If page has text, force OCR and rasterize page
Issue -
State: open - Opened by mikejokic 5 months ago
- 1 comment
Labels: enhancement
#1313 - Show progress during postprocessing
Issue -
State: open - Opened by user1823 5 months ago
- 5 comments
Labels: enhancement
#1312 - [Bug]: Crash on multiple .pdf files
Issue -
State: closed - Opened by olafure 5 months ago
- 5 comments
Labels: bug
#1311 - Indian Numbers on Arabic text
Issue -
State: open - Opened by MedoHamdani 5 months ago
#1309 - Make usage of --rotate-pages-threshold clearer
Issue -
State: closed - Opened by stegl83 5 months ago
#1308 - [Bug]: cannot import name 'PDFTextSeq' from 'pdfminer.pdfdevice'
Issue -
State: closed - Opened by user1823 5 months ago
- 3 comments
Labels: bug
#1307 - [Bug]: No longer works - macos-11.7 x86_64 Python 3.10
Issue -
State: closed - Opened by atanasj 5 months ago
- 11 comments
Labels: bug, user config
#1306 - [Bug]: File size increased
Issue -
State: closed - Opened by user1823 5 months ago
- 7 comments
Labels: bug
#1304 - [Bug]: conda installation
Issue -
State: closed - Opened by kevinkaw 5 months ago
- 2 comments
Labels: bug
#1303 - [Bug]: ValueError: ObjectList must have 6 elements
Issue -
State: closed - Opened by macdeport 5 months ago
- 3 comments
Labels: bug
#1302 - not user friendly
Issue -
State: closed - Opened by abood-az 5 months ago
- 1 comment
Labels: bug
#1301 - [Feature]: JPEG XL support
Issue -
State: closed - Opened by Lyapsus 5 months ago
- 3 comments
Labels: enhancement
#1300 - Fix wrong env var for GS path in Snap
Pull Request -
State: closed - Opened by helkaluin 5 months ago
- 1 comment
#1299 - [Feature]: Change demo format to VHS
Issue -
State: open - Opened by jbarlow83 5 months ago
Labels: enhancement
#1297 - [Bug]: real text replaced by � � (visually unchanged, only by copying)
Issue -
State: open - Opened by JoKalliauer 5 months ago
Labels: bug
#1296 - Adding language install docs for archlinux
Pull Request -
State: closed - Opened by ahmedsbytes 5 months ago
#1295 - Release notes don't include the latest versions
Issue -
State: closed - Opened by user1823 5 months ago
- 1 comment
#1294 - [Bug]: watcher.py requires the "ARCHIVE" folder to be assigned, even if the option is disabled
Issue -
State: closed - Opened by clodobox 5 months ago
- 1 comment
#1293 - [Bug]: Warning: "xref 473: While extracting this image, an error occurred"
Issue -
State: closed - Opened by macdeport 6 months ago
- 1 comment
Labels: bug
#1290 - [Bug]: Memory Error
Issue -
State: open - Opened by user1823 6 months ago
Labels: bug
#1289 - [Bug]: DecompressionBombWarning
Issue -
State: closed - Opened by user1823 6 months ago
- 1 comment
Labels: bug
#1287 - Update the typer[all] dependency to typer-slim[standard]
Pull Request -
State: closed - Opened by musicinmybrain 6 months ago
- 2 comments
#1286 - added Macports install information
Pull Request -
State: closed - Opened by akierig 6 months ago
#1284 - [Feature]: Could watcher.py be enhanced to support the conversion of single or multi TIF and JPG files to PDF?
Issue -
State: closed - Opened by EvilQoo 6 months ago
- 1 comment
Labels: enhancement
#1283 - max_workers must be greater than 0
Issue -
State: closed - Opened by nope999 6 months ago
- 2 comments
Labels: need test file
#1282 - [Feature]: Choose between NFKC and NFC normalization for Unicode characters so copy-pasting works
Issue -
State: open - Opened by sfllaw 6 months ago
- 5 comments
Labels: enhancement
#1281 - [Bug] SubprocessOutputError
Issue -
State: closed - Opened by user1823 6 months ago
- 4 comments
Labels: bug
#1279 - Allow resuming OCR after DecompressionBombError
Issue -
State: closed - Opened by user1823 7 months ago
- 3 comments
Labels: enhancement
#1278 - [Bug]: The file size increases significantly by OCR even without image recompression
Issue -
State: open - Opened by ybeltukov 7 months ago
- 2 comments
Labels: bug
#1277 - batch example: added archive, small corrections and optimizations
Pull Request -
State: closed - Opened by NilsRo 7 months ago
- 1 comment