Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / ocrmypdf/OCRmyPDF issues and pull requests

#1399 - [Bug]: Highlights/annotations repeated on all pages

Issue - State: open - Opened by Jmuccigr 5 days ago
Labels: triage

#1398 - [Bug]: pikepdf cropbox/mediabox/trimbox as list can return strings in the list

Issue - State: open - Opened by jozuas 5 days ago - 1 comment
Labels: triage

#1396 - [Bug]: Cannot create a file when that file already exists

Issue - State: open - Opened by user1823 10 days ago
Labels: triage

#1395 - [Bug]: Tesseract fails on Alpine 3.20.3

Issue - State: closed - Opened by pschichtel 13 days ago - 1 comment
Labels: triage

#1394 - [Feature]: Align pages to text baseline

Issue - State: closed - Opened by swxxii 19 days ago - 2 comments
Labels: enhancement, triage

#1392 - Bump sigstore/gh-action-sigstore-python from 2.1.1 to 3.0.0

Pull Request - State: closed - Opened by dependabot[bot] 27 days ago
Labels: dependencies

#1391 - 当使用ocrmypdf输入 PDF 为中文时,结果 复制PDF 中有额外的空格

Issue - State: open - Opened by deict 30 days ago - 1 comment
Labels: triage

#1390 - [3rdparty]: 当使用ocrmypdf输入 PDF 为中文时,结果 复制PDF 中有额外的空格

Issue - State: closed - Opened by deict about 1 month ago - 1 comment
Labels: triage

#1389 - [Feature]: Add a flag to enable ocrmypdf to write "last-modified attribute" to the OCR'ed file.

Issue - State: closed - Opened by ashrockd about 1 month ago - 2 comments
Labels: enhancement

#1385 - Recommended way of running ocrmypdf with memory limits

Issue - State: closed - Opened by andersfylling about 1 month ago

#1384 - Add mdate preservation

Pull Request - State: closed - Opened by ferdiga about 1 month ago - 1 comment

#1382 - Fix broken test_rotate_page_level

Pull Request - State: closed - Opened by QuLogic about 1 month ago - 1 comment

#1380 - [Bug]: Scan time regression in 16.4.3 with `--redo-ocr`

Issue - State: open - Opened by aliemjay about 1 month ago - 14 comments

#1379 - [Bug/Feature]: a way to disable Ghostscript requirement & broken plugin_manager option

Issue - State: open - Opened by nikitar about 1 month ago - 12 comments
Labels: triage

#1378 - [Bug]: Scan time increases quadratically with page count

Issue - State: open - Opened by aliemjay about 1 month ago - 8 comments

#1377 - [Bug]: Regression in 16.4

Issue - State: closed - Opened by gringus about 2 months ago - 7 comments
Labels: triage

#1376 - [Bug]: NotImplementedError in colorspace

Issue - State: closed - Opened by macdeport about 2 months ago - 6 comments

#1375 - [Bug]: ocrmypdf: error: unrecognized arguments: input.pdf output.pdf

Issue - State: closed - Opened by KNDaniel about 2 months ago - 3 comments
Labels: triage

#1374 - [Feature]: Result Improvement with OpenCV + Pillow Preprocessing

Issue - State: closed - Opened by vishaldwdi about 2 months ago - 3 comments
Labels: enhancement, triage

#1373 - does not ocr 90° rotated texts

Issue - State: closed - Opened by stfnx about 2 months ago - 1 comment

#1372 - [Bug]: Output file is okay but is not PDF/A

Issue - State: closed - Opened by tcurdt about 2 months ago - 3 comments
Labels: triage

#1370 - [Query]: docker watched folder environment variables, optimize how?

Issue - State: closed - Opened by jaxjexjox about 2 months ago - 2 comments
Labels: triage

#1369 - [Bug]: Large file size increases due to PDF/A font substitution

Issue - State: open - Opened by ferdiga about 2 months ago - 9 comments
Labels: bug

#1368 - [Bug]: maximum recursion depth exceeded

Issue - State: closed - Opened by you-healthtap about 2 months ago - 2 comments
Labels: triage

#1367 - [Bug]: The generated PDF is INVALID

Issue - State: closed - Opened by user1823 about 2 months ago - 4 comments
Labels: triage

#1366 - [Bug]: Output PDF is too large

Issue - State: open - Opened by user1823 about 2 months ago
Labels: triage

#1365 - [Bug]: The width is not correct for detected words

Issue - State: closed - Opened by you-healthtap about 2 months ago - 4 comments
Labels: triage

#1364 - [Bug]: cannot add non-opaque RGBA color to RGB palette

Issue - State: open - Opened by jozuas 2 months ago - 2 comments
Labels: third party issue

#1362 - [Bug]: Ghostscript rasterizing failed

Issue - State: closed - Opened by user1823 2 months ago
Labels: triage

#1361 - [Bug]: pdfminer.pdfexceptions.PDFTypeError: invalid length: 6

Issue - State: closed - Opened by user1823 2 months ago - 9 comments
Labels: need test file

#1360 - ocrmypdf produces wrong page size

Issue - State: open - Opened by femifrak 2 months ago - 3 comments

#1356 - [Bug]: FileNotFoundError: [Errno 2] No such file or directory: 'gs'

Issue - State: closed - Opened by 459737087 2 months ago - 2 comments
Labels: user config

#1355 - Update installation.rst "python -m venv .venv"

Pull Request - State: closed - Opened by JoKalliauer 2 months ago

#1354 - Add '--needed' flag to arch base-devel install command

Pull Request - State: closed - Opened by mersenne-twister 3 months ago

#1353 - --sidecar writes text content and messages to file

Issue - State: closed - Opened by gerritgriebel 3 months ago - 2 comments

#1352 - [Bug]: files signed with a-trust are not recognised as digitally signed and hence processed

Issue - State: closed - Opened by ferdiga 3 months ago - 1 comment
Labels: old version

#1351 - [Bug]: Ghostscript rasterizing failed

Issue - State: closed - Opened by JoKalliauer 3 months ago - 3 comments
Labels: bug

#1350 - [Bug]: KeyError: '/Subtype'

Issue - State: closed - Opened by user1823 3 months ago
Labels: triage

#1348 - [Bug]: problem with tif "DPI is not credible". Estimate dpi

Issue - State: closed - Opened by drnicolas 3 months ago - 3 comments
Labels: triage

#1346 - [Bug]: OSError: [Errno 28] No space left on device

Issue - State: closed - Opened by Salvodif 3 months ago - 4 comments
Labels: triage

#1345 - Output file images are corrupted

Issue - State: closed - Opened by robmclear 3 months ago - 1 comment
Labels: third party issue

#1344 - [Bug]: doesn't always parse Latin with diacritics

Issue - State: closed - Opened by arsinclair 3 months ago - 3 comments
Labels: third party issue

#1343 - [Feature]: Enable execution on GPU

Issue - State: closed - Opened by danielfcastro 3 months ago - 1 comment
Labels: enhancement, triage

#1342 - [Request]: Please make rich logging library an optional dependency

Issue - State: open - Opened by lucasgadams 3 months ago - 1 comment
Labels: enhancement

#1337 - [Bug]: Existing text is completely replaced with other characters

Issue - State: open - Opened by david-sledge 3 months ago - 3 comments
Labels: third party issue

#1336 - [Bug]: ocrmypdf (16.3.1) and Tesseract 5.4.1

Issue - State: closed - Opened by Johnnie390 3 months ago - 3 comments
Labels: bug, triage

#1335 - [Bug]: `lots of diacritics - possibly poor OCR` but using standalone tesseract works perfectly

Issue - State: closed - Opened by KAGEYAM4 3 months ago - 1 comment
Labels: bug, triage

#1334 - [Bug]: No errors and no output for large DPI files

Issue - State: closed - Opened by dan-ryan 3 months ago - 2 comments
Labels: bug, triage

#1332 - [Bug]: MetadataProgress does not respect progress_bar=False argument

Issue - State: closed - Opened by DavidMChan 4 months ago
Labels: bug, triage

#1331 - [Bug]: Paperless-ngx Release 2.9.0 Ghostscript rasterizing failed

Issue - State: closed - Opened by Johnnie390 4 months ago - 1 comment
Labels: bug

#1329 - [Bug]: ocrmypdf 16.3.1 fails on a file on Arch that 13.4.0 on Ubuntu handles well

Issue - State: closed - Opened by Fifis 4 months ago - 1 comment
Labels: bug

#1328 - [Bug]: crashes with tesseract 5.4.0

Issue - State: closed - Opened by mplx 4 months ago - 8 comments
Labels: bug

#1327 - Update docker.rst

Pull Request - State: closed - Opened by omidraha 4 months ago

#1326 - Incorrect behavior of text color setting in hocrtransform

Issue - State: closed - Opened by ep0p 4 months ago - 2 comments

#1325 - [Bug]: --tesseract-pagesegmode is not sufficiently documented

Issue - State: closed - Opened by thomas2net 4 months ago - 1 comment
Labels: bug

#1323 - [Bug]: OCR not complete. Parts of all pages are ignored

Issue - State: closed - Opened by 0lm 4 months ago - 1 comment
Labels: bug

#1322 - [Bug]: multiple spaces not supported for delimitation of bbox parameters

Issue - State: closed - Opened by Tehgg 4 months ago - 1 comment
Labels: bug

#1321 - [Bug]: Flood of "Recursion depth exceeded in _find_image_xrefs_page"

Issue - State: closed - Opened by user1584 4 months ago - 5 comments
Labels: bug

#1318 - [Bug]:

Issue - State: closed - Opened by Firestar-Reimu 4 months ago - 4 comments
Labels: bug

#1317 - Pushed docker image is always Ubuntu instead of alpine

Issue - State: closed - Opened by vihtap 4 months ago - 1 comment

#1316 - [Bug]: test_semfree fails with ghostscript 10.03.0+

Issue - State: closed - Opened by gringus 4 months ago
Labels: bug

#1315 - [Bug]: NotImplementedError: not sure how to get colorspace

Issue - State: open - Opened by macdeport 4 months ago - 2 comments
Labels: bug

#1314 - [Feature]: If page has text, force OCR and rasterize page

Issue - State: open - Opened by mikejokic 4 months ago - 1 comment
Labels: enhancement

#1313 - Show progress during postprocessing

Issue - State: open - Opened by user1823 5 months ago - 5 comments
Labels: enhancement

#1312 - [Bug]: Crash on multiple .pdf files

Issue - State: closed - Opened by olafure 5 months ago - 5 comments
Labels: bug

#1311 - Indian Numbers on Arabic text

Issue - State: open - Opened by MedoHamdani 5 months ago

#1309 - Make usage of --rotate-pages-threshold clearer

Issue - State: closed - Opened by stegl83 5 months ago

#1308 - [Bug]: cannot import name 'PDFTextSeq' from 'pdfminer.pdfdevice'

Issue - State: closed - Opened by user1823 5 months ago - 3 comments
Labels: bug

#1307 - [Bug]: No longer works - macos-11.7 x86_64 Python 3.10

Issue - State: closed - Opened by atanasj 5 months ago - 11 comments
Labels: bug, user config

#1306 - [Bug]: File size increased

Issue - State: closed - Opened by user1823 5 months ago - 7 comments
Labels: bug

#1304 - [Bug]: conda installation

Issue - State: closed - Opened by kevinkaw 5 months ago - 2 comments
Labels: bug

#1303 - [Bug]: ValueError: ObjectList must have 6 elements

Issue - State: closed - Opened by macdeport 5 months ago - 3 comments
Labels: bug

#1302 - not user friendly

Issue - State: closed - Opened by abood-az 5 months ago - 1 comment
Labels: bug

#1301 - [Feature]: JPEG XL support

Issue - State: closed - Opened by Lyapsus 5 months ago - 3 comments
Labels: enhancement

#1300 - Fix wrong env var for GS path in Snap

Pull Request - State: closed - Opened by helkaluin 5 months ago - 1 comment

#1299 - [Feature]: Change demo format to VHS

Issue - State: open - Opened by jbarlow83 5 months ago
Labels: enhancement

#1296 - Adding language install docs for archlinux

Pull Request - State: closed - Opened by ahmedsbytes 5 months ago

#1295 - Release notes don't include the latest versions

Issue - State: closed - Opened by user1823 5 months ago - 1 comment

#1293 - [Bug]: Warning: "xref 473: While extracting this image, an error occurred"

Issue - State: closed - Opened by macdeport 6 months ago - 1 comment
Labels: bug

#1290 - [Bug]: Memory Error

Issue - State: open - Opened by user1823 6 months ago
Labels: bug

#1289 - [Bug]: DecompressionBombWarning

Issue - State: closed - Opened by user1823 6 months ago - 1 comment
Labels: bug

#1287 - Update the typer[all] dependency to typer-slim[standard]

Pull Request - State: closed - Opened by musicinmybrain 6 months ago - 2 comments

#1286 - added Macports install information

Pull Request - State: closed - Opened by akierig 6 months ago

#1283 - max_workers must be greater than 0

Issue - State: closed - Opened by nope999 6 months ago - 2 comments
Labels: need test file

#1282 - [Feature]: Choose between NFKC and NFC normalization for Unicode characters so copy-pasting works

Issue - State: open - Opened by sfllaw 6 months ago - 5 comments
Labels: enhancement

#1281 - [Bug] SubprocessOutputError

Issue - State: closed - Opened by user1823 6 months ago - 4 comments
Labels: bug

#1279 - Allow resuming OCR after DecompressionBombError

Issue - State: closed - Opened by user1823 7 months ago - 3 comments
Labels: enhancement

#1278 - [Bug]: The file size increases significantly by OCR even without image recompression

Issue - State: open - Opened by ybeltukov 7 months ago - 2 comments
Labels: bug

#1277 - batch example: added archive, small corrections and optimizations

Pull Request - State: closed - Opened by NilsRo 7 months ago - 1 comment