Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / ocrmypdf/OCRmyPDF issues and pull requests

#1159 - [Bug]: MissingDependencyError: tesseract on Heroku despite setting environment variables

Issue - State: closed - Opened by troublesprouter about 1 year ago - 1 comment
Labels: user config

#1158 - [Feature]: Manually correcting OCR errors

Issue - State: closed - Opened by tslivnik about 1 year ago - 4 comments
Labels: enhancement

#1157 - OCR-Generated Text Layers Not Readable by PDF Readers for RTL Languages Like Persian

Issue - State: open - Opened by PSEUDO-SAPPHO about 1 year ago - 8 comments
Labels: third party issue

#1156 - [Bug]: Always get "FileNoFoundError on input fiel

Issue - State: closed - Opened by drnicolas about 1 year ago - 5 comments
Labels: user config

#1155 - [Feature]: Add parameter to ignore "Invalid rotation" errors from img2pdf

Issue - State: closed - Opened by iohann95 about 1 year ago - 2 comments
Labels: enhancement

#1154 - [Bug]: --remove-background it is showing an error

Issue - State: closed - Opened by wellingto198 about 1 year ago - 1 comment
Labels: bug

#1153 - Bump docker/setup-buildx-action from 2 to 3

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies

#1152 - Bump docker/login-action from 2 to 3

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies

#1151 - Bump docker/setup-qemu-action from 2 to 3

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies

#1150 - [Bug]: JBIG2 corruption of scanned pages & some pages overwriting other pages

Issue - State: closed - Opened by gwern about 1 year ago - 3 comments
Labels: bug

#1149 - Bump actions/checkout from 3 to 4

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago - 1 comment
Labels: dependencies

#1148 - [Feature]: language translation

Issue - State: closed - Opened by vdun about 1 year ago - 1 comment
Labels: enhancement

#1147 - [Bug]: AttributeError: '_idat' object has no attribute 'fileno'

Issue - State: closed - Opened by 875d about 1 year ago - 3 comments
Labels: bug

#1146 - Change skip-ocr to skip-text for fish completion

Pull Request - State: closed - Opened by ss8931 about 1 year ago

#1145 - [Bug]: JBIG2 - 2 colors

Issue - State: closed - Opened by zvezdochiot about 1 year ago - 4 comments
Labels: bug

#1143 - [Feature]: Make Ghostscript Colour Conversion Configurable

Issue - State: closed - Opened by marcules about 1 year ago - 1 comment
Labels: enhancement

#1141 - [Bug]: No error, but 0-byte PDF produced

Issue - State: closed - Opened by matt-cassinelli about 1 year ago - 6 comments
Labels: user config

#1139 - [Bug]: SubprocessOutputError when scanning a specific PDF

Issue - State: closed - Opened by AgustinOrdonez about 1 year ago - 11 comments
Labels: bug, need test file

#1138 - [Bug]: `--jpeg-quality` does nothing useful and is extremely confusing

Issue - State: open - Opened by Atemu about 1 year ago - 3 comments
Labels: bug, need test file

#1137 - Complete train wreck of a PDF, trying to OCR rotated.

Issue - State: open - Opened by pinballelectronica about 1 year ago - 1 comment
Labels: bug

#1136 - [Bug]: Docker build fails

Issue - State: closed - Opened by zaphoodb about 1 year ago - 1 comment
Labels: bug

#1135 - Remove Unused Dependency: Deprecation

Pull Request - State: closed - Opened by gdrosos about 1 year ago - 2 comments

#1134 - Add installation instructions for Gentoo Linux to README.md

Pull Request - State: closed - Opened by fonic about 1 year ago - 2 comments

#1133 - [Bug]: Is jbig2 encoder being called?

Issue - State: closed - Opened by gtusr about 1 year ago - 9 comments
Labels: bug

#1132 - Enables creation of a release and uploading the build assets to it

Pull Request - State: closed - Opened by stumpylog about 1 year ago - 2 comments

#1131 - OCRmyPDF appends a space to each text element at the end of the line

Issue - State: closed - Opened by gowallasnewpony about 1 year ago - 6 comments
Labels: bug

#1129 - [Feature]: Add Gentoo Linux to section 'Installation' of README.md

Issue - State: closed - Opened by fonic about 1 year ago - 2 comments
Labels: enhancement

#1128 - [Feature]: Distribute on Scoop?

Issue - State: closed - Opened by ShadowCreator250 about 1 year ago - 1 comment
Labels: enhancement

#1127 - [Feature]: Switch to remove images?

Issue - State: closed - Opened by pinballelectronica about 1 year ago - 2 comments
Labels: enhancement

#1126 - [Feature]: Remove images of text recognized

Issue - State: closed - Opened by Kimi-Arthur about 1 year ago - 1 comment
Labels: enhancement

#1125 - [Feature]: test

Issue - State: closed - Opened by jbarlow83 about 1 year ago
Labels: enhancement

#1123 - does ocrmypdf create an invisible text layer?

Issue - State: closed - Opened by lbr991 about 1 year ago - 11 comments

#1122 - Confused about --unpaper-args

Issue - State: closed - Opened by al1coch over 1 year ago - 4 comments
Labels: bug

#1121 - [Feature]: Parameter to automatically remove blank pages

Issue - State: closed - Opened by GrabbenD over 1 year ago - 2 comments

#1120 - orcmypdf not working in HTML/browser

Issue - State: closed - Opened by Prabal1902 over 1 year ago - 4 comments
Labels: bug

#1119 - [Bug]: Can not transfer image into editable text in pdf

Issue - State: closed - Opened by ericosmic over 1 year ago - 1 comment
Labels: bug

#1118 - [Bug]: PDF/A-3B files generated with a widely used commercial encoder generate garbage OCR content

Issue - State: closed - Opened by jce-zz over 1 year ago - 20 comments
Labels: bug

#1117 - Allow title, subject, author, and keywords to be unset with an empty string argument

Pull Request - State: closed - Opened by f-hansen over 1 year ago - 1 comment

#1116 - [Bug]: Problem when OCR heavy PDFs - freezes at 0%

Issue - State: closed - Opened by dariofilipe over 1 year ago - 2 comments
Labels: bug

#1115 - Problem when OCR heavy PDFs - freezes at 0%

Issue - State: closed - Opened by dariofilipe over 1 year ago - 1 comment

#1114 - do OCR if text boxs of minimum 15

Pull Request - State: closed - Opened by pkrsreddy over 1 year ago - 2 comments

#1113 - Fix randomly ordered languages from set()

Pull Request - State: closed - Opened by abwiersma over 1 year ago

#1112 - [Bug]: Inconsistent language order in tesseract calls

Issue - State: closed - Opened by abwiersma over 1 year ago - 2 comments
Labels: bug

#1111 - [Feature]: just curious/wondering about Tesseract 5 support

Issue - State: closed - Opened by alejohern over 1 year ago - 1 comment
Labels: enhancement

#1110 - [Feature]: OCR on pages with multiple text rotations

Issue - State: open - Opened by matthuszagh over 1 year ago - 2 comments
Labels: enhancement

#1107 - Would be nice to be able to choose the temporary directory

Issue - State: closed - Opened by al1coch over 1 year ago - 4 comments

#1106 - Support for PDF-A/4

Issue - State: open - Opened by rafaelfcmaria over 1 year ago - 1 comment
Labels: enhancement, third party issue

#1105 - OCRmyPDF not rotating the file correctly using the version 14.2.1

Issue - State: closed - Opened by gilsonbergamine over 1 year ago - 1 comment

#1104 - [BUG] 'DecompressionBombError' on a ACM PDF - need resolution limit on high DPI

Issue - State: closed - Opened by gwern over 1 year ago - 7 comments

#1103 - [BUG] Bold font in PDF is replaced by black bars

Issue - State: closed - Opened by tobox over 1 year ago - 2 comments

#1102 - [BUG] ghostscript fails due to small resolution value

Issue - State: open - Opened by neurolabs over 1 year ago - 3 comments

#1101 - How to get the deskew angle

Issue - State: closed - Opened by GoN49 over 1 year ago - 1 comment

#1100 - Replace text from original PDF with OCR'd Text

Issue - State: closed - Opened by FrancisBaileyH over 1 year ago - 2 comments

#1098 - Remove image layer after OCR?

Issue - State: closed - Opened by Frooodle over 1 year ago - 2 comments

#1097 - WSL support

Issue - State: closed - Opened by pinballelectronica over 1 year ago - 3 comments

#1095 - [BUG] deletes most of a page

Issue - State: closed - Opened by gwern over 1 year ago - 3 comments

#1094 - Feature Request: Provide for downloading of language models

Issue - State: closed - Opened by simsong over 1 year ago - 1 comment

#1093 - Feature Request: Provide for usage with cloud-based OCR engines

Issue - State: closed - Opened by simsong over 1 year ago - 4 comments

#1092 - How to handle already ocred files efficiently?

Issue - State: closed - Opened by drnicolas over 1 year ago - 1 comment

#1091 - [HELP] Inconsistent Reading order

Issue - State: closed - Opened by emtee14 over 1 year ago - 2 comments

#1090 - Snap package shouldn't ship all of the Tesseract OCR language files

Issue - State: open - Opened by brlin-tw over 1 year ago - 1 comment
Labels: help wanted

#1089 - Fix snap package building (#1082)

Pull Request - State: closed - Opened by brlin-tw over 1 year ago - 3 comments

#1088 - [BUG] #addopts = pytest -n "auto" no option?

Issue - State: closed - Opened by shaynababe over 1 year ago - 2 comments

#1087 - Fix typos

Pull Request - State: closed - Opened by kianmeng over 1 year ago - 1 comment

#1086 - Only generate text files without generating PDF files

Issue - State: open - Opened by rodrigomorales1 over 1 year ago - 11 comments

#1085 - Use Github Releases for notifications

Issue - State: closed - Opened by fabiante over 1 year ago - 2 comments

#1084 - ocrmypdf generating white patch in output pdf?

Issue - State: closed - Opened by gogineniravikumar over 1 year ago - 1 comment

#1083 - Improve PDF rasterisation safety

Pull Request - State: closed - Opened by sihil over 1 year ago - 1 comment

#1082 - [BUG] Snap Package not Working

Issue - State: closed - Opened by lhhel9l3 over 1 year ago - 6 comments

#1081 - Correct way to deskew PDF already processed by OCRmyPDF?

Issue - State: open - Opened by pimlottc over 1 year ago - 7 comments

#1080 - PDFs not created with fast web view

Issue - State: open - Opened by dklinger over 1 year ago - 1 comment

#1078 - [BUG] pikepdf warning about missing decoders

Issue - State: closed - Opened by ajweber over 1 year ago - 3 comments

#1077 - JBIG2 not legally secure in many countries

Issue - State: closed - Opened by dklinger over 1 year ago - 2 comments

#1076 - [BUG] PIL.Image.DecompressionBombError

Issue - State: closed - Opened by JohnLockeG over 1 year ago - 1 comment

#1075 - [BUG] crashes with `TypeError: 'NoneType' object is not subscriptable`

Issue - State: closed - Opened by frrad over 1 year ago - 1 comment

#1074 - [BUG] cannot ocr the numbers on left side of page

Issue - State: closed - Opened by sushmitxo over 1 year ago - 1 comment

#1073 - Optimize images with SMask

Issue - State: open - Opened by benbro over 1 year ago - 3 comments

#1072 - Use paddleocr instead of tesseract

Issue - State: closed - Opened by aymenmtibaa over 1 year ago - 1 comment

#1071 - Feature Request: GPU OCR pipeline e.g. via EasyOCR

Issue - State: closed - Opened by systemofapwne over 1 year ago - 4 comments
Labels: enhancement

#1070 - [BUG] Wrong optimize ratio and savings

Issue - State: closed - Opened by homocomputeris over 1 year ago - 6 comments

#1069 - [BUG] Possible to force OCR without losing vector data?

Issue - State: closed - Opened by moksamedia over 1 year ago - 2 comments

#1068 - Avoid deleting /dev/null when run as root

Pull Request - State: closed - Opened by jbarlow83 over 1 year ago

#1067 - [BUG] /dev/null gets deleted when run as root (inside a Docker container)

Issue - State: closed - Opened by andymwood over 1 year ago - 1 comment

#1066 - handle case when candidate is None

Pull Request - State: closed - Opened by frrad over 1 year ago - 1 comment

#1065 - [QUESTION] Render hocr with python

Issue - State: closed - Opened by jcuenod over 1 year ago - 2 comments

#1064 - tesseract-osd is also required on fedora

Pull Request - State: closed - Opened by white-gecko over 1 year ago

#1063 - added setting RETRIES_LOADING_FILE to watcher.py

Pull Request - State: closed - Opened by comzine over 1 year ago

#1062 - [BUG] tesseract returns SIGFPE Signal

Issue - State: closed - Opened by C0D3D3V over 1 year ago - 4 comments

#1060 - Error processing shell script on file

Issue - State: closed - Opened by danilichti over 1 year ago - 2 comments

#1059 - Allow title, subject, author, and keywords to be unset with an empty string argument

Pull Request - State: closed - Opened by f-hansen over 1 year ago - 3 comments