Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / ocrmypdf/OCRmyPDF issues and pull requests
#1275 - Fix Broken Documentation Links
Pull Request -
State: closed - Opened by danloveg 7 months ago
#1274 - Recommended settings for dealing with text superimposed on clipart?
Issue -
State: closed - Opened by MBYlt 7 months ago
- 1 comment
#1272 - [Bug]: Missing support for certain unicode characters
Issue -
State: open - Opened by vera-bernhard 7 months ago
- 3 comments
Labels: bug
#1271 - [Bug]: AttributeError: 'NoneType' object has no attribute 'get'
Issue -
State: closed - Opened by nikitar 7 months ago
Labels: bug
#1269 - [Bug]: "Corrupt JPEG data: premature end of data segment" with some files
Issue -
State: closed - Opened by macdeport 7 months ago
- 3 comments
Labels: user config
#1268 - Update Dockerfile.alpine
Pull Request -
State: closed - Opened by emielmolenaar 7 months ago
#1267 - [Bug]: Ghostscript PDF/A rendering failed
Issue -
State: closed - Opened by davide125 7 months ago
- 1 comment
Labels: bug
#1264 - [Bug]: dpi-problem with rasterizing text
Issue -
State: closed - Opened by JoKalliauer 7 months ago
- 5 comments
Labels: bug
#1263 - [Bug]: OCRmyPDF Docker Hot Folder Option OCR_ON_SUCCESS_ARCHIVE OCR_ON_SUCCESS_DELETE doesnt work
Issue -
State: open - Opened by mazi19 7 months ago
Labels: bug
#1262 - Error: jbig2 not found on path, even though installed
Issue -
State: closed - Opened by anaxonda 7 months ago
- 4 comments
Labels: user config, third party issue
#1261 - [Bug]: OCRmyPDF succeeded with warning(s): InputFileError: pdfminer could not process page 0
Issue -
State: closed - Opened by Markoise 7 months ago
- 1 comment
Labels: invalid, need test file, third party issue
#1260 - Fix entrypoint for docker commands
Pull Request -
State: closed - Opened by SirRegion 7 months ago
- 1 comment
#1259 - [Bug]: version confusion
Issue -
State: closed - Opened by branko623 7 months ago
- 1 comment
Labels: bug
#1258 - [Bug]: Watcher doesnt notice changes after update
Issue -
State: closed - Opened by Major2828 7 months ago
Labels: bug
#1257 - Handle PermissionError when finding tools
Pull Request -
State: closed - Opened by grembo 7 months ago
- 4 comments
#1256 - Trying to debug OCR_ON_SUCCESS_DELETE flag not being executed - add exit code to watcher.py?
Issue -
State: closed - Opened by wabarkley 7 months ago
- 2 comments
Labels: bug
#1255 - PDF-A produces lossy result
Issue -
State: closed - Opened by YutMarma 7 months ago
- 5 comments
#1253 - [Feature]: Support RapidOCR engine
Issue -
State: closed - Opened by saccohuo 7 months ago
- 1 comment
Labels: enhancement
#1252 - [Feature]: sidecar Support Text Output to io.StringIO()
Issue -
State: closed - Opened by MAbdElRaouf 8 months ago
Labels: enhancement
#1251 - [Bug]: OCRmyPDF not adding any text to document v 1.4
Issue -
State: closed - Opened by maxi07 8 months ago
- 1 comment
Labels: bug
#1250 - [Feature]: Integrations with other backends via hOcr (naive implementation of easyOcr backend inside)
Issue -
State: open - Opened by coffepowered 8 months ago
- 4 comments
Labels: enhancement
#1249 - [Documentation]: Upgrade via pip after system install needs a different command
Issue -
State: closed - Opened by dajare 8 months ago
- 1 comment
Labels: enhancement
#1248 - Update README.md
Pull Request -
State: closed - Opened by rudolphos 8 months ago
- 1 comment
#1247 - Bump codecov/codecov-action from 3 to 4
Pull Request -
State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies
#1246 - [Feature]: convert grayscale PDF to jbig monochrome while doing OCR
Issue -
State: closed - Opened by callegar 8 months ago
- 1 comment
Labels: enhancement
#1245 - [Bug]: installation failed due to ghostcript in-compatible version and can not upgraded ghostscript in Ubuntu 20.04
Issue -
State: closed - Opened by rohan-paul 8 months ago
- 1 comment
Labels: bug
#1244 - [Bug]: OCR on .pdf isn't the same as tesseract but the format is correct on .txt file
Issue -
State: open - Opened by matsumurae 8 months ago
Labels: bug
#1243 - [Feature]: Add support for docTR as alternate OCR backend?
Issue -
State: open - Opened by victorhooi 8 months ago
Labels: enhancement
#1241 - [Bug]: Unknown tesseract error, returns non-zero
Issue -
State: closed - Opened by nepomuc 8 months ago
- 1 comment
Labels: bug
#1240 - [Bug]: Memory access error if using a German terminal
Issue -
State: closed - Opened by Pete1976 8 months ago
- 2 comments
Labels: bug
#1237 - Doc suggestion: also great for just removing the text layer!
Issue -
State: closed - Opened by hmijail 8 months ago
- 1 comment
Labels: enhancement
#1236 - [Feature]: More Accessible Via Consistently connecting words to form sentences.
Issue -
State: closed - Opened by PiggiesGoSqueal 8 months ago
- 2 comments
Labels: enhancement
#1235 - [Feature]: Explain on the docs how to change the language of OCR on watcher.py
Issue -
State: closed - Opened by iohann95 8 months ago
- 1 comment
Labels: enhancement
#1232 - [Bug]: Conda - pikepdf is unavailable
Issue -
State: closed - Opened by kielbowicz 9 months ago
- 1 comment
Labels: bug
#1232 - [Bug]: Conda - pikepdf is unavailable
Issue -
State: closed - Opened by kielbowicz 9 months ago
- 1 comment
Labels: bug
#1231 - [Bug]: 'File not found' error in latest versions
Issue -
State: closed - Opened by templeman 9 months ago
- 4 comments
Labels: bug
#1231 - [Bug]: 'File not found' error in latest versions
Issue -
State: closed - Opened by templeman 9 months ago
- 3 comments
Labels: bug
#1230 - Add autotools automake libtool and leptonica requirements
Pull Request -
State: closed - Opened by maxi07 9 months ago
- 1 comment
#1230 - Add autotools automake libtool and leptonica requirements
Pull Request -
State: closed - Opened by maxi07 9 months ago
- 1 comment
#1229 - Minor english correction in Docs
Pull Request -
State: closed - Opened by Sapkotaanish 9 months ago
#1228 - Update gs dependency & instructions for RHEL
Pull Request -
State: closed - Opened by nisbet-hubbard 9 months ago
#1227 - [Bug]: Bunch of incomprehensible OCR content to delete
Issue -
State: closed - Opened by nicolas-75 9 months ago
- 3 comments
Labels: bug
#1226 - [Feature]: Only optimise file, skip OCR completely
Issue -
State: closed - Opened by Atrate 9 months ago
- 2 comments
Labels: enhancement
#1225 - [Bug]: RHEL 9 requires ghostscript 9.54 to work
Issue -
State: closed - Opened by nisbet-hubbard 9 months ago
- 6 comments
Labels: bug
#1223 - [Bug]: PDF graphics stack overflowed spec limit
Issue -
State: closed - Opened by Gedankenleser 9 months ago
- 1 comment
Labels: bug
#1222 - fixed a spelling mistake
Pull Request -
State: closed - Opened by Anthony-Nabil 9 months ago
#1221 - Thank you!
Issue -
State: open - Opened by zWhdmB5T 9 months ago
Labels: enhancement
#1220 - [Bug]: OCRmyPDF does not preserve existing XMP metadata
Issue -
State: open - Opened by jkorinth 9 months ago
- 3 comments
Labels: bug
#1219 - [Bug]: Package 'pngquant' not found, exists on PATH
Issue -
State: closed - Opened by dumoulinalex 9 months ago
- 1 comment
Labels: bug
#1218 - [Feature]: Are tesseract scripts supported?
Issue -
State: closed - Opened by eightfiftytwo 9 months ago
- 1 comment
Labels: enhancement
#1217 - allow resolution over ride that might improve text recognition etc
Pull Request -
State: closed - Opened by john-peterson 9 months ago
- 1 comment
#1216 - [Bug]: Every PDF I OCR has the text misaligned with the image
Issue -
State: closed - Opened by advert665 9 months ago
- 7 comments
Labels: bug
#1215 - [Bug]: Every PDF I OCR has the text misaligned with the image.
Issue -
State: closed - Opened by advert665 9 months ago
- 4 comments
Labels: bug
#1214 - [Bug]: Persian rendering and text positioning errors in 16.0.1 with new renderer
Issue -
State: closed - Opened by Rosti2022 9 months ago
- 1 comment
Labels: bug
#1213 - [Bug]: OCRmyPDF does not preserve existing XMP metadata
Issue -
State: closed - Opened by jkorinth 9 months ago
- 1 comment
Labels: bug
#1212 - [Bug]: OCR gets stuck
Issue -
State: closed - Opened by philmas 9 months ago
- 4 comments
Labels: bug, need test file
#1211 - [Bug]: NotImplementedError
Issue -
State: closed - Opened by gdandersson 9 months ago
- 10 comments
Labels: bug
#1210 - I scan the documents with my Brother MFC-L8690CDW and it worked until v15.4.4
Issue -
State: closed - Opened by knabed 9 months ago
- 1 comment
#1209 - [Bug]: complete letter salad
Issue -
State: closed - Opened by knabed 9 months ago
- 2 comments
Labels: need test file
#1208 - Bump actions/upload-artifact from 3 to 4
Pull Request -
State: closed - Opened by dependabot[bot] 10 months ago
- 2 comments
Labels: dependencies
#1207 - Bump actions/download-artifact from 3 to 4
Pull Request -
State: closed - Opened by dependabot[bot] 10 months ago
- 2 comments
Labels: dependencies
#1206 - Fix performance advice to match --fast-web-view documentation
Pull Request -
State: closed - Opened by Androbin 10 months ago
- 1 comment
#1205 - Bump actions/setup-python from 4 to 5
Pull Request -
State: closed - Opened by dependabot[bot] 10 months ago
- 2 comments
Labels: dependencies
#1204 - [Bug]: Accented characters not correct in PDF/A output
Issue -
State: closed - Opened by stumpylog 10 months ago
- 3 comments
Labels: bug
#1203 - [Feature]: Not to save images and opaque text
Issue -
State: closed - Opened by edwvee 10 months ago
- 1 comment
Labels: enhancement
#1201 - [Bug]: ocrmypdf invoked oom-killer
Issue -
State: closed - Opened by munzirtaha 10 months ago
- 4 comments
Labels: bug
#1200 - [Feature]: More details for exception ColorConversionNeededError
Issue -
State: closed - Opened by noseshimself 10 months ago
- 8 comments
Labels: enhancement
#1199 - [Bug]: Progress Bar is missing when running in Google Colab
Issue -
State: closed - Opened by Warborn123 10 months ago
- 1 comment
Labels: bug, help wanted
#1198 - [Bug]: Unable to install GhostScript using Winget
Issue -
State: closed - Opened by xd003 10 months ago
- 1 comment
Labels: bug
#1197 - [Bug]: No module named 'lxml'
Issue -
State: closed - Opened by tcurdt 10 months ago
- 1 comment
Labels: bug
#1196 - [Bug]: ImportError: cannot import name 'PDFTextSeq' from 'pdfminer.pdfdevice'
Issue -
State: closed - Opened by MimoGraphix 10 months ago
- 1 comment
Labels: bug
#1195 - [Bug]: incorrect file mode
Issue -
State: closed - Opened by svenha 10 months ago
- 3 comments
Labels: bug
#1193 - [Bug]: InputFileError
Issue -
State: closed - Opened by JoKalliauer 10 months ago
- 4 comments
Labels: bug
#1191 - [Bug]: sandwich renders differently than hocr
Issue -
State: closed - Opened by femifrak 11 months ago
- 2 comments
Labels: bug
#1190 - [Bug]: Centos7 Install OCRmyPdf run command failed [Help]
Issue -
State: closed - Opened by huxinghai 11 months ago
- 4 comments
Labels: bug
#1189 - [Bug]: User warning: missing specialized decoders (probably JBIG2)
Issue -
State: closed - Opened by femifrak 11 months ago
- 6 comments
Labels: bug
#1188 - [Feature]: enable use of Ghostscript glyph-level Unicode map generation
Issue -
State: open - Opened by jbarlow83 11 months ago
Labels: enhancement
#1187 - [Bug]: "Adobe Acrobat Reader" isn't able to open outputfile any more
Issue -
State: closed - Opened by JoKalliauer 11 months ago
- 5 comments
Labels: bug
#1186 - [Feature]: Support Azure Recognition Service
Issue -
State: open - Opened by hcoona 11 months ago
Labels: enhancement
#1185 - [Bug]: OCR_JSON_SETTINGS does not accept JSON string anymore
Issue -
State: closed - Opened by JohnDoe2991 11 months ago
Labels: bug
#1184 - [Bug]: [WinError 2] The system cannot find the file specified
Issue -
State: closed - Opened by MvCast 11 months ago
- 1 comment
Labels: bug
#1183 - [Bug]: watcher.py: execute_ocrmypdf() takes 0 positional arguments but 1 positional argument were given
Issue -
State: closed - Opened by Major2828 11 months ago
- 1 comment
Labels: bug
#1182 - Does OCRmyPDF break macOS "Live Text" feature?
Issue -
State: closed - Opened by rennefJ 11 months ago
- 2 comments
#1181 - TrimBox and CropBox not retained when "force OCR" is used
Issue -
State: open - Opened by jbarlow83 11 months ago
- 3 comments
Labels: bug
#1180 - [Question]: How to pass --skip-text to watcher.py in docker container?
Issue -
State: closed - Opened by dolorosus 11 months ago
- 2 comments
Labels: bug
#1179 - [Bug]: "inplace" + --skip-text on PDF with only text modifies / outputs a file
Issue -
State: closed - Opened by jrz 11 months ago
- 4 comments
Labels: bug
#1177 - Is building OCRmyPDF and it's dependencies in a docker environment less performant?
Issue -
State: closed - Opened by wadeflash12 11 months ago
- 1 comment
Labels: bug
#1176 - [Bug]: --version gives 0.0.0 on Ubuntu snap
Issue -
State: open - Opened by pseudomonas 11 months ago
- 5 comments
Labels: bug, snapcraft
#1175 - How can I pass tesseract argument " -c preserve_interword_spaces 1" ?
Issue -
State: closed - Opened by languagemaniac 11 months ago
- 4 comments
Labels: bug, need test file
#1174 - [Bug]: Ghostscript process fails when running OCRmyPDF on all PDFs
Issue -
State: closed - Opened by eoinosullivan 11 months ago
- 5 comments
Labels: bug
#1173 - Correct the archive dir name in `Watched folders with Docker`
Pull Request -
State: closed - Opened by mflagg2814 12 months ago
#1172 - Handle OSError exceptions in watcher.py
Pull Request -
State: closed - Opened by mflagg2814 12 months ago
- 1 comment
#1169 - [Bug]: PDF expands to 1.5G from 14M
Issue -
State: closed - Opened by alephpiece 12 months ago
- 2 comments
Labels: bug
#1168 - [Bug] [Help Needed] Command "ocrmypdf" not found [Windows 11]
Issue -
State: closed - Opened by Alaiya 12 months ago
- 2 comments
Labels: user config
#1167 - [Bug]: ocrmypdf v15.1.0+git8.2b0e1498 (snap): GPL Ghostscript 9.55.0: Can't find initialization file gs_init.ps. ghostscript.py:118
Issue -
State: closed - Opened by zWhdmB5T 12 months ago
- 19 comments
Labels: bug, help wanted, third party issue, snapcraft
#1166 - How can I remove extra space between every characters
Issue -
State: closed - Opened by hhiyorimi 12 months ago
- 1 comment
Labels: enhancement
#1164 - [Bug]: Some PDFs are blank in macOS Safari and Preview
Issue -
State: open - Opened by rezafouladian 12 months ago
- 2 comments
Labels: bug, third party issue
#1163 - Delete or prune StructTreeRoot for --force-ocr/--redo-ocr/--skip-text and post warnings
Issue -
State: open - Opened by jbarlow83 12 months ago
#1162 - [Bug]: Error: /typecheck in --runpdf--
Issue -
State: closed - Opened by muramasatheninja 12 months ago
- 3 comments
Labels: third party issue
#1161 - Add custom deskew and page rotation logic before OCR
Issue -
State: open - Opened by wadeflash12 12 months ago
- 8 comments