Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / ocrmypdf/OCRmyPDF issues and pull requests

#1275 - Fix Broken Documentation Links

Pull Request - State: closed - Opened by danloveg 7 months ago

#1274 - Recommended settings for dealing with text superimposed on clipart?

Issue - State: closed - Opened by MBYlt 7 months ago - 1 comment

#1272 - [Bug]: Missing support for certain unicode characters

Issue - State: open - Opened by vera-bernhard 7 months ago - 3 comments
Labels: bug

#1271 - [Bug]: AttributeError: 'NoneType' object has no attribute 'get'

Issue - State: closed - Opened by nikitar 7 months ago
Labels: bug

#1269 - [Bug]: "Corrupt JPEG data: premature end of data segment" with some files

Issue - State: closed - Opened by macdeport 7 months ago - 3 comments
Labels: user config

#1268 - Update Dockerfile.alpine

Pull Request - State: closed - Opened by emielmolenaar 7 months ago

#1267 - [Bug]: Ghostscript PDF/A rendering failed

Issue - State: closed - Opened by davide125 7 months ago - 1 comment
Labels: bug

#1264 - [Bug]: dpi-problem with rasterizing text

Issue - State: closed - Opened by JoKalliauer 7 months ago - 5 comments
Labels: bug

#1262 - Error: jbig2 not found on path, even though installed

Issue - State: closed - Opened by anaxonda 7 months ago - 4 comments
Labels: user config, third party issue

#1261 - [Bug]: OCRmyPDF succeeded with warning(s): InputFileError: pdfminer could not process page 0

Issue - State: closed - Opened by Markoise 7 months ago - 1 comment
Labels: invalid, need test file, third party issue

#1260 - Fix entrypoint for docker commands

Pull Request - State: closed - Opened by SirRegion 7 months ago - 1 comment

#1259 - [Bug]: version confusion

Issue - State: closed - Opened by branko623 7 months ago - 1 comment
Labels: bug

#1258 - [Bug]: Watcher doesnt notice changes after update

Issue - State: closed - Opened by Major2828 7 months ago
Labels: bug

#1257 - Handle PermissionError when finding tools

Pull Request - State: closed - Opened by grembo 7 months ago - 4 comments

#1255 - PDF-A produces lossy result

Issue - State: closed - Opened by YutMarma 7 months ago - 5 comments

#1253 - [Feature]: Support RapidOCR engine

Issue - State: closed - Opened by saccohuo 7 months ago - 1 comment
Labels: enhancement

#1252 - [Feature]: sidecar Support Text Output to io.StringIO()

Issue - State: closed - Opened by MAbdElRaouf 8 months ago
Labels: enhancement

#1251 - [Bug]: OCRmyPDF not adding any text to document v 1.4

Issue - State: closed - Opened by maxi07 8 months ago - 1 comment
Labels: bug

#1249 - [Documentation]: Upgrade via pip after system install needs a different command

Issue - State: closed - Opened by dajare 8 months ago - 1 comment
Labels: enhancement

#1248 - Update README.md

Pull Request - State: closed - Opened by rudolphos 8 months ago - 1 comment

#1247 - Bump codecov/codecov-action from 3 to 4

Pull Request - State: closed - Opened by dependabot[bot] 8 months ago
Labels: dependencies

#1246 - [Feature]: convert grayscale PDF to jbig monochrome while doing OCR

Issue - State: closed - Opened by callegar 8 months ago - 1 comment
Labels: enhancement

#1243 - [Feature]: Add support for docTR as alternate OCR backend?

Issue - State: open - Opened by victorhooi 8 months ago
Labels: enhancement

#1241 - [Bug]: Unknown tesseract error, returns non-zero

Issue - State: closed - Opened by nepomuc 8 months ago - 1 comment
Labels: bug

#1240 - [Bug]: Memory access error if using a German terminal

Issue - State: closed - Opened by Pete1976 8 months ago - 2 comments
Labels: bug

#1237 - Doc suggestion: also great for just removing the text layer!

Issue - State: closed - Opened by hmijail 8 months ago - 1 comment
Labels: enhancement

#1236 - [Feature]: More Accessible Via Consistently connecting words to form sentences.

Issue - State: closed - Opened by PiggiesGoSqueal 8 months ago - 2 comments
Labels: enhancement

#1235 - [Feature]: Explain on the docs how to change the language of OCR on watcher.py

Issue - State: closed - Opened by iohann95 8 months ago - 1 comment
Labels: enhancement

#1232 - [Bug]: Conda - pikepdf is unavailable

Issue - State: closed - Opened by kielbowicz 9 months ago - 1 comment
Labels: bug

#1232 - [Bug]: Conda - pikepdf is unavailable

Issue - State: closed - Opened by kielbowicz 9 months ago - 1 comment
Labels: bug

#1231 - [Bug]: 'File not found' error in latest versions

Issue - State: closed - Opened by templeman 9 months ago - 4 comments
Labels: bug

#1231 - [Bug]: 'File not found' error in latest versions

Issue - State: closed - Opened by templeman 9 months ago - 3 comments
Labels: bug

#1230 - Add autotools automake libtool and leptonica requirements

Pull Request - State: closed - Opened by maxi07 9 months ago - 1 comment

#1230 - Add autotools automake libtool and leptonica requirements

Pull Request - State: closed - Opened by maxi07 9 months ago - 1 comment

#1229 - Minor english correction in Docs

Pull Request - State: closed - Opened by Sapkotaanish 9 months ago

#1228 - Update gs dependency & instructions for RHEL

Pull Request - State: closed - Opened by nisbet-hubbard 9 months ago

#1227 - [Bug]: Bunch of incomprehensible OCR content to delete

Issue - State: closed - Opened by nicolas-75 9 months ago - 3 comments
Labels: bug

#1226 - [Feature]: Only optimise file, skip OCR completely

Issue - State: closed - Opened by Atrate 9 months ago - 2 comments
Labels: enhancement

#1225 - [Bug]: RHEL 9 requires ghostscript 9.54 to work

Issue - State: closed - Opened by nisbet-hubbard 9 months ago - 6 comments
Labels: bug

#1223 - [Bug]: PDF graphics stack overflowed spec limit

Issue - State: closed - Opened by Gedankenleser 9 months ago - 1 comment
Labels: bug

#1222 - fixed a spelling mistake

Pull Request - State: closed - Opened by Anthony-Nabil 9 months ago

#1221 - Thank you!

Issue - State: open - Opened by zWhdmB5T 9 months ago
Labels: enhancement

#1220 - [Bug]: OCRmyPDF does not preserve existing XMP metadata

Issue - State: open - Opened by jkorinth 9 months ago - 3 comments
Labels: bug

#1219 - [Bug]: Package 'pngquant' not found, exists on PATH

Issue - State: closed - Opened by dumoulinalex 9 months ago - 1 comment
Labels: bug

#1218 - [Feature]: Are tesseract scripts supported?

Issue - State: closed - Opened by eightfiftytwo 9 months ago - 1 comment
Labels: enhancement

#1217 - allow resolution over ride that might improve text recognition etc

Pull Request - State: closed - Opened by john-peterson 9 months ago - 1 comment

#1216 - [Bug]: Every PDF I OCR has the text misaligned with the image

Issue - State: closed - Opened by advert665 9 months ago - 7 comments
Labels: bug

#1215 - [Bug]: Every PDF I OCR has the text misaligned with the image.

Issue - State: closed - Opened by advert665 9 months ago - 4 comments
Labels: bug

#1214 - [Bug]: Persian rendering and text positioning errors in 16.0.1 with new renderer

Issue - State: closed - Opened by Rosti2022 9 months ago - 1 comment
Labels: bug

#1213 - [Bug]: OCRmyPDF does not preserve existing XMP metadata

Issue - State: closed - Opened by jkorinth 9 months ago - 1 comment
Labels: bug

#1212 - [Bug]: OCR gets stuck

Issue - State: closed - Opened by philmas 9 months ago - 4 comments
Labels: bug, need test file

#1211 - [Bug]: NotImplementedError

Issue - State: closed - Opened by gdandersson 9 months ago - 10 comments
Labels: bug

#1209 - [Bug]: complete letter salad

Issue - State: closed - Opened by knabed 9 months ago - 2 comments
Labels: need test file

#1208 - Bump actions/upload-artifact from 3 to 4

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago - 2 comments
Labels: dependencies

#1207 - Bump actions/download-artifact from 3 to 4

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago - 2 comments
Labels: dependencies

#1206 - Fix performance advice to match --fast-web-view documentation

Pull Request - State: closed - Opened by Androbin 10 months ago - 1 comment

#1205 - Bump actions/setup-python from 4 to 5

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago - 2 comments
Labels: dependencies

#1204 - [Bug]: Accented characters not correct in PDF/A output

Issue - State: closed - Opened by stumpylog 10 months ago - 3 comments
Labels: bug

#1203 - [Feature]: Not to save images and opaque text

Issue - State: closed - Opened by edwvee 10 months ago - 1 comment
Labels: enhancement

#1201 - [Bug]: ocrmypdf invoked oom-killer

Issue - State: closed - Opened by munzirtaha 10 months ago - 4 comments
Labels: bug

#1200 - [Feature]: More details for exception ColorConversionNeededError

Issue - State: closed - Opened by noseshimself 10 months ago - 8 comments
Labels: enhancement

#1199 - [Bug]: Progress Bar is missing when running in Google Colab

Issue - State: closed - Opened by Warborn123 10 months ago - 1 comment
Labels: bug, help wanted

#1198 - [Bug]: Unable to install GhostScript using Winget

Issue - State: closed - Opened by xd003 10 months ago - 1 comment
Labels: bug

#1197 - [Bug]: No module named 'lxml'

Issue - State: closed - Opened by tcurdt 10 months ago - 1 comment
Labels: bug

#1196 - [Bug]: ImportError: cannot import name 'PDFTextSeq' from 'pdfminer.pdfdevice'

Issue - State: closed - Opened by MimoGraphix 10 months ago - 1 comment
Labels: bug

#1195 - [Bug]: incorrect file mode

Issue - State: closed - Opened by svenha 10 months ago - 3 comments
Labels: bug

#1193 - [Bug]: InputFileError

Issue - State: closed - Opened by JoKalliauer 10 months ago - 4 comments
Labels: bug

#1191 - [Bug]: sandwich renders differently than hocr

Issue - State: closed - Opened by femifrak 11 months ago - 2 comments
Labels: bug

#1190 - [Bug]: Centos7 Install OCRmyPdf run command failed [Help]

Issue - State: closed - Opened by huxinghai 11 months ago - 4 comments
Labels: bug

#1189 - [Bug]: User warning: missing specialized decoders (probably JBIG2)

Issue - State: closed - Opened by femifrak 11 months ago - 6 comments
Labels: bug

#1188 - [Feature]: enable use of Ghostscript glyph-level Unicode map generation

Issue - State: open - Opened by jbarlow83 11 months ago
Labels: enhancement

#1187 - [Bug]: "Adobe Acrobat Reader" isn't able to open outputfile any more

Issue - State: closed - Opened by JoKalliauer 11 months ago - 5 comments
Labels: bug

#1186 - [Feature]: Support Azure Recognition Service

Issue - State: open - Opened by hcoona 11 months ago
Labels: enhancement

#1185 - [Bug]: OCR_JSON_SETTINGS does not accept JSON string anymore

Issue - State: closed - Opened by JohnDoe2991 11 months ago
Labels: bug

#1184 - [Bug]: [WinError 2] The system cannot find the file specified

Issue - State: closed - Opened by MvCast 11 months ago - 1 comment
Labels: bug

#1182 - Does OCRmyPDF break macOS "Live Text" feature?

Issue - State: closed - Opened by rennefJ 11 months ago - 2 comments

#1181 - TrimBox and CropBox not retained when "force OCR" is used

Issue - State: open - Opened by jbarlow83 11 months ago - 3 comments
Labels: bug

#1180 - [Question]: How to pass --skip-text to watcher.py in docker container?

Issue - State: closed - Opened by dolorosus 11 months ago - 2 comments
Labels: bug

#1179 - [Bug]: "inplace" + --skip-text on PDF with only text modifies / outputs a file

Issue - State: closed - Opened by jrz 11 months ago - 4 comments
Labels: bug

#1177 - Is building OCRmyPDF and it's dependencies in a docker environment less performant?

Issue - State: closed - Opened by wadeflash12 11 months ago - 1 comment
Labels: bug

#1176 - [Bug]: --version gives 0.0.0 on Ubuntu snap

Issue - State: open - Opened by pseudomonas 11 months ago - 5 comments
Labels: bug, snapcraft

#1175 - How can I pass tesseract argument " -c preserve_interword_spaces 1" ?

Issue - State: closed - Opened by languagemaniac 11 months ago - 4 comments
Labels: bug, need test file

#1174 - [Bug]: Ghostscript process fails when running OCRmyPDF on all PDFs

Issue - State: closed - Opened by eoinosullivan 11 months ago - 5 comments
Labels: bug

#1173 - Correct the archive dir name in `Watched folders with Docker`

Pull Request - State: closed - Opened by mflagg2814 12 months ago

#1172 - Handle OSError exceptions in watcher.py

Pull Request - State: closed - Opened by mflagg2814 12 months ago - 1 comment

#1169 - [Bug]: PDF expands to 1.5G from 14M

Issue - State: closed - Opened by alephpiece 12 months ago - 2 comments
Labels: bug

#1168 - [Bug] [Help Needed] Command "ocrmypdf" not found [Windows 11]

Issue - State: closed - Opened by Alaiya 12 months ago - 2 comments
Labels: user config

#1167 - [Bug]: ocrmypdf v15.1.0+git8.2b0e1498 (snap): GPL Ghostscript 9.55.0: Can't find initialization file gs_init.ps. ghostscript.py:118

Issue - State: closed - Opened by zWhdmB5T 12 months ago - 19 comments
Labels: bug, help wanted, third party issue, snapcraft

#1166 - How can I remove extra space between every characters

Issue - State: closed - Opened by hhiyorimi 12 months ago - 1 comment
Labels: enhancement

#1164 - [Bug]: Some PDFs are blank in macOS Safari and Preview

Issue - State: open - Opened by rezafouladian 12 months ago - 2 comments
Labels: bug, third party issue

#1162 - [Bug]: Error: /typecheck in --runpdf--

Issue - State: closed - Opened by muramasatheninja 12 months ago - 3 comments
Labels: third party issue

#1161 - Add custom deskew and page rotation logic before OCR

Issue - State: open - Opened by wadeflash12 12 months ago - 8 comments