Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / fritz-hh/ocrmypdf issues and pull requests
#124 - Pagesegmode for tesseract
Issue -
State: closed - Opened by tuxasus almost 9 years ago
#123 - Please submit issues to jbarlow83/OCRmyPDF
Issue -
State: open - Opened by jbarlow83 almost 9 years ago
#122 - highlight the word
Pull Request -
State: closed - Opened by JBragon about 9 years ago
#121 - need scanning recommendation (b/w or grey)
Issue -
State: closed - Opened by femifrak about 9 years ago
- 3 comments
#120 - Some input PDFs with Tesseract-OCR throw error in PyPDF2.utils.PdfReadError(Unexpected escaped string: b'{')' raised in ...
Issue -
State: closed - Opened by Wikinaut about 9 years ago
- 4 comments
#119 - How do you like the idea to start program documentation in the Github Wiki ...
Issue -
State: closed - Opened by Wikinaut about 9 years ago
- 2 comments
#118 - v3.rc8 fails with pdf not recognized
Issue -
State: closed - Opened by MASantos about 9 years ago
- 5 comments
#117 - OCRmyPDF-2.x$ does not pass -l language flag correctly + Resulting size of pdf is far to big
Issue -
State: closed - Opened by lowmaster about 9 years ago
- 1 comment
#115 - OS X Yosemite Install: No such file or directory: 'requirements.txt'
Issue -
State: closed - Opened by frhd about 9 years ago
- 2 comments
#114 - Document Python API?
Issue -
State: closed - Opened by mlissner about 9 years ago
- 1 comment
#113 - Update installation for 3.0.x
Issue -
State: closed - Opened by AlainJanssens about 9 years ago
- 6 comments
#112 - Installation issues. i) root required ii) pip3 install -e . fails because it cannot find tesseract
Issue -
State: closed - Opened by Wikinaut about 9 years ago
- 10 comments
#111 - OCRMyPDF - AttributeError: 'ArrayObject' object has no attribute 'getData' #220
Issue -
State: closed - Opened by tuxasus over 9 years ago
- 15 comments
Labels: bug
#110 - No output pdf file
Issue -
State: closed - Opened by subascha over 9 years ago
- 4 comments
#109 - Apparent Escaping Issue with Basepath Dirname/Python Call
Issue -
State: closed - Opened by majascules over 9 years ago
- 3 comments
#108 - a bit off topic: pdf/a after merging
Issue -
State: closed - Opened by femifrak over 9 years ago
- 2 comments
#107 - Error with text (and/or annotation) on first page - no output file
Issue -
State: closed - Opened by twigbranch over 9 years ago
- 6 comments
#106 - Spell check with aspell
Issue -
State: closed - Opened by witchi over 9 years ago
- 5 comments
Labels: enhancement
#105 - get ocr'ed only
Issue -
State: closed - Opened by ilay32 over 9 years ago
- 1 comment
#104 - Raw image to OCRmyPDF
Issue -
State: closed - Opened by geaplanet over 9 years ago
- 7 comments
#103 - Poor OCR Results
Issue -
State: closed - Opened by Manuel-J almost 10 years ago
- 5 comments
#102 - Feedback on Debian Wheezy
Issue -
State: closed - Opened by saintger almost 10 years ago
- 1 comment
#101 - Createt $curHocr is named bad
Issue -
State: closed - Opened by kreditorro almost 10 years ago
- 3 comments
#100 - bug when OCRing German "et cetera" ("&c.")
Issue -
State: closed - Opened by femifrak almost 10 years ago
#99 - dependecy problem reportlab - allthough installed...
Issue -
State: closed - Opened by andreasotto about 10 years ago
- 25 comments
#98 - Option to remove blank pages
Issue -
State: closed - Opened by drdownload about 10 years ago
- 6 comments
#97 - Script asks me to install Python 2.x although I have Python 2.7.5 installed
Issue -
State: closed - Opened by juancarlosfarah about 10 years ago
- 4 comments
Labels: robustness
#96 - Write test cases
Issue -
State: closed - Opened by fritz-hh about 10 years ago
Labels: robustness
#95 - Could not concatenate all pages to the final PDF/A file
Issue -
State: closed - Opened by kreditorro about 10 years ago
- 3 comments
Labels: robustness
#94 - rewrite ocrmypdf in python 3.4
Issue -
State: closed - Opened by fritz-hh about 10 years ago
- 17 comments
Labels: clean-up
#93 - add a cmd line switch to generate a txt file to along with the pdf
Issue -
State: closed - Opened by fritz-hh about 10 years ago
- 1 comment
Labels: enhancement
#92 - Debian > Tesseract up-to-date not recognized
Issue -
State: closed - Opened by gmarchand about 10 years ago
- 4 comments
#91 - hocrtransform.py: remove patch for handling of grayscale and 1bit depth images
Issue -
State: closed - Opened by fritz-hh about 10 years ago
- 3 comments
Labels: clean-up
#90 - ocrmypdf with incrontab / inotify
Issue -
State: closed - Opened by segro21 about 10 years ago
- 3 comments
Labels: question
#89 - Fix call to readlink on OS X
Pull Request -
State: closed - Opened by jbarlow83 about 10 years ago
- 2 comments
#87 - Ubuntu 14.04 Could not create PDF file from "/tmp
Issue -
State: closed - Opened by HelFANS about 10 years ago
- 1 comment
#86 - typo in Release_Notes.md
Issue -
State: closed - Opened by unsermanninchina about 10 years ago
- 1 comment
#85 - Tesseract 3.03-rc1 and newer git versions have basic integrated(!) mixed-mode single-page PDF rendering support
Issue -
State: closed - Opened by Wikinaut about 10 years ago
- 5 comments
#84 - Can't open /usr/local/bin/src/config.sh
Issue -
State: closed - Opened by kreditorro about 10 years ago
#83 - small changes to make this work on Ubuntu 12.04 called via symlink
Pull Request -
State: closed - Opened by DorianScholz about 10 years ago
- 1 comment
#82 - Fixed typo
Pull Request -
State: closed - Opened by orbitcowboy about 10 years ago
- 1 comment
#81 - fixed tipo ghostcript to ghostscript
Pull Request -
State: closed - Opened by MoritzFago about 10 years ago
- 1 comment
#80 - Make OCRmyPDF.sh symlink compatible
Pull Request -
State: closed - Opened by eMPee584 about 10 years ago
- 2 comments
#79 - Make OCRmyPDF.sh symlink compatible
Issue -
State: closed - Opened by eMPee584 about 10 years ago
- 3 comments
Labels: enhancement
#78 - original images not kept unaltered
Issue -
State: closed - Opened by femifrak over 10 years ago
- 6 comments
Labels: enhancement
#77 - Fixed typo in help text
Pull Request -
State: closed - Opened by andysigner over 10 years ago
#76 - AttributeError: 'module' object has no attribute 'useA85'
Issue -
State: closed - Opened by faulpaul over 10 years ago
- 3 comments
Labels: bug
#75 - problem with unpaper
Issue -
State: closed - Opened by femifrak over 10 years ago
- 2 comments
#74 - no output possible: stat für »/tmp..../hocr.html“ is not possible: file or folder not found)
Issue -
State: closed - Opened by brainer84 over 10 years ago
- 5 comments
#73 - Fixed typo in import of reportlab.
Pull Request -
State: closed - Opened by achrist42 over 10 years ago
- 1 comment
#72 - Fehler beim Erzeugen des PDFs aus .hocr
Issue -
State: closed - Opened by gitmaster2013 over 10 years ago
- 7 comments
Labels: robustness
#71 - hocrTransform.py: Fixes execution in arch linux
Pull Request -
State: closed - Opened by dreuter over 10 years ago
- 1 comment
#70 - output file much bigger (7x), because not original embedded image files copied
Issue -
State: closed - Opened by alphablue52 over 10 years ago
- 2 comments
Labels: enhancement
#69 - Add command line option to skip pages that contain font data
Pull Request -
State: closed - Opened by jbarlow83 almost 11 years ago
- 1 comment
#68 - Check for missing pdftoppm depending on how poppler-utils is installed
Pull Request -
State: closed - Opened by jbarlow83 almost 11 years ago
#67 - Suppress the GNU parallel nag screen if the user has not yet done so
Pull Request -
State: closed - Opened by jbarlow83 almost 11 years ago
- 1 comment
#66 - disconnected words
Issue -
State: closed - Opened by ghost almost 11 years ago
- 1 comment
#65 - Create a Python front-end for OCRmyPDF.sh
Issue -
State: closed - Opened by jbarlow83 almost 11 years ago
- 2 comments
Labels: enhancement
#64 - weird text order
Issue -
State: closed - Opened by femifrak almost 11 years ago
- 4 comments
Labels: robustness
#63 - -C config file leads to exit
Issue -
State: closed - Opened by femifrak almost 11 years ago
- 9 comments
#62 - Reduce pdf size for pages containing several images
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
- 1 comment
Labels: enhancement
#61 - Dramatically improve deskew performance with leptonica
Pull Request -
State: closed - Opened by jbarlow83 almost 11 years ago
- 3 comments
#60 - Check if language provided to tesseract through -l option exists
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
Labels: robustness
#59 - Some changes to make it run on my machine
Pull Request -
State: closed - Opened by oxplot almost 11 years ago
- 1 comment
Labels: robustness
#58 - Do not use tesseract config file to prevent ligature detection
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
Labels: enhancement
#58 - Do not use tesseract config file to prevent ligature detection
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
Labels: enhancement
#57 - Fix temporary folder name generation collisions
Pull Request -
State: closed - Opened by jbarlow83 almost 11 years ago
- 2 comments
#56 - Fix AttributeError on self.width if Tesseract finds no OCR text
Pull Request -
State: closed - Opened by jbarlow83 almost 11 years ago
- 1 comment
#55 - Verify that pdftoppm is the Poppler version, not xpdf version
Pull Request -
State: closed - Opened by jbarlow83 almost 11 years ago
- 1 comment
#54 - Migrate to python3 once reportlab available
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
- 1 comment
Labels: enhancement
#53 - Packaging and automatic dependency resolution
Issue -
State: closed - Opened by Treehopper almost 11 years ago
- 7 comments
Labels: enhancement
#52 - a lot of errors when trying to use with Ubuntu 13.04 (Raring)
Issue -
State: closed - Opened by htc1977 almost 11 years ago
- 6 comments
#51 - Missing dependency checks
Issue -
State: closed - Opened by guptamp almost 11 years ago
- 16 comments
Labels: robustness
#50 - Generate smaller PDFs with monochrome and grayscale images where possible
Pull Request -
State: closed - Opened by jbarlow83 almost 11 years ago
- 3 comments
Labels: enhancement
#49 - Tell script to exit with an error if a variable is not set
Pull Request -
State: closed - Opened by jbarlow83 almost 11 years ago
- 3 comments
Labels: robustness
#48 - Fix some errors running on OSX 10.9 with Homebrew toolchain
Pull Request -
State: closed - Opened by jbarlow83 almost 11 years ago
- 4 comments
#47 - fixes issue with empty FORCE_OCR parameter
Pull Request -
State: closed - Opened by Treehopper almost 11 years ago
- 8 comments
#46 - Auto correct image rotation (-180, -90, 0, +90)
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
- 3 comments
Labels: enhancement
#45 - Fine tune word vertical Placement and vertical size
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
- 4 comments
Labels: enhancement
#44 - Simplify code for dpi computation
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
Labels: enhancement
#43 - Bashism: "local" builtin does not exist in sh
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
- 1 comment
Labels: bug
#42 - Syntax error in shell script OCRmyPDF.sh
Issue -
State: closed - Opened by ossk almost 11 years ago
- 2 comments
#41 - Warn if using tesseract older than 3.02.02
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
Labels: robustness
#40 - Improve structure of tmp files
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
Labels: enhancement
#39 - OCR fails"Could not OCR file"...". Exiting...
Issue -
State: closed - Opened by sch82812121 almost 11 years ago
- 5 comments
#38 - If there is a resolution mismatch, just warn the user, do not exit
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
- 2 comments
Labels: robustness
#37 - Warn the user if the resolution is too low
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
- 1 comment
Labels: enhancement
#36 - Handle PDF Files that contain more than 1 image per page
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
- 1 comment
Labels: enhancement
#35 - Echo version of the used dependencies in debug mode
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
Labels: enhancement
#34 - Script crashes in case input file name contains a "#" character
Issue -
State: closed - Opened by fritz-hh almost 11 years ago
Labels: bug
#33 - No page dimension found in the hocr file
Issue -
State: closed - Opened by cM6k almost 11 years ago
- 8 comments
Labels: robustness
#32 - Bash syntax used in ocrPage.sh but interpreter is /bin/sh
Issue -
State: closed - Opened by ncraun over 11 years ago
- 1 comment
#31 - Crash if PDF file to OCR contains spaces
Issue -
State: closed - Opened by fritz-hh over 11 years ago
Labels: bug
#30 - Final PDF does not comply to PDF/A-1
Issue -
State: closed - Opened by fritz-hh over 11 years ago
- 1 comment
Labels: bug
#29 - JHove conf contains absolute path of my installation
Issue -
State: closed - Opened by fritz-hh over 11 years ago
Labels: bug
#28 - invalid utf-8 encoding hocr
Issue -
State: closed - Opened by felixhayashi over 11 years ago
- 5 comments
Labels: robustness
#27 - sed invalid option
Issue -
State: closed - Opened by felixhayashi over 11 years ago
- 6 comments
Labels: bug
#26 - In debug mode: compute and echo time required for processing
Issue -
State: closed - Opened by fritz-hh over 11 years ago
Labels: enhancement
#25 - Resolutions (x/y) that are nearly equal are not supported
Issue -
State: closed - Opened by fritz-hh over 11 years ago
Labels: robustness