Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / fritz-hh/ocrmypdf issues and pull requests

#124 - Pagesegmode for tesseract

Issue - State: closed - Opened by tuxasus almost 9 years ago

#123 - Please submit issues to jbarlow83/OCRmyPDF

Issue - State: open - Opened by jbarlow83 almost 9 years ago

#122 - highlight the word

Pull Request - State: closed - Opened by JBragon about 9 years ago

#121 - need scanning recommendation (b/w or grey)

Issue - State: closed - Opened by femifrak about 9 years ago - 3 comments

#119 - How do you like the idea to start program documentation in the Github Wiki ...

Issue - State: closed - Opened by Wikinaut about 9 years ago - 2 comments

#118 - v3.rc8 fails with pdf not recognized

Issue - State: closed - Opened by MASantos about 9 years ago - 5 comments

#115 - OS X Yosemite Install: No such file or directory: 'requirements.txt'

Issue - State: closed - Opened by frhd about 9 years ago - 2 comments

#114 - Document Python API?

Issue - State: closed - Opened by mlissner about 9 years ago - 1 comment

#113 - Update installation for 3.0.x

Issue - State: closed - Opened by AlainJanssens about 9 years ago - 6 comments

#111 - OCRMyPDF - AttributeError: 'ArrayObject' object has no attribute 'getData' #220

Issue - State: closed - Opened by tuxasus over 9 years ago - 15 comments
Labels: bug

#110 - No output pdf file

Issue - State: closed - Opened by subascha over 9 years ago - 4 comments

#109 - Apparent Escaping Issue with Basepath Dirname/Python Call

Issue - State: closed - Opened by majascules over 9 years ago - 3 comments

#108 - a bit off topic: pdf/a after merging

Issue - State: closed - Opened by femifrak over 9 years ago - 2 comments

#107 - Error with text (and/or annotation) on first page - no output file

Issue - State: closed - Opened by twigbranch over 9 years ago - 6 comments

#106 - Spell check with aspell

Issue - State: closed - Opened by witchi over 9 years ago - 5 comments
Labels: enhancement

#105 - get ocr'ed only

Issue - State: closed - Opened by ilay32 over 9 years ago - 1 comment

#104 - Raw image to OCRmyPDF

Issue - State: closed - Opened by geaplanet over 9 years ago - 7 comments

#103 - Poor OCR Results

Issue - State: closed - Opened by Manuel-J almost 10 years ago - 5 comments

#102 - Feedback on Debian Wheezy

Issue - State: closed - Opened by saintger almost 10 years ago - 1 comment

#101 - Createt $curHocr is named bad

Issue - State: closed - Opened by kreditorro almost 10 years ago - 3 comments

#100 - bug when OCRing German "et cetera" ("&c.")

Issue - State: closed - Opened by femifrak almost 10 years ago

#99 - dependecy problem reportlab - allthough installed...

Issue - State: closed - Opened by andreasotto about 10 years ago - 25 comments

#98 - Option to remove blank pages

Issue - State: closed - Opened by drdownload about 10 years ago - 6 comments

#97 - Script asks me to install Python 2.x although I have Python 2.7.5 installed

Issue - State: closed - Opened by juancarlosfarah about 10 years ago - 4 comments
Labels: robustness

#96 - Write test cases

Issue - State: closed - Opened by fritz-hh about 10 years ago
Labels: robustness

#95 - Could not concatenate all pages to the final PDF/A file

Issue - State: closed - Opened by kreditorro about 10 years ago - 3 comments
Labels: robustness

#94 - rewrite ocrmypdf in python 3.4

Issue - State: closed - Opened by fritz-hh about 10 years ago - 17 comments
Labels: clean-up

#93 - add a cmd line switch to generate a txt file to along with the pdf

Issue - State: closed - Opened by fritz-hh about 10 years ago - 1 comment
Labels: enhancement

#92 - Debian > Tesseract up-to-date not recognized

Issue - State: closed - Opened by gmarchand about 10 years ago - 4 comments

#91 - hocrtransform.py: remove patch for handling of grayscale and 1bit depth images

Issue - State: closed - Opened by fritz-hh about 10 years ago - 3 comments
Labels: clean-up

#90 - ocrmypdf with incrontab / inotify

Issue - State: closed - Opened by segro21 about 10 years ago - 3 comments
Labels: question

#89 - Fix call to readlink on OS X

Pull Request - State: closed - Opened by jbarlow83 about 10 years ago - 2 comments

#88 - MRC

Issue - State: closed - Opened by v217 about 10 years ago - 3 comments

#87 - Ubuntu 14.04 Could not create PDF file from "/tmp

Issue - State: closed - Opened by HelFANS about 10 years ago - 1 comment

#86 - typo in Release_Notes.md

Issue - State: closed - Opened by unsermanninchina about 10 years ago - 1 comment

#84 - Can't open /usr/local/bin/src/config.sh

Issue - State: closed - Opened by kreditorro about 10 years ago

#83 - small changes to make this work on Ubuntu 12.04 called via symlink

Pull Request - State: closed - Opened by DorianScholz about 10 years ago - 1 comment

#82 - Fixed typo

Pull Request - State: closed - Opened by orbitcowboy about 10 years ago - 1 comment

#81 - fixed tipo ghostcript to ghostscript

Pull Request - State: closed - Opened by MoritzFago about 10 years ago - 1 comment

#80 - Make OCRmyPDF.sh symlink compatible

Pull Request - State: closed - Opened by eMPee584 about 10 years ago - 2 comments

#79 - Make OCRmyPDF.sh symlink compatible

Issue - State: closed - Opened by eMPee584 about 10 years ago - 3 comments
Labels: enhancement

#78 - original images not kept unaltered

Issue - State: closed - Opened by femifrak over 10 years ago - 6 comments
Labels: enhancement

#77 - Fixed typo in help text

Pull Request - State: closed - Opened by andysigner over 10 years ago

#76 - AttributeError: 'module' object has no attribute 'useA85'

Issue - State: closed - Opened by faulpaul over 10 years ago - 3 comments
Labels: bug

#75 - problem with unpaper

Issue - State: closed - Opened by femifrak over 10 years ago - 2 comments

#73 - Fixed typo in import of reportlab.

Pull Request - State: closed - Opened by achrist42 over 10 years ago - 1 comment

#72 - Fehler beim Erzeugen des PDFs aus .hocr

Issue - State: closed - Opened by gitmaster2013 over 10 years ago - 7 comments
Labels: robustness

#71 - hocrTransform.py: Fixes execution in arch linux

Pull Request - State: closed - Opened by dreuter over 10 years ago - 1 comment

#70 - output file much bigger (7x), because not original embedded image files copied

Issue - State: closed - Opened by alphablue52 over 10 years ago - 2 comments
Labels: enhancement

#69 - Add command line option to skip pages that contain font data

Pull Request - State: closed - Opened by jbarlow83 almost 11 years ago - 1 comment

#68 - Check for missing pdftoppm depending on how poppler-utils is installed

Pull Request - State: closed - Opened by jbarlow83 almost 11 years ago

#67 - Suppress the GNU parallel nag screen if the user has not yet done so

Pull Request - State: closed - Opened by jbarlow83 almost 11 years ago - 1 comment

#66 - disconnected words

Issue - State: closed - Opened by ghost almost 11 years ago - 1 comment

#65 - Create a Python front-end for OCRmyPDF.sh

Issue - State: closed - Opened by jbarlow83 almost 11 years ago - 2 comments
Labels: enhancement

#64 - weird text order

Issue - State: closed - Opened by femifrak almost 11 years ago - 4 comments
Labels: robustness

#63 - -C config file leads to exit

Issue - State: closed - Opened by femifrak almost 11 years ago - 9 comments

#62 - Reduce pdf size for pages containing several images

Issue - State: closed - Opened by fritz-hh almost 11 years ago - 1 comment
Labels: enhancement

#61 - Dramatically improve deskew performance with leptonica

Pull Request - State: closed - Opened by jbarlow83 almost 11 years ago - 3 comments

#60 - Check if language provided to tesseract through -l option exists

Issue - State: closed - Opened by fritz-hh almost 11 years ago
Labels: robustness

#59 - Some changes to make it run on my machine

Pull Request - State: closed - Opened by oxplot almost 11 years ago - 1 comment
Labels: robustness

#58 - Do not use tesseract config file to prevent ligature detection

Issue - State: closed - Opened by fritz-hh almost 11 years ago
Labels: enhancement

#58 - Do not use tesseract config file to prevent ligature detection

Issue - State: closed - Opened by fritz-hh almost 11 years ago
Labels: enhancement

#57 - Fix temporary folder name generation collisions

Pull Request - State: closed - Opened by jbarlow83 almost 11 years ago - 2 comments

#56 - Fix AttributeError on self.width if Tesseract finds no OCR text

Pull Request - State: closed - Opened by jbarlow83 almost 11 years ago - 1 comment

#55 - Verify that pdftoppm is the Poppler version, not xpdf version

Pull Request - State: closed - Opened by jbarlow83 almost 11 years ago - 1 comment

#54 - Migrate to python3 once reportlab available

Issue - State: closed - Opened by fritz-hh almost 11 years ago - 1 comment
Labels: enhancement

#53 - Packaging and automatic dependency resolution

Issue - State: closed - Opened by Treehopper almost 11 years ago - 7 comments
Labels: enhancement

#52 - a lot of errors when trying to use with Ubuntu 13.04 (Raring)

Issue - State: closed - Opened by htc1977 almost 11 years ago - 6 comments

#51 - Missing dependency checks

Issue - State: closed - Opened by guptamp almost 11 years ago - 16 comments
Labels: robustness

#50 - Generate smaller PDFs with monochrome and grayscale images where possible

Pull Request - State: closed - Opened by jbarlow83 almost 11 years ago - 3 comments
Labels: enhancement

#49 - Tell script to exit with an error if a variable is not set

Pull Request - State: closed - Opened by jbarlow83 almost 11 years ago - 3 comments
Labels: robustness

#48 - Fix some errors running on OSX 10.9 with Homebrew toolchain

Pull Request - State: closed - Opened by jbarlow83 almost 11 years ago - 4 comments

#47 - fixes issue with empty FORCE_OCR parameter

Pull Request - State: closed - Opened by Treehopper almost 11 years ago - 8 comments

#46 - Auto correct image rotation (-180, -90, 0, +90)

Issue - State: closed - Opened by fritz-hh almost 11 years ago - 3 comments
Labels: enhancement

#45 - Fine tune word vertical Placement and vertical size

Issue - State: closed - Opened by fritz-hh almost 11 years ago - 4 comments
Labels: enhancement

#44 - Simplify code for dpi computation

Issue - State: closed - Opened by fritz-hh almost 11 years ago
Labels: enhancement

#43 - Bashism: "local" builtin does not exist in sh

Issue - State: closed - Opened by fritz-hh almost 11 years ago - 1 comment
Labels: bug

#42 - Syntax error in shell script OCRmyPDF.sh

Issue - State: closed - Opened by ossk almost 11 years ago - 2 comments

#41 - Warn if using tesseract older than 3.02.02

Issue - State: closed - Opened by fritz-hh almost 11 years ago
Labels: robustness

#40 - Improve structure of tmp files

Issue - State: closed - Opened by fritz-hh almost 11 years ago
Labels: enhancement

#39 - OCR fails"Could not OCR file"...". Exiting...

Issue - State: closed - Opened by sch82812121 almost 11 years ago - 5 comments

#38 - If there is a resolution mismatch, just warn the user, do not exit

Issue - State: closed - Opened by fritz-hh almost 11 years ago - 2 comments
Labels: robustness

#37 - Warn the user if the resolution is too low

Issue - State: closed - Opened by fritz-hh almost 11 years ago - 1 comment
Labels: enhancement

#36 - Handle PDF Files that contain more than 1 image per page

Issue - State: closed - Opened by fritz-hh almost 11 years ago - 1 comment
Labels: enhancement

#35 - Echo version of the used dependencies in debug mode

Issue - State: closed - Opened by fritz-hh almost 11 years ago
Labels: enhancement

#34 - Script crashes in case input file name contains a "#" character

Issue - State: closed - Opened by fritz-hh almost 11 years ago
Labels: bug

#33 - No page dimension found in the hocr file

Issue - State: closed - Opened by cM6k almost 11 years ago - 8 comments
Labels: robustness

#32 - Bash syntax used in ocrPage.sh but interpreter is /bin/sh

Issue - State: closed - Opened by ncraun over 11 years ago - 1 comment

#31 - Crash if PDF file to OCR contains spaces

Issue - State: closed - Opened by fritz-hh over 11 years ago
Labels: bug

#30 - Final PDF does not comply to PDF/A-1

Issue - State: closed - Opened by fritz-hh over 11 years ago - 1 comment
Labels: bug

#29 - JHove conf contains absolute path of my installation

Issue - State: closed - Opened by fritz-hh over 11 years ago
Labels: bug

#28 - invalid utf-8 encoding hocr

Issue - State: closed - Opened by felixhayashi over 11 years ago - 5 comments
Labels: robustness

#27 - sed invalid option

Issue - State: closed - Opened by felixhayashi over 11 years ago - 6 comments
Labels: bug

#26 - In debug mode: compute and echo time required for processing

Issue - State: closed - Opened by fritz-hh over 11 years ago
Labels: enhancement

#25 - Resolutions (x/y) that are nearly equal are not supported

Issue - State: closed - Opened by fritz-hh over 11 years ago
Labels: robustness