Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / tesseract-ocr/tesseract issues and pull requests

#3836 - Detect text rotation without running recognition

Issue - State: closed - Opened by Balearica over 2 years ago - 9 comments
Labels: feature request

#3826 - ImportError symbol not found in flat namespace '__ZN9tesseract11TessBaseAPID1Ev'

Issue - State: closed - Opened by lokkasl over 2 years ago - 5 comments
Labels: question

#3817 - Text2Image isn't working properly

Issue - State: open - Opened by Zacharymk1213 almost 3 years ago - 7 comments
Labels: text2image

#3808 - Pdf offset fix

Pull Request - State: closed - Opened by mcsjosh almost 3 years ago - 5 comments
Labels: PDF, 32-bit

#3807 - Q&A: Technical drawings OCR

Issue - State: closed - Opened by ViterAlex almost 3 years ago - 3 comments
Labels: question

#3805 - TessPDFRenderer uses "long int" to store PDF offsets, which is too small on some platforms

Issue - State: closed - Opened by mcsjosh almost 3 years ago - 6 comments
Labels: bug, PDF, 32-bit

#3787 - RFC: Improve positioning of symbol bounding boxes

Pull Request - State: open - Opened by p12tic almost 3 years ago - 4 comments
Labels: bounding box

#3767 - Compiled-in TESSDATA_PREFIX unused on Windows

Issue - State: closed - Opened by CSBVision almost 3 years ago - 11 comments
Labels: bug

#3763 - BCER eval displayed during lstmtraining and that from lstmeval are different

Issue - State: open - Opened by Shreeshrii almost 3 years ago - 11 comments
Labels: training

#3734 - Add a variable to set single pattern without config file

Issue - State: open - Opened by bo-bac about 3 years ago - 6 comments
Labels: feature request, API

#3731 - Build with IronOCR(Tesseract 5 engine) on Unity fails

Issue - State: closed - Opened by suzyrhkr about 3 years ago - 1 comment
Labels: 3rd party tool

#3709 - Add support for Unicode filenames on MS Windows

Issue - State: open - Opened by stweil about 3 years ago - 10 comments
Labels: feature request, unicode

#3693 - Using -l <language>+equ frequently causes floating point exceptions (core dumped)

Issue - State: open - Opened by callegar about 3 years ago - 15 comments
Labels: bug, equation detection

#3679 - 5.0.0: build fails

Issue - State: closed - Opened by kloczek about 3 years ago - 32 comments
Labels: build process, unit tests

#3673 - Plans for tesseract 5.x.y

Issue - State: open - Opened by amitdo about 3 years ago - 138 comments

#3655 - lstmtraining mutex lock failed

Issue - State: open - Opened by stweil over 3 years ago - 2 comments
Labels: bug, training, unexpected termination

#3614 - tosp_old_to_method: Inconsistent in different models

Issue - State: open - Opened by amitdo over 3 years ago - 6 comments
Labels: traineddata

#3610 - Fix issue #1073 (use default language only when necessary)

Pull Request - State: closed - Opened by stweil over 3 years ago - 1 comment
Labels: API

#3587 - How to Diagnose Overfitting and Underfitting of Tesseract Models?

Issue - State: closed - Opened by Mann1904 over 3 years ago - 2 comments
Labels: question, training

#3586 - Potential segmentation fault by wrong format string

Issue - State: open - Opened by autofuzzoss over 3 years ago - 4 comments
Labels: enhancement

#3515 - Crash in old CPU

Issue - State: closed - Opened by hereis00 over 3 years ago - 7 comments
Labels: unexpected termination

#3501 - Solve clang reporting unused variable in ExtractMicros function

Pull Request - State: closed - Opened by zdenop over 3 years ago - 4 comments

#3481 - Fix possible UB when accessing empty vector's data

Pull Request - State: open - Opened by nocun over 3 years ago - 4 comments

#3476 - Fix for LSTM Diplopia issue

Pull Request - State: open - Opened by woodjohndavid over 3 years ago - 27 comments

#3466 - Build failed for ARM64 Windows10

Issue - State: closed - Opened by stonerey over 3 years ago - 16 comments
Labels: build process, awaiting feedback, msvc

#3452 - API: Different results for the same image depending on the order in which the files are processed

Issue - State: closed - Opened by nagadomi over 3 years ago - 9 comments
Labels: bug, regression

#3435 - Support image width and height larger than 32767

Pull Request - State: open - Opened by stweil almost 4 years ago - 20 comments
Labels: enhancement

#3418 - Add more binarization options

Pull Request - State: closed - Opened by amitdo almost 4 years ago - 10 comments
Labels: leptonica, enhancement, binarization

#3406 - Adding --print-fonts-table parameter & tessedit_font_id configuration option

Pull Request - State: closed - Opened by Lucas-C almost 4 years ago - 5 comments
Labels: legacy

#3396 - fix clang cmake build on windows

Pull Request - State: closed - Opened by zdenop almost 4 years ago

#3386 - Tesseract does not generate .lstmf for some images

Issue - State: open - Opened by DavidHribek almost 4 years ago - 8 comments
Labels: bug

#3374 - use the Viewer to debug recognition failed

Issue - State: closed - Opened by martin-matj almost 4 years ago - 17 comments
Labels: bug

#3369 - tesseract process never finishes with specific gif image

Issue - State: open - Opened by wix-andriusb almost 4 years ago - 37 comments
Labels: bug, feature request, performance, leptonica, process hangs, binarization

#3216 - [VS2019] Linker error

Issue - State: closed - Opened by OgreTransporter about 4 years ago - 7 comments

#3184 - Maximum supported image size

Issue - State: open - Opened by MerlijnWajer about 4 years ago - 8 comments
Labels: feature request

#3149 - Lots of uppercase letters instead of lowercase

Issue - State: open - Opened by Masina86 over 4 years ago
Labels: accuracy, ambiguously

#3144 - Character confusion fix suggestion

Issue - State: open - Opened by EucliTs0 over 4 years ago - 44 comments
Labels: accuracy, diplopia

#3131 - Add Kubernetes and Helm Support

Issue - State: closed - Opened by bishtsaurabh5 over 4 years ago - 3 comments
Labels: feature request

#3115 - Tesseract failing to recognize very simple text from a clean image

Issue - State: closed - Opened by gbersac over 4 years ago - 1 comment
Labels: digits

#3109 - Running multiple tesseract instances in parallel is slower than running in serial

Issue - State: open - Opened by ole-tange over 4 years ago - 7 comments
Labels: feature request, performance, OpenMP

#3095 - Specify Custom Installation path in silent install

Issue - State: closed - Opened by jdfcio over 4 years ago - 3 comments
Labels: nsis

#3062 - Alpha 0 text on pdf output

Issue - State: closed - Opened by Maxime-J over 4 years ago - 3 comments
Labels: PDF, alpha channel

#3028 - Unable to detect simple math equations using pytessract

Issue - State: closed - Opened by NavpreetDevpuri over 4 years ago - 20 comments
Labels: wontfix, equation detection

#3021 - Tesseract Empty Page

Issue - State: open - Opened by M3ssman over 4 years ago - 42 comments
Labels: bug, bounding box, binarization

#3000 - Debugger Viewer doesn't work, can't connect to ScrollView server

Issue - State: closed - Opened by ellislau over 4 years ago - 3 comments

#2995 - tesseract doesn't recognize ISO639 code "zho" for chinese

Issue - State: open - Opened by Seegras over 4 years ago - 2 comments
Labels: feature request, traineddata

#2978 - NEON SIMD code.

Pull Request - State: closed - Opened by robinwatts almost 5 years ago - 28 comments
Labels: performance, SIMD, enhancement

#2970 - Inaccurate OCR results for lines with many dots

Issue - State: open - Opened by IdiosApps almost 5 years ago - 9 comments
Labels: TOC

#2949 - Sometimes failing to detect multiple columns

Issue - State: open - Opened by kuhanw almost 5 years ago - 2 comments
Labels: layout analysis

#2945 - tesseract with costura.fody

Issue - State: closed - Opened by HeroinGyrl almost 5 years ago - 3 comments
Labels: wontfix

#2930 - Error during processing of HEIC input files

Issue - State: open - Opened by robskrob almost 5 years ago - 25 comments
Labels: question, leptonica

#2923 - Adding whitelist changes whitespace detection behavior

Issue - State: closed - Opened by CIRLOAM almost 5 years ago - 3 comments
Labels: allowlist / denylist

#2879 - Invisible glyph bounds at wrong positions in PDF

Issue - State: closed - Opened by THausherr about 5 years ago - 57 comments
Labels: PDF

#2876 - Build Tesseract from source with Visual Studio

Issue - State: closed - Opened by essamzaky about 5 years ago - 115 comments
Labels: build process

#2838 - Test suite depends on submodule instead of using gtest from distribution

Issue - State: open - Opened by kloczek about 5 years ago - 14 comments
Labels: feature request, help wanted, unit tests

#2815 - ALTO renderer: move to v4, add Glyphs

Pull Request - State: open - Opened by bertsky about 5 years ago - 34 comments
Labels: enhancement

#2781 - Tesseract cannot detect italics?

Issue - State: open - Opened by spajak about 5 years ago - 8 comments
Labels: question, legacy

#2738 - Duplicate Characters in Output Stream

Issue - State: open - Opened by woodjohndavid over 5 years ago - 19 comments
Labels: accuracy, diplopia

#2702 - Extra spaces in any output except txt for non space delimited languages

Issue - State: open - Opened by FrkBo over 5 years ago - 25 comments
Labels: bug, output, non spaced words

#2695 - Can't encode transcription

Issue - State: closed - Opened by peterbence3 over 5 years ago - 24 comments
Labels: bug, duplicate, training, encoding failed

#2656 - text2image - Error: Call PrepareToWrite before WriteTesseractBoxFile!!

Issue - State: closed - Opened by Shreeshrii over 5 years ago - 3 comments
Labels: duplicate, training

#2654 - text2image - RTL - Null box at index 0

Issue - State: open - Opened by Shreeshrii over 5 years ago - 10 comments
Labels: training, RTL, text2image

#2634 - Small extra blocks with a single letter gets split off of bigger text blocks

Issue - State: open - Opened by hnesk over 5 years ago - 1 comment
Labels: help wanted, layout analysis

#2630 - How to train new lang from scratch?

Issue - State: closed - Opened by Sanaj2060 over 5 years ago - 11 comments
Labels: question

#2629 - Bugs reported by OSS-Fuzz

Issue - State: open - Opened by stweil over 5 years ago - 11 comments
Labels: bug, help wanted

#2596 - ocr_textfloat at some places instead of ocr_line.

Issue - State: open - Opened by bhaveshvyas007 over 5 years ago - 2 comments
Labels: output

#2591 - TessBaseAPIDetectOrientationScript : Getting wrong orientation angle

Issue - State: closed - Opened by vipulpatel2103 over 5 years ago - 2 comments

#2504 - Training with intermediate checkpoints

Issue - State: closed - Opened by kamrapooja over 5 years ago - 3 comments

#2459 - RFC: Remove tessdata directory and replace it by a submodule

Pull Request - State: open - Opened by stweil over 5 years ago - 6 comments
Labels: RFC

#2395 - compute ctc target failed

Issue - State: open - Opened by nijanthan0 almost 6 years ago - 42 comments
Labels: training

#2391 - Change Tesseract output with words coming from an external dictionary

Issue - State: closed - Opened by davideromano almost 6 years ago - 21 comments
Labels: feature request

#2384 - Issue with numbers recognition

Issue - State: open - Opened by nijanthan0 almost 6 years ago - 14 comments

#2363 - Two-column document with ordered lists lose numbers

Issue - State: open - Opened by james-s-w-clark almost 6 years ago - 2 comments
Labels: layout analysis

#2334 - Errors building for arm-apple-darwin64 relating to AVX, SSE, and more

Issue - State: closed - Opened by hamchapman almost 6 years ago - 22 comments
Labels: question, build process

#2263 - Numbers in Arabic script are getting reversed

Issue - State: closed - Opened by Shreeshrii almost 6 years ago - 19 comments
Labels: RTL, TOC

#2257 - Optimize calculation of dot product for double vectors with AVX

Pull Request - State: closed - Opened by stweil about 6 years ago - 9 comments
Labels: performance, SIMD

#2156 - Uzn results differ from manual pre-cropping

Issue - State: open - Opened by sweco-sekrsv about 6 years ago - 2 comments
Labels: layout analysis

#2098 - Add config variable for selection of dot product function

Pull Request - State: closed - Opened by stweil about 6 years ago - 11 comments
Labels: feature request

#2071 - Line Indentation is ignored, no way to enable it.

Issue - State: closed - Opened by mickaelistria about 6 years ago - 2 comments
Labels: feature request, question

#2064 - Tesseract 4.0.0 crashed on Intel I5-8400 CPU with Debian 9.6.0 amd64 (SSE/AVX/AVX2)

Issue - State: closed - Opened by s3vrlinux over 6 years ago - 88 comments
Labels: SIMD, unexpected termination

#2052 - [accuracy] 4.0.0 sees white text "Video Mode" on dark grey background as "Vite [ote Cols"

Issue - State: closed - Opened by AdamWill over 6 years ago - 7 comments
Labels: accuracy

#2032 - Lines misalignment in HOCR File

Issue - State: open - Opened by engahmed1190 over 6 years ago - 2 comments
Labels: layout analysis, tables

#2029 - Tesseract fails to initialize : cannot read traineddata files from $TESSDATA_PREFIX

Issue - State: closed - Opened by srdg over 6 years ago - 9 comments
Labels: question

#1932 - Text-line Extraction based on Deep Learining [Feature Wanted]

Issue - State: closed - Opened by ghost over 6 years ago - 5 comments
Labels: feature request

#1902 - text2image Null box at index 0

Issue - State: open - Opened by YiWenFY over 6 years ago - 17 comments
Labels: bug, training, text2image

#1886 - Word-Level OCR

Issue - State: closed - Opened by ghost over 6 years ago - 15 comments

#1799 - tesseract is not installed or it is not in your path

Issue - State: closed - Opened by pete111 over 6 years ago - 17 comments

#1749 - Public key for tesseract-4.00~git2686-1.1.x86_64.rpm is not installed

Issue - State: closed - Opened by gitscamI over 6 years ago - 5 comments

#1714 - [Feature Request] Table structure extraction at the API

Issue - State: open - Opened by troplin over 6 years ago - 69 comments
Labels: feature request, accuracy, tables

#1702 - Warning. Invalid resolution 0 dpi. Using 70 instead.

Issue - State: closed - Opened by JILeXanDR over 6 years ago - 37 comments
Labels: image resolution

#1674 - [Feature Request]: add an option to text2image that randomly mixes different font styles in the same line

Issue - State: open - Opened by Shreeshrii over 6 years ago - 7 comments
Labels: feature request, training, text2image

#1627 - RFC: Situation with tests in Tesseract

Issue - State: closed - Opened by zamazan4ik over 6 years ago - 54 comments
Labels: feature request, question, unit tests, RFC

#1620 - Tesseract Binary: read_params_file: parameter not found: enable_new_segsearch

Issue - State: closed - Opened by jmercouris over 6 years ago - 13 comments

#1600 - Run Tesseract using more than 4 threads ?

Issue - State: closed - Opened by Chien-Hao over 6 years ago - 8 comments

#1465 - Tesseract inserting additional alternative characters

Issue - State: open - Opened by jghare almost 7 years ago - 18 comments
Labels: accuracy, diplopia

#1446 - Issue while using indian language traineddata.

Issue - State: closed - Opened by shekarnode almost 7 years ago - 7 comments

#1362 - recognizes more characters than present

Issue - State: open - Opened by abieler almost 7 years ago - 5 comments
Labels: accuracy, diplopia