Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / tesseract-ocr/tesseract issues and pull requests
#3843 - Tesseract returns invalid characters for images with lack of text (for PSM=12)
Issue -
State: open - Opened by krzysiekj94 over 2 years ago
- 2 comments
#3836 - Detect text rotation without running recognition
Issue -
State: closed - Opened by Balearica over 2 years ago
- 9 comments
Labels: feature request
#3826 - ImportError symbol not found in flat namespace '__ZN9tesseract11TessBaseAPID1Ev'
Issue -
State: closed - Opened by lokkasl over 2 years ago
- 5 comments
Labels: question
#3817 - Text2Image isn't working properly
Issue -
State: open - Opened by Zacharymk1213 almost 3 years ago
- 7 comments
Labels: text2image
#3808 - Pdf offset fix
Pull Request -
State: closed - Opened by mcsjosh almost 3 years ago
- 5 comments
Labels: PDF, 32-bit
#3807 - Q&A: Technical drawings OCR
Issue -
State: closed - Opened by ViterAlex almost 3 years ago
- 3 comments
Labels: question
#3805 - TessPDFRenderer uses "long int" to store PDF offsets, which is too small on some platforms
Issue -
State: closed - Opened by mcsjosh almost 3 years ago
- 6 comments
Labels: bug, PDF, 32-bit
#3787 - RFC: Improve positioning of symbol bounding boxes
Pull Request -
State: open - Opened by p12tic almost 3 years ago
- 4 comments
Labels: bounding box
#3767 - Compiled-in TESSDATA_PREFIX unused on Windows
Issue -
State: closed - Opened by CSBVision almost 3 years ago
- 11 comments
Labels: bug
#3763 - BCER eval displayed during lstmtraining and that from lstmeval are different
Issue -
State: open - Opened by Shreeshrii almost 3 years ago
- 11 comments
Labels: training
#3734 - Add a variable to set single pattern without config file
Issue -
State: open - Opened by bo-bac about 3 years ago
- 6 comments
Labels: feature request, API
#3731 - Build with IronOCR(Tesseract 5 engine) on Unity fails
Issue -
State: closed - Opened by suzyrhkr about 3 years ago
- 1 comment
Labels: 3rd party tool
#3709 - Add support for Unicode filenames on MS Windows
Issue -
State: open - Opened by stweil about 3 years ago
- 10 comments
Labels: feature request, unicode
#3693 - Using -l <language>+equ frequently causes floating point exceptions (core dumped)
Issue -
State: open - Opened by callegar about 3 years ago
- 15 comments
Labels: bug, equation detection
#3679 - 5.0.0: build fails
Issue -
State: closed - Opened by kloczek about 3 years ago
- 32 comments
Labels: build process, unit tests
#3673 - Plans for tesseract 5.x.y
Issue -
State: open - Opened by amitdo about 3 years ago
- 138 comments
#3655 - lstmtraining mutex lock failed
Issue -
State: open - Opened by stweil over 3 years ago
- 2 comments
Labels: bug, training, unexpected termination
#3614 - tosp_old_to_method: Inconsistent in different models
Issue -
State: open - Opened by amitdo over 3 years ago
- 6 comments
Labels: traineddata
#3610 - Fix issue #1073 (use default language only when necessary)
Pull Request -
State: closed - Opened by stweil over 3 years ago
- 1 comment
Labels: API
#3587 - How to Diagnose Overfitting and Underfitting of Tesseract Models?
Issue -
State: closed - Opened by Mann1904 over 3 years ago
- 2 comments
Labels: question, training
#3586 - Potential segmentation fault by wrong format string
Issue -
State: open - Opened by autofuzzoss over 3 years ago
- 4 comments
Labels: enhancement
#3515 - Crash in old CPU
Issue -
State: closed - Opened by hereis00 over 3 years ago
- 7 comments
Labels: unexpected termination
#3501 - Solve clang reporting unused variable in ExtractMicros function
Pull Request -
State: closed - Opened by zdenop over 3 years ago
- 4 comments
#3481 - Fix possible UB when accessing empty vector's data
Pull Request -
State: open - Opened by nocun over 3 years ago
- 4 comments
#3476 - Fix for LSTM Diplopia issue
Pull Request -
State: open - Opened by woodjohndavid over 3 years ago
- 27 comments
#3466 - Build failed for ARM64 Windows10
Issue -
State: closed - Opened by stonerey over 3 years ago
- 16 comments
Labels: build process, awaiting feedback, msvc
#3452 - API: Different results for the same image depending on the order in which the files are processed
Issue -
State: closed - Opened by nagadomi over 3 years ago
- 9 comments
Labels: bug, regression
#3435 - Support image width and height larger than 32767
Pull Request -
State: open - Opened by stweil almost 4 years ago
- 20 comments
Labels: enhancement
#3421 - lstmeval: Improve output by ensuring 'Truth:' text is encoded the same way as OCR output…
Pull Request -
State: open - Opened by nickjwhite almost 4 years ago
- 5 comments
#3418 - Add more binarization options
Pull Request -
State: closed - Opened by amitdo almost 4 years ago
- 10 comments
Labels: leptonica, enhancement, binarization
#3406 - Adding --print-fonts-table parameter & tessedit_font_id configuration option
Pull Request -
State: closed - Opened by Lucas-C almost 4 years ago
- 5 comments
Labels: legacy
#3396 - fix clang cmake build on windows
Pull Request -
State: closed - Opened by zdenop almost 4 years ago
#3386 - Tesseract does not generate .lstmf for some images
Issue -
State: open - Opened by DavidHribek almost 4 years ago
- 8 comments
Labels: bug
#3374 - use the Viewer to debug recognition failed
Issue -
State: closed - Opened by martin-matj almost 4 years ago
- 17 comments
Labels: bug
#3369 - tesseract process never finishes with specific gif image
Issue -
State: open - Opened by wix-andriusb almost 4 years ago
- 37 comments
Labels: bug, feature request, performance, leptonica, process hangs, binarization
#3303 - hOCR renderer writes "x_size" (instead of "x_fsize") property to ocr_line/ocr_header/...
Issue -
State: open - Opened by MerlijnWajer about 4 years ago
- 9 comments
#3216 - [VS2019] Linker error
Issue -
State: closed - Opened by OgreTransporter about 4 years ago
- 7 comments
#3184 - Maximum supported image size
Issue -
State: open - Opened by MerlijnWajer about 4 years ago
- 8 comments
Labels: feature request
#3149 - Lots of uppercase letters instead of lowercase
Issue -
State: open - Opened by Masina86 over 4 years ago
Labels: accuracy, ambiguously
#3144 - Character confusion fix suggestion
Issue -
State: open - Opened by EucliTs0 over 4 years ago
- 44 comments
Labels: accuracy, diplopia
#3131 - Add Kubernetes and Helm Support
Issue -
State: closed - Opened by bishtsaurabh5 over 4 years ago
- 3 comments
Labels: feature request
#3115 - Tesseract failing to recognize very simple text from a clean image
Issue -
State: closed - Opened by gbersac over 4 years ago
- 1 comment
Labels: digits
#3109 - Running multiple tesseract instances in parallel is slower than running in serial
Issue -
State: open - Opened by ole-tange over 4 years ago
- 7 comments
Labels: feature request, performance, OpenMP
#3095 - Specify Custom Installation path in silent install
Issue -
State: closed - Opened by jdfcio over 4 years ago
- 3 comments
Labels: nsis
#3062 - Alpha 0 text on pdf output
Issue -
State: closed - Opened by Maxime-J over 4 years ago
- 3 comments
Labels: PDF, alpha channel
#3028 - Unable to detect simple math equations using pytessract
Issue -
State: closed - Opened by NavpreetDevpuri over 4 years ago
- 20 comments
Labels: wontfix, equation detection
#3021 - Tesseract Empty Page
Issue -
State: open - Opened by M3ssman over 4 years ago
- 42 comments
Labels: bug, bounding box, binarization
#3000 - Debugger Viewer doesn't work, can't connect to ScrollView server
Issue -
State: closed - Opened by ellislau over 4 years ago
- 3 comments
#2995 - tesseract doesn't recognize ISO639 code "zho" for chinese
Issue -
State: open - Opened by Seegras over 4 years ago
- 2 comments
Labels: feature request, traineddata
#2978 - NEON SIMD code.
Pull Request -
State: closed - Opened by robinwatts almost 5 years ago
- 28 comments
Labels: performance, SIMD, enhancement
#2970 - Inaccurate OCR results for lines with many dots
Issue -
State: open - Opened by IdiosApps almost 5 years ago
- 9 comments
Labels: TOC
#2949 - Sometimes failing to detect multiple columns
Issue -
State: open - Opened by kuhanw almost 5 years ago
- 2 comments
Labels: layout analysis
#2945 - tesseract with costura.fody
Issue -
State: closed - Opened by HeroinGyrl almost 5 years ago
- 3 comments
Labels: wontfix
#2930 - Error during processing of HEIC input files
Issue -
State: open - Opened by robskrob almost 5 years ago
- 25 comments
Labels: question, leptonica
#2923 - Adding whitelist changes whitespace detection behavior
Issue -
State: closed - Opened by CIRLOAM almost 5 years ago
- 3 comments
Labels: allowlist / denylist
#2879 - Invisible glyph bounds at wrong positions in PDF
Issue -
State: closed - Opened by THausherr about 5 years ago
- 57 comments
Labels: PDF
#2876 - Build Tesseract from source with Visual Studio
Issue -
State: closed - Opened by essamzaky about 5 years ago
- 115 comments
Labels: build process
#2838 - Test suite depends on submodule instead of using gtest from distribution
Issue -
State: open - Opened by kloczek about 5 years ago
- 14 comments
Labels: feature request, help wanted, unit tests
#2815 - ALTO renderer: move to v4, add Glyphs
Pull Request -
State: open - Opened by bertsky about 5 years ago
- 34 comments
Labels: enhancement
#2781 - Tesseract cannot detect italics?
Issue -
State: open - Opened by spajak about 5 years ago
- 8 comments
Labels: question, legacy
#2738 - Duplicate Characters in Output Stream
Issue -
State: open - Opened by woodjohndavid over 5 years ago
- 19 comments
Labels: accuracy, diplopia
#2702 - Extra spaces in any output except txt for non space delimited languages
Issue -
State: open - Opened by FrkBo over 5 years ago
- 25 comments
Labels: bug, output, non spaced words
#2695 - Can't encode transcription
Issue -
State: closed - Opened by peterbence3 over 5 years ago
- 24 comments
Labels: bug, duplicate, training, encoding failed
#2656 - text2image - Error: Call PrepareToWrite before WriteTesseractBoxFile!!
Issue -
State: closed - Opened by Shreeshrii over 5 years ago
- 3 comments
Labels: duplicate, training
#2654 - text2image - RTL - Null box at index 0
Issue -
State: open - Opened by Shreeshrii over 5 years ago
- 10 comments
Labels: training, RTL, text2image
#2634 - Small extra blocks with a single letter gets split off of bigger text blocks
Issue -
State: open - Opened by hnesk over 5 years ago
- 1 comment
Labels: help wanted, layout analysis
#2630 - How to train new lang from scratch?
Issue -
State: closed - Opened by Sanaj2060 over 5 years ago
- 11 comments
Labels: question
#2629 - Bugs reported by OSS-Fuzz
Issue -
State: open - Opened by stweil over 5 years ago
- 11 comments
Labels: bug, help wanted
#2596 - ocr_textfloat at some places instead of ocr_line.
Issue -
State: open - Opened by bhaveshvyas007 over 5 years ago
- 2 comments
Labels: output
#2591 - TessBaseAPIDetectOrientationScript : Getting wrong orientation angle
Issue -
State: closed - Opened by vipulpatel2103 over 5 years ago
- 2 comments
#2504 - Training with intermediate checkpoints
Issue -
State: closed - Opened by kamrapooja over 5 years ago
- 3 comments
#2459 - RFC: Remove tessdata directory and replace it by a submodule
Pull Request -
State: open - Opened by stweil over 5 years ago
- 6 comments
Labels: RFC
#2395 - compute ctc target failed
Issue -
State: open - Opened by nijanthan0 almost 6 years ago
- 42 comments
Labels: training
#2391 - Change Tesseract output with words coming from an external dictionary
Issue -
State: closed - Opened by davideromano almost 6 years ago
- 21 comments
Labels: feature request
#2384 - Issue with numbers recognition
Issue -
State: open - Opened by nijanthan0 almost 6 years ago
- 14 comments
#2363 - Two-column document with ordered lists lose numbers
Issue -
State: open - Opened by james-s-w-clark almost 6 years ago
- 2 comments
Labels: layout analysis
#2334 - Errors building for arm-apple-darwin64 relating to AVX, SSE, and more
Issue -
State: closed - Opened by hamchapman almost 6 years ago
- 22 comments
Labels: question, build process
#2263 - Numbers in Arabic script are getting reversed
Issue -
State: closed - Opened by Shreeshrii almost 6 years ago
- 19 comments
Labels: RTL, TOC
#2257 - Optimize calculation of dot product for double vectors with AVX
Pull Request -
State: closed - Opened by stweil about 6 years ago
- 9 comments
Labels: performance, SIMD
#2156 - Uzn results differ from manual pre-cropping
Issue -
State: open - Opened by sweco-sekrsv about 6 years ago
- 2 comments
Labels: layout analysis
#2098 - Add config variable for selection of dot product function
Pull Request -
State: closed - Opened by stweil about 6 years ago
- 11 comments
Labels: feature request
#2071 - Line Indentation is ignored, no way to enable it.
Issue -
State: closed - Opened by mickaelistria about 6 years ago
- 2 comments
Labels: feature request, question
#2064 - Tesseract 4.0.0 crashed on Intel I5-8400 CPU with Debian 9.6.0 amd64 (SSE/AVX/AVX2)
Issue -
State: closed - Opened by s3vrlinux over 6 years ago
- 88 comments
Labels: SIMD, unexpected termination
#2052 - [accuracy] 4.0.0 sees white text "Video Mode" on dark grey background as "Vite [ote Cols"
Issue -
State: closed - Opened by AdamWill over 6 years ago
- 7 comments
Labels: accuracy
#2032 - Lines misalignment in HOCR File
Issue -
State: open - Opened by engahmed1190 over 6 years ago
- 2 comments
Labels: layout analysis, tables
#2029 - Tesseract fails to initialize : cannot read traineddata files from $TESSDATA_PREFIX
Issue -
State: closed - Opened by srdg over 6 years ago
- 9 comments
Labels: question
#1932 - Text-line Extraction based on Deep Learining [Feature Wanted]
Issue -
State: closed - Opened by ghost over 6 years ago
- 5 comments
Labels: feature request
#1902 - text2image Null box at index 0
Issue -
State: open - Opened by YiWenFY over 6 years ago
- 17 comments
Labels: bug, training, text2image
#1886 - Word-Level OCR
Issue -
State: closed - Opened by ghost over 6 years ago
- 15 comments
#1799 - tesseract is not installed or it is not in your path
Issue -
State: closed - Opened by pete111 over 6 years ago
- 17 comments
#1749 - Public key for tesseract-4.00~git2686-1.1.x86_64.rpm is not installed
Issue -
State: closed - Opened by gitscamI over 6 years ago
- 5 comments
#1714 - [Feature Request] Table structure extraction at the API
Issue -
State: open - Opened by troplin over 6 years ago
- 69 comments
Labels: feature request, accuracy, tables
#1702 - Warning. Invalid resolution 0 dpi. Using 70 instead.
Issue -
State: closed - Opened by JILeXanDR over 6 years ago
- 37 comments
Labels: image resolution
#1674 - [Feature Request]: add an option to text2image that randomly mixes different font styles in the same line
Issue -
State: open - Opened by Shreeshrii over 6 years ago
- 7 comments
Labels: feature request, training, text2image
#1627 - RFC: Situation with tests in Tesseract
Issue -
State: closed - Opened by zamazan4ik over 6 years ago
- 54 comments
Labels: feature request, question, unit tests, RFC
#1620 - Tesseract Binary: read_params_file: parameter not found: enable_new_segsearch
Issue -
State: closed - Opened by jmercouris over 6 years ago
- 13 comments
#1600 - Run Tesseract using more than 4 threads ?
Issue -
State: closed - Opened by Chien-Hao over 6 years ago
- 8 comments
#1465 - Tesseract inserting additional alternative characters
Issue -
State: open - Opened by jghare almost 7 years ago
- 18 comments
Labels: accuracy, diplopia
#1446 - Issue while using indian language traineddata.
Issue -
State: closed - Opened by shekarnode almost 7 years ago
- 7 comments
#1362 - recognizes more characters than present
Issue -
State: open - Opened by abieler almost 7 years ago
- 5 comments
Labels: accuracy, diplopia