Ecosyste.ms: Issues
An open API service providing issue and pull request metadata for open source projects.
GitHub / internetarchive/archive-pdf-tools issues and pull requests
#70 - error during installation
Issue -
State: closed - Opened by zbgns 3 months ago
- 10 comments
#69 - Recode does not merge hocr into pdf
Issue -
State: open - Opened by jcuenod over 1 year ago
- 6 comments
#68 - Fix pdfrenderer.py reference
Pull Request -
State: closed - Opened by tfmorris over 1 year ago
- 5 comments
#67 - A user-friendly example for a scanned multipage PDF needed
Issue -
State: open - Opened by FilipDominec over 1 year ago
- 3 comments
#66 - A certain PDF from Archive.org does not display all of its contents on Mac OS
Issue -
State: closed - Opened by EngineersNeedArt over 1 year ago
- 26 comments
#65 - Q: accessible tagging/hints?
Issue -
State: closed - Opened by jrochkind almost 2 years ago
- 4 comments
#64 - Installing on MacOS?
Issue -
State: closed - Opened by jrochkind almost 2 years ago
- 29 comments
#63 - HOCR rendering compares unfavorably with tesseract PDF text layer
Issue -
State: open - Opened by jrochkind almost 2 years ago
- 11 comments
#62 - Additional apt packages needed to build current jbig2enc on Ubuntu 22.04
Pull Request -
State: closed - Opened by jrochkind almost 2 years ago
- 1 comment
#61 - IndexError: list index out of range (single TIFF file)
Issue -
State: closed - Opened by jrochkind almost 2 years ago
- 5 comments
#60 - First recode_pdf test: 'numpy' has no attribute 'int'.
Issue -
State: closed - Opened by dwids almost 2 years ago
- 5 comments
#59 - Wrong resolution of mask image when foreground image is downsampled
Issue -
State: open - Opened by JoeLoginIsAlreadyTaken about 2 years ago
- 1 comment
#58 - Update requirements.txt
Pull Request -
State: closed - Opened by Redsandro over 2 years ago
- 1 comment
#57 - Fix an error and a warning reported by LGTM
Pull Request -
State: open - Opened by stweil over 2 years ago
- 1 comment
#56 - Fix it's => its in documentation
Pull Request -
State: closed - Opened by stweil over 2 years ago
- 1 comment
#55 - pdfcomp: problems with inverted text that is often better in hocr.
Issue -
State: open - Opened by rmast over 2 years ago
- 10 comments
#54 - The choice for inverting, what's the use for perc_larger?
Issue -
State: open - Opened by rmast over 2 years ago
#53 - correct ratio determination for noise estimation
Pull Request -
State: open - Opened by rmast over 2 years ago
- 5 comments
#52 - Bug in foreground/background separator choosing massive block instead of character outline.
Issue -
State: open - Opened by rmast over 2 years ago
- 14 comments
#51 - pdfcomp: new tool, discussion, compression questions
Issue -
State: open - Opened by MerlijnWajer over 2 years ago
- 19 comments
#50 - Missing test suite?
Issue -
State: open - Opened by mara004 over 2 years ago
- 1 comment
#49 - Upgrade GitHub Actions
Pull Request -
State: closed - Opened by cclauss over 2 years ago
- 3 comments
#48 - Create better presets for users with quality-comparable options for openjpeg/grok/pillow and kakadu
Issue -
State: open - Opened by MerlijnWajer over 2 years ago
- 1 comment
#47 - Define scope of tooling and work to improve for that scope
Issue -
State: open - Opened by MerlijnWajer over 2 years ago
#46 - Detect if RGB images in pages are greyscale or even 1bit
Issue -
State: open - Opened by MerlijnWajer over 2 years ago
#45 - Some scans become inverted
Issue -
State: closed - Opened by Redsandro over 2 years ago
- 7 comments
#44 - Update README add installation instructions
Pull Request -
State: closed - Opened by Redsandro over 2 years ago
- 9 comments
#43 - Need some inspiration?
Issue -
State: open - Opened by rmast almost 3 years ago
- 7 comments
#42 - pillow is not working properly
Issue -
State: open - Opened by Redsandro almost 3 years ago
- 27 comments
#41 - openjpeg is not working properly
Issue -
State: closed - Opened by Redsandro almost 3 years ago
- 43 comments
#40 - Update README fix typo
Pull Request -
State: closed - Opened by Redsandro almost 3 years ago
- 2 comments
#39 - Fix setup by reading the version file manually
Pull Request -
State: closed - Opened by mara004 about 3 years ago
- 1 comment
#38 - Improve setup configuration (see #36)
Pull Request -
State: closed - Opened by mara004 about 3 years ago
- 6 comments
#37 - Just some other errors with the current version. I can't get the current version to work with a hocr-file coming from pdftree to get out the current searchable text from a PDF
Issue -
State: closed - Opened by rmast about 3 years ago
- 18 comments
#36 - master file contents.rst not found during build of docs
Issue -
State: closed - Opened by rmast about 3 years ago
- 8 comments
#35 - --jbig2 deprecated
Issue -
State: closed - Opened by rmast about 3 years ago
- 1 comment
#34 - License (in)compatibility
Issue -
State: open - Opened by rmast about 3 years ago
- 4 comments
#33 - Usefulness of MRC for decent quality compression of scanned book pages with illustrations
Issue -
State: open - Opened by fusefib about 3 years ago
- 42 comments
#32 - I don't understand this picture
Issue -
State: open - Opened by rmast about 3 years ago
- 11 comments
#31 - Small difference in compressionratio
Issue -
State: open - Opened by rmast about 3 years ago
- 9 comments
#30 - Error with hocr-files from Tesseract
Issue -
State: closed - Opened by rmast about 3 years ago
- 25 comments
#29 - Support pillow jpeg2000 writing
Issue -
State: closed - Opened by MerlijnWajer about 3 years ago
- 3 comments
#28 - Support recompressing existing PDFs without hOCR files and without touching the text input
Issue -
State: open - Opened by MerlijnWajer about 3 years ago
#27 - Use (not yet released) pdf->hocr conversation to improve compression for existing PDFs
Issue -
State: open - Opened by MerlijnWajer about 3 years ago
- 2 comments
#26 - Lot of fuzz in background picture
Issue -
State: open - Opened by rmast about 3 years ago
- 36 comments
#25 - Add --best flag?
Issue -
State: open - Opened by MerlijnWajer about 3 years ago
- 2 comments
#24 - Run noise estimation on a part of the image
Issue -
State: closed - Opened by MerlijnWajer about 3 years ago
- 1 comment
#23 - Support hOCR ocr_photo / ocr_image element
Issue -
State: open - Opened by MerlijnWajer about 3 years ago
#22 - Windows port
Issue -
State: closed - Opened by MerlijnWajer over 3 years ago
- 19 comments
#21 - Add option to disable jbig2
Issue -
State: closed - Opened by MerlijnWajer over 3 years ago
- 1 comment
#20 - Add option (and heuristic) to treat the background as 'just plain (white) paper' for further optimisations
Issue -
State: open - Opened by MerlijnWajer over 3 years ago
#19 - Add support for 1-bit (black & white) mode, where the end result is just the mask
Issue -
State: closed - Opened by MerlijnWajer over 3 years ago
- 1 comment
#18 - Undefined name: 'obj_' --> 'self._obj'
Pull Request -
State: closed - Opened by cclauss over 3 years ago
- 1 comment
#17 - PDF/UA improvements
Issue -
State: open - Opened by MerlijnWajer over 3 years ago
- 3 comments
#16 - Add tests...
Issue -
State: open - Opened by MerlijnWajer over 3 years ago
- 1 comment
#15 - look at kakadu/grok/openjpeg compression parameters
Issue -
State: closed - Opened by MerlijnWajer over 3 years ago
- 1 comment
#14 - Use "linear" option from new pymupdf (if it doesn't break metadata writing)
Issue -
State: open - Opened by MerlijnWajer over 3 years ago
#13 - Use JBIG2 compression to determine if we want to blur or denoise before thresholding
Issue -
State: closed - Opened by MerlijnWajer over 3 years ago
- 2 comments
#12 - Support actual recompression of an existing PDF without any input hOCR or input images
Issue -
State: open - Opened by MerlijnWajer over 3 years ago
#11 - Support PDF generation/compression without hOCR files
Issue -
State: open - Opened by MerlijnWajer over 3 years ago
#10 - Consider turning on mask denoising by default
Issue -
State: closed - Opened by MerlijnWajer over 3 years ago
- 1 comment
#9 - Upon release of the new mupdf and pymupdf, flip on JBIG2 by default
Issue -
State: closed - Opened by MerlijnWajer over 3 years ago
#8 - Improve mask and background generation
Issue -
State: open - Opened by MerlijnWajer over 3 years ago
- 9 comments
#7 - Look into increasing the quality of the foreground image by compressing less
Issue -
State: closed - Opened by MerlijnWajer over 3 years ago
- 1 comment
#6 - Maybe support a glob for hocr files too, rather than requiring them to be combined into a single file
Issue -
State: open - Opened by MerlijnWajer over 3 years ago
#5 - Add/implement regression tests for MRC
Issue -
State: open - Opened by MerlijnWajer over 3 years ago
- 2 comments
#4 - Add another font beyond the glyphless font to actually render fonts of the languages that are in use
Issue -
State: open - Opened by MerlijnWajer over 3 years ago
#3 - Look into support JPG instead of JPEG2000 for foreground/background generation
Issue -
State: closed - Opened by MerlijnWajer over 3 years ago
- 1 comment
#2 - Support PDF generation without MRC
Issue -
State: open - Opened by MerlijnWajer over 3 years ago
#1 - Support Grok for JPEG2000 encode/decode and support OpenJPEG2000 in a better fashion
Issue -
State: closed - Opened by MerlijnWajer over 3 years ago
- 1 comment