Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / jrmuizel/pdf-extract issues and pull requests

#117 - Replace println with log

Pull Request - State: open - Opened by FelixBrakel 16 days ago

#116 - Use the `tracing` library to log instead of `println!`

Pull Request - State: open - Opened by phillipleblanc 17 days ago - 1 comment

#115 - Python bindings

Issue - State: open - Opened by Goldziher 19 days ago - 4 comments

#114 - [FEATURE] Use `log` crate for logging instead of `println`

Issue - State: open - Opened by FelixBrakel 20 days ago - 1 comment

#113 - assertion failure: operation.operands.len() == 6

Issue - State: open - Opened by 0xricksanchez about 1 month ago - 4 comments

#112 - Add support for dehyphenation

Issue - State: open - Opened by jrmuizel about 1 month ago

#111 - Update README link for related project pdfextract by CrossRef

Pull Request - State: closed - Opened by matthewjnield about 1 month ago

#110 - Extract encodings from embeded type1c fonts

Issue - State: closed - Opened by jrmuizel about 1 month ago - 2 comments

#109 - refactor: more robust character encoding

Pull Request - State: closed - Opened by 0xricksanchez about 1 month ago - 2 comments

#108 - "unhandled function type 4" panic

Issue - State: closed - Opened by 0xricksanchez about 1 month ago

#107 - Missing char in unicode map panic

Issue - State: closed - Opened by 0xricksanchez about 1 month ago

#106 - Bad length of hexstring panic

Issue - State: closed - Opened by 0xricksanchez about 1 month ago - 1 comment

#105 - refactor: apply common rust patterns

Pull Request - State: closed - Opened by 0xricksanchez about 1 month ago - 3 comments

#104 - Unicode Mismatch Panic

Issue - State: open - Opened by 0xNF about 1 month ago - 3 comments

#103 - Handle non-identity predefined CMaps

Issue - State: open - Opened by jrmuizel about 2 months ago

#101 - http://arxiv.org/pdf/2312.00577v1 has conversion problems

Issue - State: open - Opened by jrmuizel about 2 months ago

#100 - Call for support

Issue - State: open - Opened by 0xricksanchez about 2 months ago - 4 comments

#99 - two column pdf wrong order

Issue - State: open - Opened by jingangdidi about 2 months ago - 1 comment

#97 - change panic to Result<> parameter

Pull Request - State: open - Opened by jankstar 2 months ago - 5 comments

#96 - Pack of files which cause crashes

Issue - State: open - Opened by qarmin 5 months ago - 2 comments

#95 - release a new version of pdf-extract

Issue - State: closed - Opened by prabirshrestha 6 months ago - 1 comment

#94 - Panics with message "no widths"

Issue - State: closed - Opened by xrl1 6 months ago - 5 comments

#93 - extract pdf by pages (based on https://github.com/jrmuizel/pdf-extract/pull/73)

Pull Request - State: closed - Opened by linusbierhoff 7 months ago - 4 comments

#92 - Using Encoding-RS instead

Pull Request - State: closed - Opened by pvichivanives 7 months ago - 1 comment

#90 - RUSTSEC-2021-0153 Switch to Encoding RS?

Issue - State: closed - Opened by pvichivanives 9 months ago - 1 comment

#89 - panic: index out of bounds: the len is 1 but the index is 1

Issue - State: open - Opened by Sinderella 10 months ago - 2 comments

#88 - Unicode map unsafe get leads to panic

Issue - State: closed - Opened by DimitriTimoz 10 months ago - 2 comments

#87 - Fix crashing debug output in PdfSimpleFont

Pull Request - State: open - Opened by Bennett-Petzold 10 months ago - 1 comment

#86 - add extract txt with page example

Pull Request - State: open - Opened by BenLocal 10 months ago - 1 comment

#85 - Fonts with custom encoding

Issue - State: open - Opened by maxpowel 11 months ago - 5 comments

#84 - Text result split by spacing

Issue - State: closed - Opened by frankvgompel 11 months ago - 2 comments

#83 - Fix panic by setting default_width to Some(1.0)

Pull Request - State: closed - Opened by prscoelho about 1 year ago - 6 comments

#82 - Add decryption functions and attempt decrypt if pdf is encrypted

Pull Request - State: closed - Opened by prscoelho about 1 year ago

#81 - Upgraded lopdf version

Pull Request - State: closed - Opened by maxpowel about 1 year ago

#80 - Added support for missing colour spaces

Pull Request - State: closed - Opened by josemirm about 1 year ago

#79 - Text result is split by spacing

Issue - State: closed - Opened by Implocell about 1 year ago - 8 comments

#78 - FR: Make the HTML output buffer string available

Issue - State: closed - Opened by annie444 about 1 year ago - 2 comments

#77 - fix: Use `get` instead of `[]` to avoid panic when key is missing

Pull Request - State: closed - Opened by dilawar about 1 year ago - 1 comment

#76 - panic while parsing PDF

Issue - State: closed - Opened by dilawar about 1 year ago - 2 comments

#75 - Multiple panics on Arxiv.org PDFs

Issue - State: closed - Opened by jlandahl about 1 year ago - 3 comments

#74 - unexpected smask type 168 0 R

Issue - State: closed - Opened by nbittich about 1 year ago - 3 comments

#73 - Added extract_text_by_page()

Pull Request - State: open - Opened by JustBobinAround about 1 year ago

#72 - thread 'main' panicked at 'missing char 33 in map

Issue - State: closed - Opened by danindiana over 1 year ago - 2 comments

#71 - Spec violation TrueType without Encoding entry

Issue - State: open - Opened by sftse over 1 year ago

#70 - /ToUnicode spec violation

Issue - State: open - Opened by sftse over 1 year ago - 2 comments

#69 - add example document where characters of extracted text are poorly sp…

Pull Request - State: closed - Opened by sftse over 1 year ago - 2 comments

#68 - new line

Issue - State: open - Opened by CaptainKludge over 1 year ago

#67 - panic : missing colorspace [67, 83, 112]

Issue - State: closed - Opened by nbittich over 1 year ago - 2 comments

#66 - panic on unwrap on a None value

Issue - State: closed - Opened by blankenshipz over 1 year ago - 5 comments

#63 - Added output_page fn

Pull Request - State: open - Opened by JuniFruit over 1 year ago

#62 - Empty text output

Issue - State: closed - Opened by Palmik over 1 year ago - 6 comments

#61 - Sanity Check - Unicode Mismatch

Issue - State: closed - Opened by piotroxp over 1 year ago - 9 comments

#60 - Unsafe get and Missing char

Issue - State: closed - Opened by 0xMimir over 1 year ago - 4 comments

#59 - added decryption logic for encrypted document

Pull Request - State: closed - Opened by russellwmy over 1 year ago - 1 comment

#58 - pdftotext -layout equivalent

Issue - State: open - Opened by Sinderella almost 2 years ago - 1 comment

#57 - thread 'main' panicked at 'assertion failed: name == \"Identity-H\"

Issue - State: open - Opened by wingjson almost 2 years ago - 9 comments

#56 - missing char 48 in map

Issue - State: closed - Opened by gravit22 almost 2 years ago - 6 comments

#55 - extract_text_from_mem not found in `pdf_extract`

Issue - State: closed - Opened by felixbecker almost 2 years ago - 3 comments

#54 - Tests from pdf.link files

Pull Request - State: closed - Opened by joepio almost 2 years ago - 1 comment

#52 - remove some println! to be more CLI / TUI friendly

Issue - State: open - Opened by qkzk almost 2 years ago - 2 comments

#51 - Missing LICENSE File

Issue - State: open - Opened by Endle about 2 years ago - 1 comment

#50 - Empty output file running extract example on a test pdf file

Issue - State: closed - Opened by bogct0mculhl about 2 years ago - 3 comments

#49 - Multiple improvements across multiple forks

Pull Request - State: open - Opened by Hessesian about 2 years ago - 3 comments

#48 - Less panics, add error handling, add tests, re-export lopdf, linting, readme

Pull Request - State: open - Opened by joepio about 2 years ago - 9 comments

#47 - Error handling - replace `.unwrap` and `panic` with `?`

Issue - State: open - Opened by joepio about 2 years ago - 7 comments

#46 - panicked at 'attempt to add with overflow'

Issue - State: closed - Opened by AndyJado about 2 years ago - 3 comments

#45 - Change all println statements to dlog

Pull Request - State: open - Opened by jacob-horton over 2 years ago

#44 - re-export lopdf to prevent having to match the same lopdf version …

Pull Request - State: closed - Opened by monkeydioude over 2 years ago - 5 comments

#43 - `pdf_extract::extract_text` returns an empty string for a non-empty PDF

Issue - State: closed - Opened by baarkerlounger over 2 years ago - 9 comments

#42 - extract text from memory

Pull Request - State: closed - Opened by scambier over 2 years ago

#41 - Consider supporting ActualText

Issue - State: open - Opened by badicsalex over 2 years ago

#40 - DeviceN colorspace is not supported

Issue - State: closed - Opened by badicsalex over 2 years ago - 1 comment

#39 - Panic on the SC command when using Pattern colorspace

Issue - State: closed - Opened by badicsalex over 2 years ago

#38 - Panic on specific cases of "Separation"-type ColorSpace

Issue - State: closed - Opened by badicsalex over 2 years ago

#37 - Failure to extract text from AMD GPU ISA docs

Issue - State: closed - Opened by inequation almost 3 years ago - 6 comments

#36 - Panic at FirstChar

Issue - State: open - Opened by Grant-Brinkman almost 3 years ago - 5 comments

#35 - Word spacing is not applied correctly

Issue - State: closed - Opened by badicsalex almost 3 years ago - 1 comment

#34 - Performance: glyphnames::name_to_unicode is very slow

Issue - State: open - Opened by badicsalex almost 3 years ago - 4 comments

#33 - Performance: use nom_parser in lopdf instead of pom_parser

Issue - State: closed - Opened by badicsalex almost 3 years ago - 1 comment

#32 - RUSTSEC-2021-0017

Issue - State: open - Opened by simon-an about 3 years ago - 1 comment

#31 - RUSTSEC-2020-0144

Issue - State: open - Opened by simon-an about 3 years ago - 1 comment

#30 - Fix cargo audit warnings

Pull Request - State: closed - Opened by ghost over 3 years ago

#29 - fixed crashing debug output when font has no name

Pull Request - State: closed - Opened by Grollicus over 3 years ago

#28 - Upgrade lopdf to version 0.26 to resolve panic

Issue - State: open - Opened by rondonjon over 3 years ago - 2 comments

#27 - unexpected smask type 554 0 R

Issue - State: open - Opened by grindfuzz over 3 years ago - 1 comment

#26 - Bump lopdf

Pull Request - State: closed - Opened by teovoinea about 4 years ago - 1 comment

#25 - Changed the panic for a empty value

Pull Request - State: open - Opened by victorinno over 4 years ago - 1 comment

#24 - unexpected smask type <</Type /Mask/S /Luminosity/G 13 0 R>>

Issue - State: open - Opened by victorinno over 4 years ago - 1 comment

#23 - Extract text from string

Issue - State: open - Opened by pickfire over 4 years ago - 4 comments

#22 - fix missing unicode map entries

Pull Request - State: open - Opened by llogiq over 4 years ago - 3 comments

#21 - Added an optional word delimiter.

Pull Request - State: open - Opened by anderejd over 4 years ago - 2 comments

#20 - Update lopdf to v0.24.0

Pull Request - State: closed - Opened by runebaas almost 5 years ago

#19 - Handle documents missing colorspace

Issue - State: open - Opened by eutampieri about 5 years ago - 3 comments

#18 - Apply fix-ups and fix warnings

Pull Request - State: closed - Opened by oleid about 5 years ago