Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / smalot/pdfparser issues and pull requests

#747 - Extracting graphics from a PDF

Issue - State: open - Opened by Himeos 11 days ago - 1 comment
Labels: question

#746 - Issue With Parsing Graphical PDF

Issue - State: open - Opened by omitpavel 12 days ago
Labels: bug

#745 - Some more exceptions

Pull Request - State: open - Opened by ThomasLandauer 25 days ago - 1 comment
Labels: enhancement

#744 - Introducing CONTRIBUTING.md

Pull Request - State: closed - Opened by k00ni 25 days ago
Labels: enhancement, documentation

#743 - False positive on "Secured pdf file" detection

Issue - State: closed - Opened by CedCannes about 1 month ago - 3 comments

#742 - Fund PDFParser: Thoughts and decisions

Issue - State: open - Opened by unixnut about 1 month ago - 12 comments
Labels: enhancement, help wanted

#741 - phpunit pdf tofu characters detection

Issue - State: open - Opened by 8ctopus about 2 months ago

#739 - Adding more dedicated exceptions

Pull Request - State: closed - Opened by ThomasLandauer about 2 months ago - 7 comments
Labels: enhancement

#737 - Simplified Coding Style checks: PSR12 replaces Symfony, risky not allowed anymore

Pull Request - State: closed - Opened by k00ni 2 months ago
Labels: unit tests / CI

#736 - HorizontalOffset is not supported anymore

Issue - State: open - Opened by luigif 2 months ago - 2 comments
Labels: needs more info

#735 - Allowed memory size exhausted on Font.php line 150

Issue - State: open - Opened by UnnitMetaliya 2 months ago - 13 comments

#734 - Problems with getText() on PDF documents with UTF16BE encoding

Issue - State: open - Opened by SeedDMS 3 months ago
Labels: bug, de-/encoding issue

#733 - getDataTm() provides wrong coordinates for text blocks

Issue - State: open - Opened by parpalak 3 months ago - 1 comment
Labels: bug

#732 - v2.11.0

Issue - State: closed - Opened by k00ni 3 months ago - 2 comments
Labels: new release

#730 - Metadata content garbled for some PDFs

Issue - State: open - Opened by rdmpage 4 months ago
Labels: bug

#729 - fix typo and clarify sentence

Pull Request - State: closed - Opened by bernard-ng 4 months ago
Labels: documentation

#727 - fix: check that the previous xref is not the just processed xref

Pull Request - State: closed - Opened by tkegan 4 months ago - 1 comment
Labels: fix

#726 - PDFs made with MS Edge don't parse

Issue - State: open - Opened by nadkab11 4 months ago - 2 comments
Labels: bug

#725 - getDataTm() with PDF containing accents

Issue - State: open - Opened by tiffanymartin34 5 months ago - 1 comment
Labels: bug, needs more info

#724 - - The code disappears.

Issue - State: closed - Opened by jeoungss12 5 months ago - 2 comments
Labels: bug, invalid, stale

#723 - CI: added PHP 8.4

Pull Request - State: closed - Opened by k00ni 5 months ago - 4 comments
Labels: enhancement, unit tests / CI

#722 - Merge XMP Metadata if dc:format tag not found

Pull Request - State: closed - Opened by GreyWyvern 5 months ago
Labels: fix

#721 - Title and other properties not read with getDetails for some files

Issue - State: closed - Opened by dbarron 5 months ago - 1 comment
Labels: bug

#720 - Implement missing cm command

Pull Request - State: closed - Opened by DominikDostal 6 months ago - 13 comments
Labels: enhancement

#719 - Continuous-integration.yml: let workflow run on each push event

Pull Request - State: closed - Opened by k00ni 6 months ago - 1 comment
Labels: enhancement, unit tests / CI

#718 - Duplicate images in PDF > Find which pages it occurs on

Issue - State: open - Opened by eddih19 6 months ago
Labels: question

#717 - getDataTm returns empty array for one page only

Issue - State: open - Opened by mrasmith 6 months ago - 2 comments
Labels: bug

#716 - Fix hexadecimal decoding (fixes #715)

Pull Request - State: closed - Opened by krzyc 6 months ago - 11 comments
Labels: needs work, fix

#715 - Closing round bracket encoded in hexadecimal format breaks parsing

Issue - State: open - Opened by krzyc 6 months ago - 2 comments
Labels: needs more info, fix

#714 - Facing an Error: Invalid object reference for $obj.

Issue - State: open - Opened by mumer96 6 months ago - 3 comments
Labels: needs more info, stale

#712 - Possibility to improve parsing perfomance by not using uniqid() and add an early return?

Issue - State: open - Opened by bernemann 6 months ago - 8 comments
Labels: enhancement

#711 - Fix for adjacent escaped slashes and escaped parentheses in strings

Pull Request - State: closed - Opened by GreyWyvern 7 months ago - 1 comment
Labels: fix

#710 - []TJ command parsed improperly

Issue - State: closed - Opened by DisabledMonkey 7 months ago - 10 comments
Labels: bug

#709 - preg_match(): Compilation failed: regular expression is too large at offset 38605

Issue - State: closed - Opened by huihuangjiuai 7 months ago - 1 comment
Labels: bug

#708 - pdfparser version 2.10.0 is not updated on packagist

Issue - State: closed - Opened by huihuangjiuai 7 months ago - 2 comments
Labels: new release

#707 - Wrong version tag

Issue - State: closed - Opened by nuernbergerA 7 months ago - 2 comments

#706 - v2.10.0

Issue - State: closed - Opened by k00ni 7 months ago - 1 comment
Labels: new release

#705 - How to get images and text in order as in PDF?

Issue - State: open - Opened by salmanulfaris 7 months ago - 5 comments
Labels: question

#704 - Strengthen check for UTF-8 conformity in formatContent()

Pull Request - State: closed - Opened by GreyWyvern 7 months ago
Labels: enhancement, fix, de-/encoding issue

#703 - can't parse fdpf file from 1.86 version of FPDF and works fine with FPDF 1.81

Issue - State: open - Opened by Saulight73 7 months ago - 8 comments
Labels: bug

#702 - Wrong Character - can detect this ?

Issue - State: open - Opened by davidribatto 8 months ago - 2 comments

#701 - Having error when trying to get details from pdf page generated with html emoji code inside pdf text

Issue - State: open - Opened by luffyfr 8 months ago - 1 comment
Labels: bug, de-/encoding issue

#700 - fix: Return page width and height from document

Pull Request - State: closed - Opened by vitormattos 8 months ago - 1 comment
Labels: enhancement, documentation

#699 - Class PDFDocEncoding does not extend AbstractEncoding

Issue - State: open - Opened by SaschaScholly 8 months ago - 2 comments
Labels: bug

#698 - Fix for two bugs related to Unicode translation support by Font objects

Pull Request - State: closed - Opened by unixnut 8 months ago - 16 comments
Labels: fix

#697 - Parsing with unknown text. Help me resolve

Issue - State: open - Opened by aarjiontech 8 months ago - 7 comments

#696 - MS PDF Printer Chrome 1.7: getText() results in empty text

Issue - State: closed - Opened by pud-micha 8 months ago - 2 comments

#695 - Fixed CS issue in PDFObject.php

Pull Request - State: closed - Opened by k00ni 8 months ago
Labels: fix

#694 - Image position relative to pages (Or text if possible)

Issue - State: open - Opened by Loai-Hassan 8 months ago - 1 comment
Labels: bug, needs more info, PDF required to demonstrate issue

#693 - Account for inline images in formatContent()

Pull Request - State: closed - Opened by GreyWyvern 8 months ago - 4 comments
Labels: fix

#692 - Account for inaccurate offsets in getXrefData()

Pull Request - State: closed - Opened by GreyWyvern 9 months ago
Labels: fix

#691 - Trying to access array offset on value of type null (PDFObject.php line 795)

Issue - State: closed - Opened by iGrog 9 months ago - 5 comments
Labels: bug

#690 - Attempt to fix #659 (gzuncompress(): data error)

Pull Request - State: closed - Opened by k00ni 9 months ago - 4 comments
Labels: needs work, needs more info, stale

#689 - (Question) Replace an Image inside a PDF

Issue - State: closed - Opened by juanborras 9 months ago - 4 comments
Labels: question

#688 - PNG Images with FlateDecode are corrupt

Issue - State: open - Opened by MajedSardar64 9 months ago - 6 comments
Labels: bug

#687 - Filter ElementHexa::decode() of non-hex chars

Pull Request - State: closed - Opened by GreyWyvern 9 months ago
Labels: fix, de-/encoding issue

#686 - Prevent zero from being passed to array_chunk()

Pull Request - State: closed - Opened by GreyWyvern 9 months ago
Labels: fix

#685 - Dropping support for PHP 7.1, 7.2 and 7.3?

Issue - State: open - Opened by k00ni 9 months ago - 2 comments
Labels: question, help wanted

#684 - v2.9.0

Issue - State: closed - Opened by k00ni 9 months ago - 6 comments
Labels: new release

#683 - Weird UTF-8 Characters when parsing hex string (keywords)

Issue - State: closed - Opened by code-mage-com 9 months ago - 3 comments
Labels: bug, de-/encoding issue

#682 - Fixes Scrutinizer integration (mostly failing tests)

Pull Request - State: closed - Opened by k00ni 9 months ago
Labels: unit tests / CI, fix

#681 - Error when obtaining array from a PDF

Issue - State: open - Opened by andresflorez12 9 months ago - 1 comment
Labels: bug

#680 - Add limiter with getText

Pull Request - State: closed - Opened by LiThaM 9 months ago - 1 comment

#679 - Bug in RawDataParser.php when a row has no data

Issue - State: closed - Opened by KeanuTang 9 months ago - 9 comments
Labels: bug

#678 - LZW decode (infinie loop)

Issue - State: open - Opened by ElGigi 9 months ago - 1 comment
Labels: bug

#677 - Fixed latest coding style issues and refined a few PHP doc entries to match types

Pull Request - State: closed - Opened by k00ni 9 months ago
Labels: fix

#676 - Check for binary content in formatContent() before a problematic regexp

Pull Request - State: closed - Opened by GreyWyvern 9 months ago - 2 comments
Labels: fix

#675 - getText() returns text without any spaces when using a pdf from google docs

Issue - State: open - Opened by veepdotai 10 months ago - 15 comments
Labels: bug

#674 - completely different output for table data (2.7.0 vs 2.8.0)

Issue - State: open - Opened by andus4n 10 months ago - 4 comments
Labels: bug

#673 - Undefined array key 1,crash on parsing

Issue - State: closed - Opened by micos7 10 months ago - 5 comments
Labels: bug

#672 - Text Layout are unpredictable

Issue - State: open - Opened by DaLiV 10 months ago - 4 comments
Labels: invalid, needs more info

#671 - getDataTm() positions wrong?

Issue - State: open - Opened by dartheditous 10 months ago - 4 comments
Labels: bug

#670 - Fixed a few coding style issues

Pull Request - State: closed - Opened by k00ni 10 months ago
Labels: unit tests / CI, fix

#669 - Baseencoding fallback

Pull Request - State: closed - Opened by GreyWyvern 10 months ago - 3 comments
Labels: fix

#668 - preg_match(): compilation failed: regular expression is too large to offset 143690

Issue - State: closed - Opened by lonelyrider44 10 months ago - 20 comments
Labels: bug, parsing fail

#667 - Fix for two bugs related to Unicode translation support by Font objects

Pull Request - State: closed - Opened by unixnut 10 months ago - 7 comments
Labels: enhancement, needs work, needs more info, fix, stale, tests required

#666 - Fix returning empty text in some cases

Pull Request - State: closed - Opened by xAzoom 11 months ago - 6 comments
Labels: fix, parsing fail

#664 - Chinese text problems

Issue - State: open - Opened by micos7 11 months ago - 5 comments
Labels: bug, parsing fail, de-/encoding issue

#663 - incorrect parsing tt and ti

Issue - State: open - Opened by mitchgthb 11 months ago - 3 comments
Labels: bug

#662 - Want to remove page numbers eg: (page 1 of 2), when using getText( ), can I achieve that ?

Issue - State: closed - Opened by SwanHtet018 11 months ago - 1 comment
Labels: question

#661 - No output while parsing such files with headers (I suppose)

Issue - State: closed - Opened by tejpsingh9 11 months ago - 4 comments
Labels: bug, parsing fail

#660 - Get text from PDF using PdfParser with table format or section wise get data.

Issue - State: closed - Opened by sagarsapariya93 11 months ago - 2 comments
Labels: parsing fail, stale, PDF required to demonstrate issue

#659 - gzuncompress(): data error

Issue - State: closed - Opened by NickHahac 11 months ago - 9 comments
Labels: needs more info, stale, PDF required to demonstrate issue

#658 - Incorrect parsing, get empty text

Issue - State: open - Opened by ishowshao 11 months ago - 2 comments
Labels: bug, parsing fail

#657 - Font Fallback Issue

Issue - State: open - Opened by paytah232 11 months ago - 12 comments
Labels: bug

#656 - Getting error Class 'Smalot\PdfParser\Parser' not found.

Issue - State: closed - Opened by sagarsapariya93 12 months ago - 1 comment
Labels: invalid

#655 - Fatal Error when parsing some PDFs

Issue - State: open - Opened by soupmagnet 12 months ago - 4 comments
Labels: bug, needs more info

#654 - Strange chararacters while parsing PDF

Issue - State: open - Opened by LudovicMaillet about 1 year ago - 3 comments
Labels: bug, de-/encoding issue

#653 - Ignore encryption

Pull Request - State: closed - Opened by unixnut about 1 year ago - 9 comments
Labels: enhancement

#652 - gettext empty result

Issue - State: open - Opened by bigmoney99 about 1 year ago - 2 comments
Labels: bug

#651 - does your package supports Arabic and Persian language?

Issue - State: open - Opened by mdoulabi1 about 1 year ago - 4 comments
Labels: bug

#650 - Call for Testers! - v2.8.0

Issue - State: closed - Opened by k00ni about 1 year ago - 9 comments
Labels: new release

#649 - Invalid object reference for $obj.

Issue - State: closed - Opened by mschrading about 1 year ago - 1 comment

#648 - Is there any way to get page number with its content?

Issue - State: closed - Opened by mdoulabi1 about 1 year ago - 3 comments
Labels: question