Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / eellak/gsoc2018-gg-extraction issues and pull requests
#7 - Not enough manual extractor debugging done yet
Issue -
State: open - Opened by ckarageorgkaneen over 6 years ago
- 1 comment
Labels: good first issue
#6 - Parsed txt often scrambled
Issue -
State: open - Opened by ckarageorgkaneen over 6 years ago
- 1 comment
Labels: help wanted
#5 - Add pdfminer.six and openpyxl to the dependencies
Issue -
State: closed - Opened by varlamis over 6 years ago
#4 - No UTF-support when run with python 2.7
Issue -
State: closed - Opened by varlamis over 6 years ago
#3 - Non-linear pdf elements extracted one character at a time
Issue -
State: closed - Opened by ckarageorgkaneen over 6 years ago
#2 - Keys vary greatly, lots of edge-cases & human-errors in the GG issues (spelling etc.)
Issue -
State: open - Opened by ckarageorgkaneen over 6 years ago
Labels: enhancement
#1 - '(cid:#)' occurences in simple_pdf_to_txt( ) raw text (produced by the 'pdf2txt.py' program)
Issue -
State: closed - Opened by ckarageorgkaneen over 6 years ago
- 6 comments