Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / neuml/paperetl issues and pull requests

#60 - Add keyword filter to PMB and modify filtering logic

Issue - State: closed - Opened by davidmezzetti about 1 month ago

#59 - Improve file processing performance

Issue - State: closed - Opened by davidmezzetti about 1 month ago

#58 - Require Python >= 3.9

Issue - State: closed - Opened by davidmezzetti about 1 month ago

#57 - Close processes at end of `Execute.run` method

Issue - State: closed - Opened by davidmezzetti about 2 months ago
Labels: bug

#56 - Can't insert all my data into sqlite database.

Issue - State: closed - Opened by kellytsorb 7 months ago
Labels: bug

#55 - .xml references not processed

Issue - State: open - Opened by amscosta2022 11 months ago - 2 comments

#54 - example data processing warning using google colab

Issue - State: closed - Opened by amscosta 12 months ago - 5 comments

#53 - sample lines for running etl server and grobid instance

Issue - State: closed - Opened by amscosta 12 months ago - 2 comments

#52 - Added note on grobid concurrency configuration to README.

Pull Request - State: closed - Opened by elshimone about 1 year ago - 2 comments

#51 - Use figure index rather than xml:id attribute this is not always present

Pull Request - State: closed - Opened by elshimone about 1 year ago - 2 comments

#50 - Scaling to create a proccess per cpu core overwhelms grobid service

Issue - State: closed - Opened by elshimone about 1 year ago - 1 comment

#49 - Zotero connection

Issue - State: open - Opened by andreifoldes over 1 year ago - 2 comments

#48 - Update setup.py to only show standard image on PyPI

Issue - State: closed - Opened by davidmezzetti over 1 year ago
Labels: bug

#47 - Update minimum Python version to 3.8

Issue - State: closed - Opened by davidmezzetti over 1 year ago

#46 - AttributeError: 'NoneType' object has no attribute 'upper'

Issue - State: closed - Opened by wmurphy126 almost 2 years ago - 3 comments

#45 - sqlite3.OperationalError: database is locked

Issue - State: closed - Opened by Chriszhangmw almost 2 years ago - 6 comments

#44 - Update CORD-19 scripts

Issue - State: closed - Opened by davidmezzetti about 2 years ago

#43 - Add example notebook

Issue - State: closed - Opened by davidmezzetti about 2 years ago

#42 - Improve PMB filtering logic

Issue - State: closed - Opened by davidmezzetti about 2 years ago

#41 - Issue processing into Elasticsearch

Issue - State: closed - Opened by jak1502 about 2 years ago - 5 comments
Labels: bug

#40 - Require Python 3.7+

Issue - State: closed - Opened by davidmezzetti about 3 years ago

#39 - Support reading compressed files

Issue - State: closed - Opened by davidmezzetti about 3 years ago

#38 - Add multiprocessing support to files process

Issue - State: closed - Opened by davidmezzetti about 3 years ago

#36 - Remove legacy merge logic

Issue - State: closed - Opened by davidmezzetti about 3 years ago

#35 - Add pre-commit checks

Issue - State: closed - Opened by davidmezzetti about 3 years ago

#33 - Detect month changes in CORD-19 entry date process

Issue - State: closed - Opened by davidmezzetti over 3 years ago

#32 - Update CORD-19 entry dates source

Issue - State: closed - Opened by davidmezzetti almost 4 years ago

#31 - Add common method for accessing Grammar object

Issue - State: closed - Opened by davidmezzetti almost 4 years ago

#30 - Add generic CSV source

Issue - State: closed - Opened by davidmezzetti almost 4 years ago

#29 - Improve sample size extraction

Issue - State: closed - Opened by davidmezzetti about 4 years ago

#28 - Support spaCy 3.0

Issue - State: closed - Opened by davidmezzetti about 4 years ago - 2 comments

#27 - Ensure length of sections is less than max nlp length

Issue - State: closed - Opened by davidmezzetti about 4 years ago

#26 - Fix bug with study model training

Issue - State: closed - Opened by davidmezzetti about 4 years ago

#25 - Fix bug with JSON export

Issue - State: closed - Opened by davidmezzetti about 4 years ago

#24 - Modify merge method to handle no update merges

Issue - State: closed - Opened by davidmezzetti about 4 years ago

#23 - Increase test coverage

Issue - State: closed - Opened by davidmezzetti about 4 years ago

#22 - Handle PDF parsing exceptions

Pull Request - State: closed - Opened by nialov about 4 years ago - 8 comments

#21 - Evaluate integrating with paperoni

Issue - State: closed - Opened by davidmezzetti about 4 years ago - 1 comment

#20 - Review and update README.md

Issue - State: closed - Opened by davidmezzetti about 4 years ago - 1 comment

#19 - Add pre-trained study design models to GitHub

Issue - State: closed - Opened by davidmezzetti about 4 years ago - 2 comments

#18 - Add component to build entry-dates.csv

Issue - State: closed - Opened by davidmezzetti about 4 years ago

#17 - Add arXiv as source

Issue - State: closed - Opened by davidmezzetti about 4 years ago

#16 - Add PubMed as source

Issue - State: closed - Opened by davidmezzetti about 4 years ago

#15 - Build test suite

Issue - State: closed - Opened by davidmezzetti over 4 years ago

#14 - Filter duplicate ids

Issue - State: closed - Opened by davidmezzetti over 4 years ago

#13 - Use XML id for file figure processing

Issue - State: closed - Opened by davidmezzetti over 4 years ago

#12 - Add file name as source for file process

Issue - State: closed - Opened by davidmezzetti over 4 years ago

#11 - Remove citations table/index

Issue - State: closed - Opened by davidmezzetti over 4 years ago

#10 - Feature: Incremental database update

Issue - State: closed - Opened by davidmezzetti over 4 years ago

#9 - Add dockerfile for building paperetl environment

Issue - State: closed - Opened by davidmezzetti over 4 years ago

#8 - Better error handling for parsing publication date

Issue - State: closed - Opened by davidmezzetti over 4 years ago

#7 - Recursively process files in input directory

Issue - State: closed - Opened by davidmezzetti over 4 years ago

#6 - Additional installation steps and bug for CORD-19

Issue - State: closed - Opened by DavidRivasPhD over 4 years ago - 2 comments

#5 - Error either with or without pre-trained attribute file

Issue - State: closed - Opened by DavidRivasPhD over 4 years ago - 4 comments

#4 - what I missing? KeyError: '47235b96c07e8066195b6521882340408b9bdd34'

Issue - State: closed - Opened by SeekPoint over 4 years ago - 1 comment

#3 - KeyError: 'pdf_json_files'

Issue - State: closed - Opened by SeekPoint over 4 years ago - 4 comments

#2 - Windows install issue

Issue - State: closed - Opened by davidmezzetti over 4 years ago

#1 - PDF extraction improvements

Issue - State: closed - Opened by davidmezzetti over 4 years ago