Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / kermitt2/grobid issues and pull requests
#1247 - Bugfix/adjust notes avoid stackoverflow
Pull Request -
State: closed - Opened by lfoppiano 5 days ago
#1246 - Avoid empty figures/tables reference markers
Pull Request -
State: closed - Opened by lfoppiano 6 days ago
- 1 comment
Labels: bug
#1245 - Fix flavor parameter
Pull Request -
State: closed - Opened by lfoppiano 13 days ago
- 1 comment
#1244 - Wrongly placed figure reference
Issue -
State: open - Opened by lfoppiano 13 days ago
Labels: bug, implemented
#1243 - Server Memory Limit
Issue -
State: closed - Opened by ccchan-cpii 17 days ago
- 4 comments
#1242 - Avoid running aground when detecting the language
Pull Request -
State: closed - Opened by lfoppiano 17 days ago
- 1 comment
#1241 - Language cannot be null - when segmenting sentences
Issue -
State: closed - Opened by lfoppiano 22 days ago
Labels: bug
#1240 - Training the fulltext model
Issue -
State: open - Opened by martasoricetti 22 days ago
- 12 comments
#1239 - Use lingua for language recognition
Pull Request -
State: open - Opened by lfoppiano 24 days ago
#1238 - Fix the way table notes are streamed on the XML
Pull Request -
State: closed - Opened by lfoppiano 25 days ago
- 1 comment
#1237 - Training fulltext - annotation doubt
Issue -
State: open - Opened by martasoricetti 30 days ago
- 1 comment
Labels: question, training guidelines, models:fulltext
#1236 - For some PDFs fulltext document does not contain header document
Issue -
State: open - Opened by loopdeloop76 about 1 month ago
- 4 comments
Labels: question
#1235 - Avoid empty sentences
Pull Request -
State: closed - Opened by lfoppiano about 1 month ago
- 1 comment
#1234 - Error 408 During PDF Batch Processing: Empty TXT Files Created Without Detailed Logs
Issue -
State: open - Opened by Samuel-Scalbert about 1 month ago
- 3 comments
#1233 - <ref type="figure" target="#fig_5"> missed
Issue -
State: open - Opened by Samuel-Scalbert about 1 month ago
- 3 comments
Labels: bug
#1232 - <p> duplicates
Issue -
State: open - Opened by Samuel-Scalbert about 1 month ago
- 7 comments
Labels: bug, implemented
#1231 - "Author contributions" section content is skipped by grobid
Issue -
State: open - Opened by i-amkashif about 1 month ago
- 1 comment
Labels: bug
#1230 - Missing Wapiti for linux arm64
Issue -
State: open - Opened by AaronNGray about 1 month ago
- 5 comments
#1229 - grobid run giving '/tini: 1: Syntax error: "(" unexpected'
Issue -
State: open - Opened by AaronNGray about 1 month ago
- 8 comments
#1228 - Updated Grobid lucene analyzers for CJK languages
Pull Request -
State: closed - Opened by kermitt2 about 1 month ago
- 3 comments
#1227 - Fix for CVE in CommonsIO
Pull Request -
State: closed - Opened by kermitt2 about 1 month ago
- 2 comments
#1226 - Replace cybozu lang detection with Lingua
Issue -
State: open - Opened by lfoppiano about 1 month ago
Labels: enhancement
#1225 - Fix code scanning alert no. 39: Arbitrary file access during archive extraction ("Zip Slip")
Pull Request -
State: closed - Opened by lfoppiano about 1 month ago
- 1 comment
#1224 - make the start/end page for header processing customizable #282
Pull Request -
State: closed - Opened by lfoppiano about 1 month ago
Labels: enhancement
#1223 - Fix code scanning alert no. 61: Arbitrary file access during archive extraction ("Zip Slip")
Pull Request -
State: closed - Opened by lfoppiano about 1 month ago
- 2 comments
#1222 - Build Failed on Ubuntu GLIBC 2.35 (runs well on 2.31)
Issue -
State: open - Opened by laoliu5280 about 1 month ago
- 2 comments
Labels: Linux-specific
#1221 - DAS extraction issues in Plos Articles
Issue -
State: open - Opened by lfoppiano about 1 month ago
- 1 comment
Labels: error cases, models:segmentation, models:header
#1220 - How should I annotate markers to list items? - training the fulltext model
Issue -
State: open - Opened by martasoricetti about 2 months ago
- 1 comment
Labels: question, training guidelines
#1219 - When ARM64 DLL will be available (present in lib and pdfalto)
Issue -
State: open - Opened by manjulahonnappa-agi about 2 months ago
- 4 comments
#1218 - Regarding GROBID support ARM64
Issue -
State: open - Opened by manjulahonnappa-agi about 2 months ago
- 1 comment
Labels: need help, macOS-specific
#1217 - Extremely large PDF (245MB) returns [BAD_INPUT_DATA] PDF to XML conversion failed with error code: 134
Issue -
State: closed - Opened by OESNES about 2 months ago
- 2 comments
#1216 - Update pdfalto recognition of non-standard fonts
Pull Request -
State: open - Opened by lfoppiano about 2 months ago
- 1 comment
#1215 - Process figures,tables and equations from back/annex section
Pull Request -
State: open - Opened by lfoppiano about 2 months ago
- 1 comment
#1214 - add corrected training data for the segmentation model
Pull Request -
State: open - Opened by lfoppiano about 2 months ago
- 1 comment
#1213 - Missing last line in a page but it's due to the segmentation model
Issue -
State: open - Opened by lfoppiano about 2 months ago
Labels: error cases, models:segmentation
#1212 - Collect "other" text on request
Pull Request -
State: closed - Opened by lfoppiano 2 months ago
- 1 comment
#1211 - Fix Deep Learning header classification inconsistency
Pull Request -
State: open - Opened by lfoppiano 2 months ago
- 1 comment
#1210 - consolidation entry not available @ grobid.yaml?
Issue -
State: closed - Opened by pcalais 2 months ago
- 1 comment
#1209 - Strange header mis-classificaton from DL
Issue -
State: open - Opened by lfoppiano 2 months ago
- 1 comment
#1208 - Headnote missing and/or having wrong labels
Issue -
State: open - Opened by ronny3 3 months ago
- 4 comments
Labels: error cases, models:segmentation
#1207 - Handle incompleted/missclassified tables and figures
Pull Request -
State: closed - Opened by lfoppiano 3 months ago
- 1 comment
Labels: bug, enhancement
#1206 - Misclassified tables and/or figures maybe tossed incorrectly
Issue -
State: open - Opened by lfoppiano 3 months ago
- 4 comments
Labels: bug, implemented
#1205 - Fix code scanning alert no. 41: Resolving XML external entity in user-controlled data
Pull Request -
State: closed - Opened by lfoppiano 3 months ago
- 1 comment
#1204 - Correctly replacement of the file extension when creating training data
Pull Request -
State: closed - Opened by lfoppiano 3 months ago
- 1 comment
#1203 - Fix fulltext block start
Pull Request -
State: open - Opened by lfoppiano 3 months ago
- 1 comment
#1203 - Fix fulltext block start
Pull Request -
State: closed - Opened by lfoppiano 3 months ago
- 1 comment
#1202 - Alternative articles processing flavors
Pull Request -
State: closed - Opened by lfoppiano 3 months ago
- 4 comments
#1201 - I want to contribute!
Issue -
State: open - Opened by chozillla 3 months ago
#1200 - Additional training data and model retrain for the segmentation
Pull Request -
State: closed - Opened by lfoppiano 3 months ago
- 2 comments
#1199 - Supplementary materials
Issue -
State: open - Opened by lfoppiano 4 months ago
#1199 - Supplementary materials
Issue -
State: open - Opened by lfoppiano 4 months ago
#1198 - Annex and body misclassification
Issue -
State: open - Opened by lfoppiano 4 months ago
Labels: error cases, models:segmentation
#1198 - Annex and body misclassification
Issue -
State: open - Opened by lfoppiano 4 months ago
Labels: error cases, implemented, models:segmentation
#1197 - Data availability tokens misclassified
Issue -
State: open - Opened by lfoppiano 4 months ago
Labels: error cases, models:header
#1197 - Data availability tokens misclassified
Issue -
State: open - Opened by lfoppiano 4 months ago
Labels: error cases, models:header
#1196 - Grobid processes the entire text sometimes but other times it doesn't process correctly and returns None as the output
Issue -
State: closed - Opened by Odrec 4 months ago
- 5 comments
#1195 - Issue with processFulltextDocument API: Frequent "503 Service Unavailable" Responses
Issue -
State: closed - Opened by sdspieg 4 months ago
- 2 comments
#1195 - Issue with processFulltextDocument API: Frequent "503 Service Unavailable" Responses
Issue -
State: open - Opened by sdspieg 4 months ago
- 1 comment
#1194 - Strange double-div structure added for acknowledgment
Issue -
State: closed - Opened by lfoppiano 4 months ago
- 1 comment
Labels: error cases, models:segmentation
#1194 - Strange double-div structure added for acknowledgment
Issue -
State: open - Opened by lfoppiano 4 months ago
Labels: error cases
#1193 - move TEI idno identifiers under <analytics>
Pull Request -
State: open - Opened by lfoppiano 4 months ago
- 3 comments
#1193 - move TEI idno identifiers under <analytics>
Pull Request -
State: open - Opened by lfoppiano 4 months ago
- 3 comments
#1192 - Correction of idno position under analytics
Issue -
State: open - Opened by lfoppiano 4 months ago
#1192 - Correction of idno position under analytics
Issue -
State: open - Opened by lfoppiano 4 months ago
#1191 - URLs where the regex capture less than the annotations are not consolidated with the clickable links from the PDF document
Issue -
State: open - Opened by lfoppiano 4 months ago
- 2 comments
Labels: bug
#1191 - URLs where the regex capture less than the annotations are not consolidated with the clickable links from the PDF document
Issue -
State: closed - Opened by lfoppiano 4 months ago
- 2 comments
Labels: bug
#1190 - Fix URL extraction when the regex falls short
Pull Request -
State: closed - Opened by lfoppiano 4 months ago
- 2 comments
#1189 - Fix internal links in the documentation
Pull Request -
State: closed - Opened by lfoppiano 4 months ago
- 1 comment
#1188 - Tensorflow 2.16
Pull Request -
State: open - Opened by kermitt2 4 months ago
- 2 comments
#1188 - Tensorflow 2.16
Pull Request -
State: open - Opened by kermitt2 4 months ago
- 2 comments
#1187 - Data availabilty extraction failure use cases
Issue -
State: open - Opened by lfoppiano 4 months ago
- 2 comments
Labels: error cases, implemented, models:segmentation
#1186 - Extracting tables spanning across multiple pages
Issue -
State: open - Opened by BC-Naman 4 months ago
#1186 - Extracting tables spanning across multiple pages
Issue -
State: open - Opened by BC-Naman 4 months ago
- 1 comment
#1185 - Update the URL regexes matching urls starting with a vulgar www.
Pull Request -
State: closed - Opened by lfoppiano 4 months ago
- 2 comments
#1185 - Update the URL regexes matching urls starting with a vulgar www.
Pull Request -
State: closed - Opened by lfoppiano 4 months ago
- 2 comments
#1184 - some URLs are not extracted in DAS
Issue -
State: closed - Opened by lfoppiano 4 months ago
- 1 comment
Labels: bug, enhancement
#1183 - fix(doc): Add hyperlink on documentation
Pull Request -
State: closed - Opened by annelhote 4 months ago
- 1 comment
#1183 - fix(doc): Add hyperlink on documentation
Pull Request -
State: closed - Opened by annelhote 4 months ago
- 1 comment
#1182 - fix: ignore IDE config file from VSCode
Pull Request -
State: closed - Opened by annelhote 4 months ago
- 1 comment
#1182 - fix: ignore IDE config file from VSCode
Pull Request -
State: closed - Opened by annelhote 4 months ago
- 1 comment
#1181 - Add includeRawCopyrights in the UI
Pull Request -
State: closed - Opened by annelhote 4 months ago
- 4 comments
#1180 - Questions
Issue -
State: closed - Opened by flckv 5 months ago
- 3 comments
Labels: question
#1179 - typo
Pull Request -
State: closed - Opened by annelhote 5 months ago
- 1 comment
#1178 - Null pointer exception on absent copyright model in 0.8.1. image
Issue -
State: closed - Opened by pasha-pplx 5 months ago
- 1 comment
Labels: info-needed
#1177 - docker: failed to create task for container on Windows
Issue -
State: open - Opened by charlesJHarrisIII 5 months ago
- 1 comment
Labels: Windows-specific, docker
#1176 - jep.JepException: <class 'lmdb.ReadonlyError'>: data/db/glove-840B: Permission denied
Issue -
State: closed - Opened by RLWOHIO 5 months ago
- 1 comment
#1175 - Empty refs
Issue -
State: open - Opened by lfoppiano 5 months ago
- 1 comment
Labels: bug, implemented, licence:needs_CC-BY
#1174 - improve issue template
Pull Request -
State: closed - Opened by lfoppiano 5 months ago
#1173 - hold cuda11.2 from upgrading to cuda12.2
Pull Request -
State: closed - Opened by vipulg13 5 months ago
- 1 comment
#1172 - Fix missing libcublas-12
Pull Request -
State: closed - Opened by lfoppiano 5 months ago
#1171 - Annotation of footnoted references for training custom Grobid models
Issue -
State: open - Opened by cboulanger 5 months ago
- 8 comments
#1170 - Container 0.8.1 not working with Nvidia GPUs
Issue -
State: closed - Opened by dcfidalgo 5 months ago
- 3 comments
Labels: bug, implemented
#1169 - Training data non TEI-conformant
Issue -
State: closed - Opened by cboulanger 5 months ago
- 3 comments
#1168 - Fix code scanning alert #41: Resolving XML external entity in user-controlled data
Pull Request -
State: closed - Opened by lfoppiano 5 months ago
- 2 comments
#1167 - Grobid docker container - location of grobid-trainer
Issue -
State: open - Opened by cboulanger 5 months ago
- 16 comments
Labels: docker
#1166 - Fix affiliation missing when using DL affiliation-address model
Pull Request -
State: closed - Opened by lfoppiano 5 months ago
- 4 comments
#1165 - Docker build for multi-architecture amd/arm
Pull Request -
State: open - Opened by lfoppiano 5 months ago
- 6 comments
#1164 - GROBID does not parse author affiliations anymore.
Issue -
State: closed - Opened by mbosten 5 months ago
- 3 comments
Labels: bug, implemented
#1163 - How to turn off logs when using GROBID in batch mode?
Issue -
State: open - Opened by Koruvika 5 months ago
#1162 - Not extracting when PDF is large
Issue -
State: open - Opened by victorcasignia 6 months ago
- 4 comments
Labels: error cases, models:fulltext, models:segmentation