Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / kermitt2/grobid issues and pull requests

#1247 - Bugfix/adjust notes avoid stackoverflow

Pull Request - State: closed - Opened by lfoppiano 5 days ago

#1246 - Avoid empty figures/tables reference markers

Pull Request - State: closed - Opened by lfoppiano 6 days ago - 1 comment
Labels: bug

#1245 - Fix flavor parameter

Pull Request - State: closed - Opened by lfoppiano 13 days ago - 1 comment

#1244 - Wrongly placed figure reference

Issue - State: open - Opened by lfoppiano 13 days ago
Labels: bug, implemented

#1243 - Server Memory Limit

Issue - State: closed - Opened by ccchan-cpii 17 days ago - 4 comments

#1242 - Avoid running aground when detecting the language

Pull Request - State: closed - Opened by lfoppiano 17 days ago - 1 comment

#1241 - Language cannot be null - when segmenting sentences

Issue - State: closed - Opened by lfoppiano 22 days ago
Labels: bug

#1240 - Training the fulltext model

Issue - State: open - Opened by martasoricetti 22 days ago - 12 comments

#1239 - Use lingua for language recognition

Pull Request - State: open - Opened by lfoppiano 24 days ago

#1238 - Fix the way table notes are streamed on the XML

Pull Request - State: closed - Opened by lfoppiano 25 days ago - 1 comment

#1237 - Training fulltext - annotation doubt

Issue - State: open - Opened by martasoricetti 30 days ago - 1 comment
Labels: question, training guidelines, models:fulltext

#1236 - For some PDFs fulltext document does not contain header document

Issue - State: open - Opened by loopdeloop76 about 1 month ago - 4 comments
Labels: question

#1235 - Avoid empty sentences

Pull Request - State: closed - Opened by lfoppiano about 1 month ago - 1 comment

#1233 - <ref type="figure" target="#fig_5"> missed

Issue - State: open - Opened by Samuel-Scalbert about 1 month ago - 3 comments
Labels: bug

#1232 - <p> duplicates

Issue - State: open - Opened by Samuel-Scalbert about 1 month ago - 7 comments
Labels: bug, implemented

#1231 - "Author contributions" section content is skipped by grobid

Issue - State: open - Opened by i-amkashif about 1 month ago - 1 comment
Labels: bug

#1230 - Missing Wapiti for linux arm64

Issue - State: open - Opened by AaronNGray about 1 month ago - 5 comments

#1229 - grobid run giving '/tini: 1: Syntax error: "(" unexpected'

Issue - State: open - Opened by AaronNGray about 1 month ago - 8 comments

#1228 - Updated Grobid lucene analyzers for CJK languages

Pull Request - State: closed - Opened by kermitt2 about 1 month ago - 3 comments

#1227 - Fix for CVE in CommonsIO

Pull Request - State: closed - Opened by kermitt2 about 1 month ago - 2 comments

#1226 - Replace cybozu lang detection with Lingua

Issue - State: open - Opened by lfoppiano about 1 month ago
Labels: enhancement

#1224 - make the start/end page for header processing customizable #282

Pull Request - State: closed - Opened by lfoppiano about 1 month ago
Labels: enhancement

#1222 - Build Failed on Ubuntu GLIBC 2.35 (runs well on 2.31)

Issue - State: open - Opened by laoliu5280 about 1 month ago - 2 comments
Labels: Linux-specific

#1221 - DAS extraction issues in Plos Articles

Issue - State: open - Opened by lfoppiano about 1 month ago - 1 comment
Labels: error cases, models:segmentation, models:header

#1220 - How should I annotate markers to list items? - training the fulltext model

Issue - State: open - Opened by martasoricetti about 2 months ago - 1 comment
Labels: question, training guidelines

#1219 - When ARM64 DLL will be available (present in lib and pdfalto)

Issue - State: open - Opened by manjulahonnappa-agi about 2 months ago - 4 comments

#1218 - Regarding GROBID support ARM64

Issue - State: open - Opened by manjulahonnappa-agi about 2 months ago - 1 comment
Labels: need help, macOS-specific

#1216 - Update pdfalto recognition of non-standard fonts

Pull Request - State: open - Opened by lfoppiano about 2 months ago - 1 comment

#1215 - Process figures,tables and equations from back/annex section

Pull Request - State: open - Opened by lfoppiano about 2 months ago - 1 comment

#1214 - add corrected training data for the segmentation model

Pull Request - State: open - Opened by lfoppiano about 2 months ago - 1 comment

#1213 - Missing last line in a page but it's due to the segmentation model

Issue - State: open - Opened by lfoppiano about 2 months ago
Labels: error cases, models:segmentation

#1212 - Collect "other" text on request

Pull Request - State: closed - Opened by lfoppiano 2 months ago - 1 comment

#1211 - Fix Deep Learning header classification inconsistency

Pull Request - State: open - Opened by lfoppiano 2 months ago - 1 comment

#1210 - consolidation entry not available @ grobid.yaml?

Issue - State: closed - Opened by pcalais 2 months ago - 1 comment

#1209 - Strange header mis-classificaton from DL

Issue - State: open - Opened by lfoppiano 2 months ago - 1 comment

#1208 - Headnote missing and/or having wrong labels

Issue - State: open - Opened by ronny3 3 months ago - 4 comments
Labels: error cases, models:segmentation

#1207 - Handle incompleted/missclassified tables and figures

Pull Request - State: closed - Opened by lfoppiano 3 months ago - 1 comment
Labels: bug, enhancement

#1206 - Misclassified tables and/or figures maybe tossed incorrectly

Issue - State: open - Opened by lfoppiano 3 months ago - 4 comments
Labels: bug, implemented

#1204 - Correctly replacement of the file extension when creating training data

Pull Request - State: closed - Opened by lfoppiano 3 months ago - 1 comment

#1203 - Fix fulltext block start

Pull Request - State: open - Opened by lfoppiano 3 months ago - 1 comment

#1203 - Fix fulltext block start

Pull Request - State: closed - Opened by lfoppiano 3 months ago - 1 comment

#1202 - Alternative articles processing flavors

Pull Request - State: closed - Opened by lfoppiano 3 months ago - 4 comments

#1201 - I want to contribute!

Issue - State: open - Opened by chozillla 3 months ago

#1200 - Additional training data and model retrain for the segmentation

Pull Request - State: closed - Opened by lfoppiano 3 months ago - 2 comments

#1199 - Supplementary materials

Issue - State: open - Opened by lfoppiano 4 months ago

#1198 - Annex and body misclassification

Issue - State: open - Opened by lfoppiano 4 months ago
Labels: error cases, models:segmentation

#1198 - Annex and body misclassification

Issue - State: open - Opened by lfoppiano 4 months ago
Labels: error cases, implemented, models:segmentation

#1197 - Data availability tokens misclassified

Issue - State: open - Opened by lfoppiano 4 months ago
Labels: error cases, models:header

#1194 - Strange double-div structure added for acknowledgment

Issue - State: closed - Opened by lfoppiano 4 months ago - 1 comment
Labels: error cases, models:segmentation

#1194 - Strange double-div structure added for acknowledgment

Issue - State: open - Opened by lfoppiano 4 months ago
Labels: error cases

#1193 - move TEI idno identifiers under <analytics>

Pull Request - State: open - Opened by lfoppiano 4 months ago - 3 comments

#1192 - Correction of idno position under analytics

Issue - State: open - Opened by lfoppiano 4 months ago

#1190 - Fix URL extraction when the regex falls short

Pull Request - State: closed - Opened by lfoppiano 4 months ago - 2 comments

#1189 - Fix internal links in the documentation

Pull Request - State: closed - Opened by lfoppiano 4 months ago - 1 comment

#1188 - Tensorflow 2.16

Pull Request - State: open - Opened by kermitt2 4 months ago - 2 comments

#1187 - Data availabilty extraction failure use cases

Issue - State: open - Opened by lfoppiano 4 months ago - 2 comments
Labels: error cases, implemented, models:segmentation

#1186 - Extracting tables spanning across multiple pages

Issue - State: open - Opened by BC-Naman 4 months ago

#1186 - Extracting tables spanning across multiple pages

Issue - State: open - Opened by BC-Naman 4 months ago - 1 comment

#1185 - Update the URL regexes matching urls starting with a vulgar www.

Pull Request - State: closed - Opened by lfoppiano 4 months ago - 2 comments

#1184 - some URLs are not extracted in DAS

Issue - State: closed - Opened by lfoppiano 4 months ago - 1 comment
Labels: bug, enhancement

#1183 - fix(doc): Add hyperlink on documentation

Pull Request - State: closed - Opened by annelhote 4 months ago - 1 comment

#1182 - fix: ignore IDE config file from VSCode

Pull Request - State: closed - Opened by annelhote 4 months ago - 1 comment

#1181 - Add includeRawCopyrights in the UI

Pull Request - State: closed - Opened by annelhote 4 months ago - 4 comments

#1180 - Questions

Issue - State: closed - Opened by flckv 5 months ago - 3 comments
Labels: question

#1179 - typo

Pull Request - State: closed - Opened by annelhote 5 months ago - 1 comment

#1178 - Null pointer exception on absent copyright model in 0.8.1. image

Issue - State: closed - Opened by pasha-pplx 5 months ago - 1 comment
Labels: info-needed

#1177 - docker: failed to create task for container on Windows

Issue - State: open - Opened by charlesJHarrisIII 5 months ago - 1 comment
Labels: Windows-specific, docker

#1175 - Empty refs

Issue - State: open - Opened by lfoppiano 5 months ago - 1 comment
Labels: bug, implemented, licence:needs_CC-BY

#1174 - improve issue template

Pull Request - State: closed - Opened by lfoppiano 5 months ago

#1173 - hold cuda11.2 from upgrading to cuda12.2

Pull Request - State: closed - Opened by vipulg13 5 months ago - 1 comment

#1172 - Fix missing libcublas-12

Pull Request - State: closed - Opened by lfoppiano 5 months ago

#1171 - Annotation of footnoted references for training custom Grobid models

Issue - State: open - Opened by cboulanger 5 months ago - 8 comments

#1170 - Container 0.8.1 not working with Nvidia GPUs

Issue - State: closed - Opened by dcfidalgo 5 months ago - 3 comments
Labels: bug, implemented

#1169 - Training data non TEI-conformant

Issue - State: closed - Opened by cboulanger 5 months ago - 3 comments

#1168 - Fix code scanning alert #41: Resolving XML external entity in user-controlled data

Pull Request - State: closed - Opened by lfoppiano 5 months ago - 2 comments

#1167 - Grobid docker container - location of grobid-trainer

Issue - State: open - Opened by cboulanger 5 months ago - 16 comments
Labels: docker

#1166 - Fix affiliation missing when using DL affiliation-address model

Pull Request - State: closed - Opened by lfoppiano 5 months ago - 4 comments

#1165 - Docker build for multi-architecture amd/arm

Pull Request - State: open - Opened by lfoppiano 5 months ago - 6 comments

#1164 - GROBID does not parse author affiliations anymore.

Issue - State: closed - Opened by mbosten 5 months ago - 3 comments
Labels: bug, implemented

#1162 - Not extracting when PDF is large

Issue - State: open - Opened by victorcasignia 6 months ago - 4 comments
Labels: error cases, models:fulltext, models:segmentation