Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / opendatalab/MinerU issues and pull requests

#1076 - Release 0.10.1

Pull Request - State: closed - Opened by myhloli 4 days ago

#1075 - demo: batch process demo PDFs

Pull Request - State: closed - Opened by myhloli 4 days ago

#1073 - Detection of Umlaut / vowel mutation in German OCR

Issue - State: open - Opened by myjob 4 days ago - 2 comments
Labels: bug

#1072 - 某些情况下caption和footnote的错误匹配

Issue - State: open - Opened by wanxueyao 5 days ago
Labels: bug

#1071 - Fix/demo

Pull Request - State: closed - Opened by icecraft 5 days ago - 1 comment

#1070 - 内存泄漏导致进程被杀死

Issue - State: open - Opened by Cokejia 6 days ago - 4 comments
Labels: bug

#1069 - Look at this... 👀

Issue - State: closed - Opened by Davidjennison1 6 days ago

#1068 - 0.10.0跑magic_pdf_parse_main.py demo修改了一下magic-pdf.json报错,求解

Issue - State: closed - Opened by boranyang-ML 7 days ago - 2 comments
Labels: bug

#1067 - CUDA device is not set properly

Issue - State: open - Opened by HakunanMatatat 7 days ago - 1 comment
Labels: bug

#1066 - master -> dev

Pull Request - State: closed - Opened by myhloli 7 days ago - 1 comment

#1065 - fix(pdf_parse): improve OCR result handling

Pull Request - State: closed - Opened by myhloli 7 days ago

#1064 - fix(pdf_parse): improve OCR result handling

Pull Request - State: closed - Opened by myhloli 7 days ago

#1063 - Release 0.10.0

Pull Request - State: closed - Opened by myhloli 7 days ago

#1059 - feat(README): update for v0.10.0

Pull Request - State: closed - Opened by myhloli 7 days ago

#1057 - pipe_mk_uni_format 要如何返回坐标

Issue - State: open - Opened by sph116 7 days ago
Labels: enhancement

#1056 - Add test cases to json compressor util

Pull Request - State: closed - Opened by liugongjian 7 days ago

#1055 - CPU占用高,貌似没有充分使用GPU

Issue - State: closed - Opened by singeleaf 7 days ago - 2 comments
Labels: enhancement

#1053 - fix(pdf_parse): improve line stop flag detection accuracy

Pull Request - State: closed - Opened by myhloli 7 days ago

#1052 - fix: use concrete class instead of abstract class

Pull Request - State: closed - Opened by icecraft 8 days ago

#1051 - 请问下,mineru支持对word文档的解析吗?

Issue - State: open - Opened by asenasen123 8 days ago - 2 comments
Labels: enhancement

#1050 - refactor(txt_parse): improve text extraction accuracy with new algorithm

Pull Request - State: closed - Opened by myhloli 8 days ago - 2 comments

#1049 - feat(ocr): improve text detection and OCR accuracy

Pull Request - State: closed - Opened by myhloli 8 days ago

#1047 - fix(ocr_mkcontent): improve hyphen handling at line ends

Pull Request - State: closed - Opened by myhloli 8 days ago

#1045 - fix(tools): handle empty language string in common.py

Pull Request - State: closed - Opened by myhloli 8 days ago

#1044 - 表格布局识别不正确

Issue - State: open - Opened by squirrelfish 8 days ago - 5 comments
Labels: bug

#1043 - 【模型加载求助】

Issue - State: closed - Opened by yingliu0518 8 days ago - 4 comments

#1042 - rapidocr_paddle

Issue - State: open - Opened by lyc728 8 days ago - 2 comments

#1041 - 能不能做到标题和正文在一行时对标题的识别

Issue - State: open - Opened by ZzYAmbition 8 days ago
Labels: enhancement

#1039 - 请问目前对于并发度的支持是怎么样呢?如果需要多并发度怎么操作?

Issue - State: closed - Opened by Muyi030 9 days ago - 1 comment
Labels: enhancement

#1038 - There are reading order problems in this published version

Issue - State: closed - Opened by zahrarsl 9 days ago - 8 comments
Labels: bug

#1037 - AttributeError: 'tuple' object has no attribute 'shape'

Issue - State: open - Opened by xuhongtian 9 days ago - 6 comments
Labels: bug

#1036 - fix: remove test code

Pull Request - State: closed - Opened by icecraft 9 days ago

#1035 - 批量测试

Issue - State: closed - Opened by lyc728 9 days ago - 3 comments

#1034 - 【可复现】报错:pymupdf.mupdf.FzErrorSyntax: code=8: Failed to decode JPX image

Issue - State: closed - Opened by CocoaML 9 days ago - 4 comments
Labels: bug

#1033 - 图并没有截图到文件中

Issue - State: closed - Opened by lyc728 9 days ago

#1032 - Request for Bengali Language Support in OCR

Issue - State: open - Opened by raselmeya94 9 days ago
Labels: enhancement

#1030 - 是否能支持batch批跑呢

Issue - State: open - Opened by charliedream1 10 days ago
Labels: enhancement

#1029 - 提取错误

Issue - State: open - Opened by YANGtzeRi 10 days ago - 3 comments
Labels: bug

#1027 - refactor: move some constants or enums defs to config folder

Pull Request - State: closed - Opened by icecraft 10 days ago

#1026 - 请问NVIDIA-SMI 510.54 Driver Version: 510.54 CUDA Version: 11.6可以使用GPU加速吗

Issue - State: closed - Opened by Muyi030 10 days ago - 2 comments
Labels: enhancement

#1024 - delete unused pipeline file

Pull Request - State: closed - Opened by liugongjian 10 days ago - 2 comments

#1023 - 新版本0.93报错 发现是公式解析模型的时候

Issue - State: closed - Opened by 3300752199 10 days ago - 1 comment
Labels: bug

#1021 - 新手想问问怎么启动源码?目的是想将识别为figure的强制ocr提取文本信息

Issue - State: closed - Opened by aodingpeng 10 days ago - 9 comments
Labels: enhancement

#1020 - layout识别错位

Issue - State: open - Opened by FHhui 10 days ago - 3 comments
Labels: bug

#1019 - 使用magic-pdf命令,报错OpenBLAS线程限制

Issue - State: open - Opened by Muyi030 10 days ago - 1 comment
Labels: bug

#1017 - ppocr DEBUG 请问这是错误吗?

Issue - State: closed - Opened by sanwacompany 10 days ago - 2 comments
Labels: bug

#1016 - build(setup): add old_linux specific dependencies

Pull Request - State: closed - Opened by myhloli 10 days ago

#1014 - 【QA】mineru公式后处理问题

Issue - State: closed - Opened by dt-yy 11 days ago - 1 comment
Labels: bug

#1013 - refactor(para): improve paragraph splitting logic

Pull Request - State: closed - Opened by myhloli 11 days ago

#1012 - add DocLayout-YOLO url

Pull Request - State: closed - Opened by qiangqiang199 11 days ago - 1 comment

#1011 - add Doclayout-yolo url

Pull Request - State: closed - Opened by qiangqiang199 11 days ago - 1 comment

#1010 - feat(ocr): improve handling of angled text boxes

Pull Request - State: closed - Opened by myhloli 11 days ago

#1009 - 标题识别和代码识别需求

Issue - State: closed - Opened by Tian14267 11 days ago - 6 comments
Labels: enhancement

#1007 - 页眉页脚解析问题

Issue - State: open - Opened by zhongxin129 11 days ago
Labels: bug

#1006 - fix: using new data api replace old rw api

Pull Request - State: closed - Opened by icecraft 11 days ago

#1005 - fastapi部署时,返回结果出错

Issue - State: open - Opened by asenasen123 11 days ago - 1 comment
Labels: bug

#1002 - 内网无法访问huggingface

Issue - State: closed - Opened by yq-warehouse 11 days ago - 24 comments
Labels: enhancement

#1000 - 请问目前能支持centos7系统吗

Issue - State: closed - Opened by Muyi030 11 days ago - 7 comments
Labels: enhancement

#999 - `unimernet` CustomMBartDecoder does not support Flash Attention 2

Issue - State: open - Opened by sepcnt 11 days ago
Labels: bug

#998 - test(unitest): Restore unit test cases

Pull Request - State: closed - Opened by myhloli 11 days ago

#997 - 使用Quick CPU Demo中的命令下载预编译错误

Issue - State: closed - Opened by yq-warehouse 11 days ago - 4 comments
Labels: bug

#996 - 如何使用RapidTable?改配置文件不生效

Issue - State: closed - Opened by charliedream1 11 days ago - 24 comments
Labels: bug

#995 - 在Django中启动项目后出现了内存溢出

Issue - State: closed - Opened by haoweiwang0 11 days ago - 1 comment
Labels: bug

#994 - MinerU无法识别多级标题,识别的标题全部归为一级标题

Issue - State: closed - Opened by JoshonSmith 11 days ago - 2 comments
Labels: enhancement

#993 - Good

Issue - State: closed - Opened by Davidjennison1 11 days ago

#992 - PaddlePaddle相关问题复现case

Issue - State: open - Opened by phlrain 11 days ago - 1 comment
Labels: enhancement

#990 - 3 requirements files are there which one should use

Issue - State: closed - Opened by Akshaybhure111 11 days ago - 1 comment
Labels: bug

#988 - T

Issue - State: closed - Opened by Davidjennison1 12 days ago
Labels: enhancement

#987 - Error related to script

Issue - State: closed - Opened by Akshaybhure111 12 days ago - 9 comments
Labels: bug

#986 - update ci

Pull Request - State: closed - Opened by dt-yy 12 days ago

#985 - 【QA】0.9.3版本配置改成table-master生成的md表格为图片

Issue - State: closed - Opened by dt-yy 12 days ago - 1 comment
Labels: bug

#983 - 【QA】0.9.3版本 单词黏连问题

Issue - State: closed - Opened by dt-yy 12 days ago - 1 comment
Labels: bug

#982 - 【QA】0.9.0版本行内公式前后多了空格

Issue - State: closed - Opened by dt-yy 12 days ago - 1 comment
Labels: bug

#981 - 【QA】MinerU0.9.0 API版本从 Hugging Face 下载模型 error

Issue - State: closed - Opened by dt-yy 12 days ago - 1 comment
Labels: bug

#980 - argument expect 3 but 4 given

Issue - State: closed - Opened by Akshaybhure111 12 days ago - 2 comments
Labels: bug

#979 - 不知道可否支持 MLX

Issue - State: open - Opened by yibie 13 days ago - 2 comments
Labels: enhancement

#978 - 希望能添加控制输出结构的选项

Issue - State: closed - Opened by yibie 13 days ago - 1 comment
Labels: enhancement

#977 - docs: update readme

Pull Request - State: closed - Opened by myhloli 13 days ago

#977 - docs: update readme

Pull Request - State: closed - Opened by myhloli 13 days ago

#976 - Dev to 0.9.3

Pull Request - State: closed - Opened by myhloli 13 days ago

#976 - Dev to 0.9.3

Pull Request - State: closed - Opened by myhloli 13 days ago