Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / opendatalab/MinerU issues and pull requests
#1076 - Release 0.10.1
Pull Request -
State: closed - Opened by myhloli 4 days ago
#1075 - demo: batch process demo PDFs
Pull Request -
State: closed - Opened by myhloli 4 days ago
#1074 - feat(demo): add visualization bbox parameter and refactor parsing process
Pull Request -
State: closed - Opened by myhloli 4 days ago
#1073 - Detection of Umlaut / vowel mutation in German OCR
Issue -
State: open - Opened by myjob 4 days ago
- 2 comments
Labels: bug
#1072 - Incorrect matching of caption and footnote in some cases
Issue -
State: open - Opened by wanxueyao 5 days ago
Labels: bug
#1071 - Fix/demo
Pull Request -
State: closed - Opened by icecraft 5 days ago
- 1 comment
#1070 - Memory leak causes the process to be killed
Issue -
State: open - Opened by Cokejia 6 days ago
- 4 comments
Labels: bug
#1069 - Look at this... 👀
Issue -
State: closed - Opened by Davidjennison1 6 days ago
#1068 - Error running the magic_pdf_parse_main.py demo on 0.10.0 after modifying magic-pdf.json, help needed
Issue -
State: closed - Opened by boranyang-ML 7 days ago
- 2 comments
Labels: bug
#1067 - CUDA device is not set properly
Issue -
State: open - Opened by HakunanMatatat 7 days ago
- 1 comment
Labels: bug
#1066 - master -> dev
Pull Request -
State: closed - Opened by myhloli 7 days ago
- 1 comment
#1065 - fix(pdf_parse): improve OCR result handling
Pull Request -
State: closed - Opened by myhloli 7 days ago
#1064 - fix(pdf_parse): improve OCR result handling
Pull Request -
State: closed - Opened by myhloli 7 days ago
#1063 - Release 0.10.0
Pull Request -
State: closed - Opened by myhloli 7 days ago
#1062 - fix(table): add null check for OCR result in rapid table prediction
Pull Request -
State: closed - Opened by myhloli 7 days ago
#1061 - refactor(model): move page total time logging to custom model analysis
Pull Request -
State: closed - Opened by myhloli 7 days ago
#1060 - fix(table): add null check for OCR result in rapid table prediction
Pull Request -
State: closed - Opened by myhloli 7 days ago
#1059 - feat(README): update for v0.10.0
Pull Request -
State: closed - Opened by myhloli 7 days ago
#1058 - refactor(para): improve line stop flag and remove unused debug mode
Pull Request -
State: closed - Opened by myhloli 7 days ago
#1057 - How can pipe_mk_uni_format return coordinates?
Issue -
State: open - Opened by sph116 7 days ago
Labels: enhancement
#1056 - Add test cases to json compressor util
Pull Request -
State: closed - Opened by liugongjian 7 days ago
#1055 - High CPU usage, GPU apparently not fully utilized
Issue -
State: closed - Opened by singeleaf 7 days ago
- 2 comments
Labels: enhancement
#1054 - test: comment out assertions for metascan classify and meta scan tests
Pull Request -
State: closed - Opened by myhloli 7 days ago
#1053 - fix(pdf_parse): improve line stop flag detection accuracy
Pull Request -
State: closed - Opened by myhloli 7 days ago
#1052 - fix: use concrete class instead of abstract class
Pull Request -
State: closed - Opened by icecraft 8 days ago
#1051 - Does mineru support parsing Word documents?
Issue -
State: open - Opened by asenasen123 8 days ago
- 2 comments
Labels: enhancement
#1050 - refactor(txt_parse): improve text extraction accuracy with new algorithm
Pull Request -
State: closed - Opened by myhloli 8 days ago
- 2 comments
#1049 - feat(ocr): improve text detection and OCR accuracy
Pull Request -
State: closed - Opened by myhloli 8 days ago
#1048 - fix(remove_overlaps_min_spans): optimize overlap detection in OCR span list modification
Pull Request -
State: closed - Opened by myhloli 8 days ago
#1047 - fix(ocr_mkcontent): improve hyphen handling at line ends
Pull Request -
State: closed - Opened by myhloli 8 days ago
#1046 - refactor(ocr_dict_merge): add threshold parameter for line merging
Pull Request -
State: closed - Opened by myhloli 8 days ago
#1045 - fix(tools): handle empty language string in common.py
Pull Request -
State: closed - Opened by myhloli 8 days ago
#1044 - Table layout recognized incorrectly
Issue -
State: open - Opened by squirrelfish 8 days ago
- 5 comments
Labels: bug
#1043 - [Help with model loading]
Issue -
State: closed - Opened by yingliu0518 8 days ago
- 4 comments
#1042 - rapidocr_paddle
Issue -
State: open - Opened by lyc728 8 days ago
- 2 comments
#1041 - Can titles be recognized when the title and body text are on the same line?
Issue -
State: open - Opened by ZzYAmbition 8 days ago
Labels: enhancement
#1039 - What is the current level of concurrency support? How should multiple concurrent jobs be run?
Issue -
State: closed - Opened by Muyi030 9 days ago
- 1 comment
Labels: enhancement
#1038 - There are reading order problems in this published version
Issue -
State: closed - Opened by zahrarsl 9 days ago
- 8 comments
Labels: bug
#1037 - AttributeError: 'tuple' object has no attribute 'shape'
Issue -
State: open - Opened by xuhongtian 9 days ago
- 6 comments
Labels: bug
#1036 - fix: remove test code
Pull Request -
State: closed - Opened by icecraft 9 days ago
#1035 - Batch testing
Issue -
State: closed - Opened by lyc728 9 days ago
- 3 comments
#1034 - [Reproducible] Error: pymupdf.mupdf.FzErrorSyntax: code=8: Failed to decode JPX image
Issue -
State: closed - Opened by CocoaML 9 days ago
- 4 comments
Labels: bug
#1033 - Figures are not cropped and saved to files
Issue -
State: closed - Opened by lyc728 9 days ago
#1032 - Request for Bengali Language Support in OCR
Issue -
State: open - Opened by raselmeya94 9 days ago
Labels: enhancement
#1031 - How to run OCR on a single image containing multiple languages (Simplified Chinese, English, Korean, Traditional Chinese, Japanese, etc.)
Issue -
State: open - Opened by huyidu 9 days ago
- 1 comment
Labels: bug
#1030 - Can batch runs be supported?
Issue -
State: open - Opened by charliedream1 10 days ago
Labels: enhancement
#1029 - Extraction error
Issue -
State: open - Opened by YANGtzeRi 10 days ago
- 3 comments
Labels: bug
#1028 - Using RapidTable for table recognition with table recognition enabled in table-config, the result is an image instead of html
Issue -
State: closed - Opened by mrslimslim 10 days ago
- 14 comments
Labels: bug
#1027 - refactor: move some constants or enums defs to config folder
Pull Request -
State: closed - Opened by icecraft 10 days ago
#1026 - Can GPU acceleration be used with NVIDIA-SMI 510.54 Driver Version: 510.54 CUDA Version: 11.6?
Issue -
State: closed - Opened by Muyi030 10 days ago
- 2 comments
Labels: enhancement
#1025 - Including link: https://aquasecurity.github.io/
Issue -
State: closed - Opened by Davidjennison1 10 days ago
#1024 - delete unused pipeline file
Pull Request -
State: closed - Opened by liugongjian 10 days ago
- 2 comments
#1023 - New version 0.93 raises an error, found to occur in the formula parsing model
Issue -
State: closed - Opened by 3300752199 10 days ago
- 1 comment
Labels: bug
#1022 - Please help with my problem: a pdf file that ran fine on the original 0.8.1 version fails after switching to the new framework
Issue -
State: closed - Opened by farierer 10 days ago
- 2 comments
Labels: bug
#1021 - Beginner question: how do I run from source? The goal is to force OCR text extraction on regions recognized as figure
Issue -
State: closed - Opened by aodingpeng 10 days ago
- 9 comments
Labels: enhancement
#1020 - Layout recognition is misaligned
Issue -
State: open - Opened by FHhui 10 days ago
- 3 comments
Labels: bug
#1019 - Using the magic-pdf command raises an OpenBLAS thread limit error
Issue -
State: open - Opened by Muyi030 10 days ago
- 1 comment
Labels: bug
#1018 - refactor(para): adjust right margin threshold based on block width
Pull Request -
State: closed - Opened by myhloli 10 days ago
#1017 - ppocr DEBUG: is this an error?
Issue -
State: closed - Opened by sanwacompany 10 days ago
- 2 comments
Labels: bug
#1016 - build(setup): add old_linux specific dependencies
Pull Request -
State: closed - Opened by myhloli 10 days ago
#1015 - ERROR: detectron2-0.6-cp310-cp310-macosx_10_9_universal2.whl is not a supported wheel on this platform.
Issue -
State: closed - Opened by CyberAsteroid 11 days ago
- 2 comments
#1014 - [QA] mineru formula post-processing issue
Issue -
State: closed - Opened by dt-yy 11 days ago
- 1 comment
Labels: bug
#1013 - refactor(para): improve paragraph splitting logic
Pull Request -
State: closed - Opened by myhloli 11 days ago
#1012 - add DocLayout-YOLO url
Pull Request -
State: closed - Opened by qiangqiang199 11 days ago
- 1 comment
#1011 - add Doclayout-yolo url
Pull Request -
State: closed - Opened by qiangqiang199 11 days ago
- 1 comment
#1010 - feat(ocr): improve handling of angled text boxes
Pull Request -
State: closed - Opened by myhloli 11 days ago
#1009 - Feature request: title recognition and code recognition
Issue -
State: closed - Opened by Tian14267 11 days ago
- 6 comments
Labels: enhancement
#1008 - FastAPI PDF parsing endpoint: where can the parsed md files and images be found?
Issue -
State: open - Opened by asenasen123 11 days ago
Labels: bug
#1007 - Header and footer parsing issue
Issue -
State: open - Opened by zhongxin129 11 days ago
Labels: bug
#1006 - fix: using new data api replace old rw api
Pull Request -
State: closed - Opened by icecraft 11 days ago
#1005 - Incorrect result returned when deployed with fastapi
Issue -
State: open - Opened by asenasen123 11 days ago
- 1 comment
Labels: bug
#1004 - Note: Centos7 is not supported because the new albumentations version depends on simsimd
Issue -
State: closed - Opened by myhloli 11 days ago
#1002 - huggingface is not accessible from an internal network
Issue -
State: closed - Opened by yq-warehouse 11 days ago
- 24 comments
Labels: enhancement
#1001 - refactor(tests): extract common test utilities into test_commons.py
Pull Request -
State: closed - Opened by myhloli 11 days ago
#1000 - Is centos7 currently supported?
Issue -
State: closed - Opened by Muyi030 11 days ago
- 7 comments
Labels: enhancement
#999 - `unimernet` CustomMBartDecoder does not support Flash Attention 2
Issue -
State: open - Opened by sepcnt 11 days ago
Labels: bug
#998 - test(unitest): Restore unit test cases
Pull Request -
State: closed - Opened by myhloli 11 days ago
#997 - Precompiled download error when using the command from Quick CPU Demo
Issue -
State: closed - Opened by yq-warehouse 11 days ago
- 4 comments
Labels: bug
#996 - How to use RapidTable? Changing the config file has no effect
Issue -
State: closed - Opened by charliedream1 11 days ago
- 24 comments
Labels: bug
#995 - Out-of-memory error after starting the project in Django
Issue -
State: closed - Opened by haoweiwang0 11 days ago
- 1 comment
Labels: bug
#994 - MinerU cannot recognize multi-level headings; all recognized headings are treated as level-1 headings
Issue -
State: closed - Opened by JoshonSmith 11 days ago
- 2 comments
Labels: enhancement
#993 - Good
Issue -
State: closed - Opened by Davidjennison1 11 days ago
#992 - Reproduction cases for PaddlePaddle-related issues
Issue -
State: open - Opened by phlrain 11 days ago
- 1 comment
Labels: enhancement
#991 - Post in thread 'Boba's Dakar Yellow E46 M3 to CSL look-a-likey'
Issue -
State: closed - Opened by Davidjennison1 11 days ago
#990 - 3 requirements files are there which one should use
Issue -
State: closed - Opened by Akshaybhure111 11 days ago
- 1 comment
Labels: bug
#989 - how have you processed the blocks after finding out the layout order?
Issue -
State: closed - Opened by vikas-singh16 11 days ago
- 2 comments
#988 - T
Issue -
State: closed - Opened by Davidjennison1 12 days ago
Labels: enhancement
#987 - Error related to script
Issue -
State: closed - Opened by Akshaybhure111 12 days ago
- 9 comments
Labels: bug
#986 - update ci
Pull Request -
State: closed - Opened by dt-yy 12 days ago
#985 - [QA] Version 0.9.3: with the config changed to table-master, tables in the generated md are images
Issue -
State: closed - Opened by dt-yy 12 days ago
- 1 comment
Labels: bug
#983 - [QA] Version 0.9.3: words run together
Issue -
State: closed - Opened by dt-yy 12 days ago
- 1 comment
Labels: bug
#982 - [QA] Version 0.9.0: extra spaces before and after inline formulas
Issue -
State: closed - Opened by dt-yy 12 days ago
- 1 comment
Labels: bug
#981 - [QA] MinerU0.9.0 API version: error downloading models from Hugging Face
Issue -
State: closed - Opened by dt-yy 12 days ago
- 1 comment
Labels: bug
#980 - argument expect 3 but 4 given
Issue -
State: closed - Opened by Akshaybhure111 12 days ago
- 2 comments
Labels: bug
#979 - Could MLX be supported?
Issue -
State: open - Opened by yibie 13 days ago
- 2 comments
Labels: enhancement
#978 - Request: add an option to control the output structure
Issue -
State: closed - Opened by yibie 13 days ago
- 1 comment
Labels: enhancement
#977 - docs: update readme
Pull Request -
State: closed - Opened by myhloli 13 days ago
#976 - Dev to 0.9.3
Pull Request -
State: closed - Opened by myhloli 13 days ago