Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / opendatalab/MinerU issues and pull requests

#662 - IndexError: index 0 is out of bounds for axis 0 with size 0

Issue - State: open - Opened by zuanzuanshao 2 months ago - 2 comments
Labels: bug

#661 - feat:web and web_demo

Pull Request - State: closed - Opened by LollipopsAndWine 2 months ago - 1 comment

#660 - Some content in pdf is not recognized

Issue - State: closed - Opened by liangqf108 2 months ago - 1 comment
Labels: bug

#659 - 0.7.1 版本中gpu运行问题

Issue - State: closed - Opened by James-Dao 2 months ago - 34 comments
Labels: bug

#658 - `Segmentation fault` is detected by the operating system.

Issue - State: open - Opened by Justin18Chan 2 months ago - 4 comments
Labels: bug

#657 - 文档标题无法识别

Issue - State: closed - Opened by Ceceliachenen 2 months ago - 3 comments
Labels: bug

#656 - gradio怎么显示行内符号和图片?

Issue - State: closed - Opened by sunzx8 2 months ago - 2 comments

#655 - "device-mode": "cuda:1",请问可不可以这样写?

Issue - State: closed - Opened by randydl 2 months ago - 10 comments
Labels: enhancement

#654 - 能否提供简易的更新方法

Issue - State: closed - Opened by XSR-WatchPioneer 2 months ago - 2 comments
Labels: enhancement

#653 - 下载后一直是0.6.1版本 强制选定版本就会报缺少其它包

Issue - State: closed - Opened by Twilight-Spider 2 months ago - 10 comments
Labels: bug

#652 - feat: wep_api and web

Pull Request - State: closed - Opened by LollipopsAndWine 2 months ago - 1 comment

#651 - 【QA】研报中文章标题被识别为Table xx

Issue - State: closed - Opened by dt-yy 2 months ago - 1 comment
Labels: bug, backlog

#650 - 【QA】0.8.1版本研报表格的说明识别错误

Issue - State: closed - Opened by dt-yy 2 months ago - 2 comments
Labels: bug, backlog

#649 - 建议把表格中,每个单元格的坐标,返回到pipline中

Issue - State: open - Opened by Vawter-001 2 months ago
Labels: enhancement

#648 - 这种手写的图片转成的PDF识别不了文字嘛

Issue - State: closed - Opened by THEONEBUKE 2 months ago - 1 comment
Labels: bug

#645 - feat: add test case

Pull Request - State: closed - Opened by dt-yy 2 months ago

#643 - demo报错

Issue - State: closed - Opened by tqangxl 2 months ago - 3 comments
Labels: bug

#640 - ImportError: cannot import name 'preserve_channel_dim' from 'albucore.utils'

Issue - State: closed - Opened by hxypqr 2 months ago - 2 comments
Labels: bug

#638 - 希望能保留下划线等占位符,希望能保留

Issue - State: closed - Opened by jeremyWangJun03 2 months ago - 1 comment
Labels: bug

#634 - 识别表格的时候只给出了图片,不是json数据。

Issue - State: closed - Opened by stormsea 2 months ago - 1 comment
Labels: bug

#633 - 在表格识别时内容缺失

Issue - State: closed - Opened by YoungWWan 2 months ago - 2 comments
Labels: bug

#632 - 是否有脚本的启动样例,只有命令行启动感觉不太满足后续开发需要,

Issue - State: closed - Opened by FHhui 2 months ago - 1 comment
Labels: enhancement

#630 - feat(ocr_mkcontent): support drop reason in none_with_reason mode

Pull Request - State: closed - Opened by myhloli 2 months ago

#627 - 新版本运行出现bug:IndexError: index 10 is out of bounds for axis 0 with size 10

Issue - State: open - Opened by Maple0709 2 months ago - 3 comments
Labels: bug

#623 - | ERROR | magic_pdf.cli.magicpdf:do_parse:114 - need model list input

Issue - State: closed - Opened by HSDCLZ 3 months ago - 2 comments
Labels: bug

#620 - magic-pdf.exe在电脑上不可用

Issue - State: closed - Opened by Gene1343 3 months ago - 2 comments
Labels: bug

#619 - 【QA】magic-pdf解析pdf后资源没有即刻释放

Issue - State: closed - Opened by dt-yy 3 months ago - 1 comment
Labels: bug

#618 - api调用短时间上传相同pdf出现could not execute a primitive错误

Issue - State: closed - Opened by dehua6666666 3 months ago - 1 comment
Labels: bug

#617 - 更新改本后部分内容会丢失

Issue - State: closed - Opened by Maple0709 3 months ago - 3 comments
Labels: bug

#615 - 能否支持三栏布局的pdf文档解析

Issue - State: closed - Opened by guoguo0646 3 months ago - 2 comments
Labels: enhancement

#615 - 能否支持三栏布局的pdf文档解析

Issue - State: closed - Opened by guoguo0646 3 months ago - 2 comments
Labels: enhancement

#610 - 图片描述和图片并排时图片描述丢失的问题。

Issue - State: closed - Opened by L9qmzn 3 months ago - 2 comments
Labels: bug

#609 - 5900X cpu跑的,cpu的python占用率才15%+7G内存

Issue - State: closed - Opened by dayfan0810 3 months ago - 3 comments
Labels: bug

#606 - How can I use all GPUs to accelarate?

Issue - State: closed - Opened by wahahaer 3 months ago - 1 comment
Labels: enhancement

#600 - magic-pdf 安装的版本为 0.6.1

Issue - State: closed - Opened by Jie2GG 3 months ago - 2 comments
Labels: bug

#596 - 按轮次处理文件,一轮只使用一个模型,减少对最大显存的需求。

Issue - State: closed - Opened by hwf1324 3 months ago - 1 comment
Labels: enhancement

#595 - magic_pdf.tools.cli:parse_doc:96 - code=8: invalid key in dict

Issue - State: closed - Opened by hwf1324 3 months ago - 6 comments
Labels: bug

#594 - 文件转换

Issue - State: closed - Opened by sabibi12 3 months ago - 2 comments
Labels: enhancement

#593 - 请问华为昇腾卡可否用来运行MinerU推理加速呢

Issue - State: closed - Opened by JiangRunzhi 3 months ago - 1 comment
Labels: enhancement

#592 - 解析信息丢失

Issue - State: closed - Opened by JackMacs 3 months ago - 1 comment
Labels: bug

#591 - magic-pdf -p demo1.pdf Illegal instruction | 执行magic-pdf提示指令非法

Issue - State: closed - Opened by Joyouspeng 3 months ago - 3 comments
Labels: bug

#585 - 能提供远程IP地址+API key使用的功能吗?

Issue - State: closed - Opened by llity 3 months ago - 3 comments
Labels: enhancement

#583 - 能否在版本发布时候,同时发布更新一个相应版本的docker镜像呢?

Issue - State: open - Opened by DreamTeamWangbowen 3 months ago - 12 comments
Labels: enhancement

#576 - Unable to handle large files

Issue - State: closed - Opened by Sg4Dylan 3 months ago - 3 comments
Labels: bug

#575 - 求助docker相关

Issue - State: closed - Opened by meng0423 3 months ago - 3 comments
Labels: enhancement

#572 - pdf解析时报“pymupdf.mupdf.FzErrorSyntax: code=8: syntax error in object (58 0 R)”

Issue - State: closed - Opened by liy-a 3 months ago - 4 comments
Labels: bug

#566 - 如何输出bbox框选不同元素的pdf

Issue - State: closed - Opened by jujulovesstudying 3 months ago - 1 comment

#562 - 解析PDF得到的content_list中标题只有一级

Issue - State: closed - Opened by littlexiaoyou 3 months ago - 1 comment
Labels: bug

#561 - 推理显存占用很高

Issue - State: closed - Opened by pandaominggz 3 months ago - 2 comments
Labels: bug

#558 - ocr解析pdf,部分pdf会出现乱码问题

Issue - State: closed - Opened by stormchen-cell 3 months ago - 5 comments
Labels: bug

#556 - 安装版本为0.6.1 而不是0.7.1

Issue - State: closed - Opened by James-Dao 3 months ago - 34 comments
Labels: bug

#551 - MinerU和marker解析pdf能力对比

Issue - State: closed - Opened by Sakura4036 3 months ago - 5 comments
Labels: enhancement

#547 - filename with ' ' fixed

Pull Request - State: closed - Opened by strongerfly 3 months ago - 2 comments

#545 - 在线体验端pdf识别结果问题

Issue - State: closed - Opened by X17exe 3 months ago - 1 comment

#538 - 英文部分检测乱码

Issue - State: closed - Opened by clareliu1234 3 months ago - 2 comments
Labels: bug

#535 - 'fairscale'模块不存在

Issue - State: closed - Opened by yang123456he 3 months ago - 5 comments
Labels: bug

#517 - 模型预加载

Issue - State: closed - Opened by BronyaKaslana06 3 months ago - 5 comments
Labels: enhancement

#516 - magic_pdf_parse_main.py的最佳配置

Issue - State: closed - Opened by HaoRenkk123 3 months ago - 5 comments

#513 - 在magic_pdf_parse_main这个demo中,如何才能批量处理PDF文件

Issue - State: closed - Opened by chenliutiao 3 months ago - 7 comments
Labels: enhancement

#504 - Library not loaded: @loader_path/libjxl.0.6.1.dylib

Issue - State: closed - Opened by audio-github-2020 3 months ago - 2 comments
Labels: bug

#497 - pdf识别不出图片

Issue - State: closed - Opened by Ceceliachenen 3 months ago - 1 comment
Labels: bug

#491 - 能不能转化doc成md啊?还是只能pdf转md

Issue - State: closed - Opened by Alan-zhong 3 months ago - 5 comments

#484 - Unable to allocate 41.9 MiB for an array with shape (6, 276, 6625) and data type float32

Issue - State: closed - Opened by laulguo 3 months ago - 5 comments
Labels: bug

#464 - 本地部署完成后,运行命令,出现:非法指令 的提示

Issue - State: closed - Opened by wxzheng88 3 months ago - 7 comments
Labels: bug

#463 - feat: add tablemaster_paddle

Pull Request - State: closed - Opened by papayalove 3 months ago - 1 comment

#451 - 希望可以添加一个选项,只产生markdown文件

Issue - State: closed - Opened by ywh-my 3 months ago - 8 comments
Labels: enhancement

#444 - pdf解析出来的md乱码

Issue - State: closed - Opened by zuanzuanshao 3 months ago - 8 comments
Labels: bug

#442 - I want to convert to a docx file from pdf. What can i do for it .

Issue - State: closed - Opened by zuanzuanshao 4 months ago - 4 comments
Labels: enhancement

#442 - I want to convert to a docx file from pdf. What can i do for it .

Issue - State: closed - Opened by zuanzuanshao 4 months ago - 4 comments
Labels: enhancement

#438 - torchvision报错

Issue - State: closed - Opened by 1greatday 4 months ago - 1 comment
Labels: bug

#437 - 关于LayoutLMv3模型

Issue - State: closed - Opened by Jamly7 4 months ago - 1 comment
Labels: enhancement

#434 - 輸出的表格怎麽是latex,如何指定為md格式

Issue - State: closed - Opened by HSIAOKUOWEI 4 months ago - 2 comments
Labels: bug

#432 - 有一个需要联网下载

Issue - State: closed - Opened by 1greatday 4 months ago - 4 comments
Labels: bug

#428 - PDF to TEXT

Issue - State: closed - Opened by hzzheng0612 4 months ago - 1 comment
Labels: enhancement

#415 - demo.py中如何像magic-pdf pdf-command [OPTIONS]中支持ocr、txt、auto的模式选择

Issue - State: closed - Opened by EthanD4869 4 months ago - 1 comment
Labels: enhancement

#413 - 从pdf解析出来的内容少了一大段话

Issue - State: closed - Opened by ytcpub 4 months ago - 7 comments
Labels: bug

#395 - 支持自动将图片上传到s3

Issue - State: closed - Opened by liqiankun1111 4 months ago - 1 comment
Labels: enhancement

#394 - 关闭公式解析

Issue - State: closed - Opened by yuzj1002 4 months ago - 3 comments
Labels: enhancement

#387 - 如何将表格转为markdown格式?

Issue - State: open - Opened by lrybbbccc 4 months ago - 4 comments
Labels: enhancement

#384 - 图片PDF识别遗漏表格中间的文字

Issue - State: closed - Opened by albertshx 4 months ago - 5 comments
Labels: bug

#383 - 286页的扫描版PDF文档识别报错

Issue - State: closed - Opened by lceCre4m 4 months ago - 16 comments
Labels: bug

#370 - ocr之后的pdf在那里?

Issue - State: closed - Opened by ytcpub 4 months ago - 3 comments
Labels: bug

#364 - 版面检测卡死,公式识别似乎没有做极大值抑制

Issue - State: open - Opened by shinoairisu 4 months ago - 18 comments
Labels: bug

#362 - FatalError: `Segmentation fault` is detected by the operating system.

Issue - State: open - Opened by Jalen-Zhong 4 months ago - 19 comments
Labels: bug

#360 - 提取PDF中表格的其他方案(间接)

Issue - State: closed - Opened by beiluo 4 months ago - 26 comments

#358 - pdf小标题所在行的普通文字未被识别

Issue - State: closed - Opened by yushengliao 4 months ago - 3 comments
Labels: bug

#357 - Customize request

Issue - State: closed - Opened by Ly-Lynn 4 months ago - 1 comment
Labels: enhancement

#340 - magic-pdf -p

Issue - State: closed - Opened by ywm108 4 months ago - 1 comment
Labels: bug

#336 - 报错:ModuleNotFoundError: No module named 'struct_eqtable'

Issue - State: closed - Opened by CocoaML 4 months ago - 4 comments
Labels: bug