Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / charent/ChatLM-mini-Chinese issues and pull requests

#59 - Tokenizer training runs out of memory (OOM) with 60 GB of RAM

Issue - State: open - Opened by musexiaoluo 3 months ago - 1 comment

#58 - Data cleaning code

Issue - State: open - Opened by Mrkkew 3 months ago - 1 comment

#55 - Running 3.4 pre-training raises an "unsupported operand type(s)" error; help requested

Issue - State: open - Opened by summerFF 4 months ago - 1 comment

#54 - 4080 GPU can handle very little data; errors out once training data exceeds 10,000 samples

Issue - State: open - Opened by iissy 4 months ago - 6 comments

#52 - Can it be trained on an AMD GPU?

Issue - State: closed - Opened by alexhan1012 6 months ago - 1 comment

#48 - sft_train

Issue - State: closed - Opened by dbcSep03 7 months ago - 1 comment

#47 - Some NCCL operations have failed or timed out.

Issue - State: open - Opened by dbcSep03 7 months ago - 6 comments

#46 - Must the pre-training dataset be in the {“prompt”: "response":} format?

Issue - State: closed - Opened by dbcSep03 8 months ago - 2 comments

#45 - A very nice open source project

Issue - State: closed - Opened by DataXujing 8 months ago - 1 comment

#44 - How many tokens does the pre-training data add up to in total?

Issue - State: closed - Opened by StarCycle 8 months ago - 2 comments

#42 - train_3.5M_CN data processing problem

Issue - State: closed - Opened by wflying000 8 months ago - 1 comment

#41 - How do I load the model after SFT?

Issue - State: closed - Opened by Liuxinhao12 8 months ago - 1 comment

#40 - RuntimeError: No executable batch size found, reached zero

Issue - State: closed - Opened by suiyueyousan 8 months ago - 2 comments

#39 - Would you consider releasing a version that supports llama?

Issue - State: closed - Opened by leondada 8 months ago - 1 comment

#38 - How do I extract the outputs of intermediate layers?

Issue - State: closed - Opened by W-void 8 months ago - 2 comments

#37 - Error during SFT fine-tuning

Issue - State: closed - Opened by ama0zarashi 8 months ago - 4 comments

#36 - Shape mismatch when using train.py

Issue - State: closed - Opened by huluk98 9 months ago - 10 comments

#35 - Why are the predicted triples incorrect after fine-tuning?

Issue - State: closed - Opened by qiutzh 9 months ago - 5 comments

#34 - Pre-training dataset

Issue - State: closed - Opened by rabintang 9 months ago - 2 comments

#33 - How can the project be debugged using fastchat?

Issue - State: closed - Opened by zhilangtaosha 9 months ago - 1 comment

#32 - Great Work! Does it support multimodal ability?

Issue - State: closed - Opened by LianghuiGuo 9 months ago - 1 comment

#29 - If new content needs to be added, does everything have to be retrained?

Issue - State: closed - Opened by kideve 10 months ago - 2 comments

#28 - Bump fastapi from 0.105.0 to 0.109.1

Pull Request - State: closed - Opened by dependabot[bot] 10 months ago
Labels: dependencies

#27 - With multiple GPUs, is the same dataset loaded multiple times?

Issue - State: closed - Opened by shinerdeng 10 months ago - 6 comments

#25 - How do I run "3.3 Tokenizer training"?

Issue - State: closed - Opened by ybdesire 10 months ago - 2 comments

#24 - Why do I get stuck loading the dataset after running it

Issue - State: closed - Opened by anyiz 10 months ago - 11 comments

#23 - Error occurs partway through SFT fine-tuning

Issue - State: closed - Opened by aoguai 10 months ago - 11 comments

#22 - Have you considered distributing the model on https://modelscope.cn/?

Issue - State: closed - Opened by qmjy 10 months ago - 2 comments

#21 - Training with LoRA and sft_train.py seems to have no effect; is there a better approach?

Issue - State: closed - Opened by yugu91 10 months ago - 7 comments

#20 - Could the README provide a Docker image that bundles the environment and the model?

Issue - State: closed - Opened by zack-sys 10 months ago - 1 comment

#19 - Are there plans to fine-tune for agent function calling?

Issue - State: closed - Opened by lucasjinreal 10 months ago - 4 comments

#18 - Would training on better hardware make a big difference in results?

Issue - State: closed - Opened by aiwillcoming 10 months ago - 1 comment

#17 - Question: the generated replies are repetitive

Issue - State: closed - Opened by shinerdeng 10 months ago - 2 comments

#16 - Why does this appear when I start the API?

Issue - State: closed - Opened by Adolph3671 10 months ago - 1 comment

#14 - Can it run on a server?

Issue - State: closed - Opened by yanyilin3344 10 months ago - 5 comments

#13 - Error when running SFT on the provided model

Issue - State: closed - Opened by cq1316 10 months ago - 13 comments

#12 - Will the cleaned dataset be open-sourced?

Issue - State: closed - Opened by echo-valor 11 months ago - 1 comment

#11 - Dev

Pull Request - State: closed - Opened by charent 11 months ago

#10 - How do I run it?

Issue - State: closed - Opened by meng25meng 11 months ago - 17 comments

#9 - About the sft_train.json file for information extraction with the small model ChatLM-mini-Chinese

Issue - State: closed - Opened by pengcheng-yan 11 months ago - 4 comments

#8 - merge from dev to main

Pull Request - State: closed - Opened by charent 11 months ago

#7 - Could you describe the training configurations for different tasks?

Issue - State: closed - Opened by PshySimon 11 months ago - 4 comments

#6 - merge dev to main

Pull Request - State: closed - Opened by charent 11 months ago

#5 - merge from dev

Pull Request - State: closed - Opened by charent 11 months ago

#4 - Bump transformers from 4.35.2 to 4.36.0

Pull Request - State: closed - Opened by dependabot[bot] 11 months ago
Labels: dependencies

#3 - Bump pyarrow from 13.0.0 to 14.0.1

Pull Request - State: closed - Opened by dependabot[bot] 12 months ago
Labels: dependencies

#2 - add TODO

Pull Request - State: closed - Opened by charent about 1 year ago

#1 - Bump pyarrow from 13.0.0 to 14.0.1

Pull Request - State: closed - Opened by dependabot[bot] about 1 year ago
Labels: dependencies