Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / yanqiangmiffy/instructglm issues and pull requests
#34 - Problems in train_deepspeed.py with ZeRO stage 1|2|3
Issue -
State: open - Opened by zjhJOJO about 1 year ago
#34 - Problems in train_deepspeed.py with ZeRO stage 1|2|3
Issue -
State: open - Opened by zjhJOJO about 1 year ago
#33 - 用tokenizer_dataset_rows.py转换自己的数据报错datasets.builder.datasetgeneraationerror
Issue -
State: open - Opened by cat1222 over 1 year ago
#33 - 用tokenizer_dataset_rows.py转换自己的数据报错datasets.builder.datasetgeneraationerror
Issue -
State: open - Opened by cat1222 over 1 year ago
#32 - 使用lora训练,使用web_demo加载lora权重后,结果跟原生chatglm结果一样,lora权重没生效,这个是什么原因呢
Issue -
State: open - Opened by AnddyWang over 1 year ago
- 1 comment
#32 - 使用lora训练,使用web_demo加载lora权重后,结果跟原生chatglm结果一样,lora权重没生效,这个是什么原因呢
Issue -
State: open - Opened by AnddyWang over 1 year ago
- 1 comment
#31 - 用BelleGroup/train_1M_CN训练后,为什么用数据集里的问题测,回答不一样
Issue -
State: open - Opened by czhcc over 1 year ago
#31 - 用BelleGroup/train_1M_CN训练后,为什么用数据集里的问题测,回答不一样
Issue -
State: open - Opened by czhcc over 1 year ago
#30 - 调教后的逻辑能力如何?
Issue -
State: open - Opened by daiaji over 1 year ago
- 1 comment
#30 - 调教后的逻辑能力如何?
Issue -
State: open - Opened by daiaji over 1 year ago
- 1 comment
#29 - train_lora最低需要多大显存GPU可以训练?除了batch size 还有别的参数可以降低显存使用吗?
Issue -
State: open - Opened by twosnowman over 1 year ago
- 2 comments
#29 - train_lora最低需要多大显存GPU可以训练?除了batch size 还有别的参数可以降低显存使用吗?
Issue -
State: open - Opened by twosnowman over 1 year ago
- 2 comments
#28 - 用peft加载lora后,generate时报错ValueError: 130000 is not in list,加载lora之前推理是正常的
Issue -
State: open - Opened by BIGPPWONG over 1 year ago
#27 - 请问下train_deepspeed.py 怎么引入lora.pt
Issue -
State: open - Opened by AlexXx-Wu over 1 year ago
- 1 comment
#27 - 请问下train_deepspeed.py 怎么引入lora.pt
Issue -
State: open - Opened by AlexXx-Wu over 1 year ago
- 1 comment
#26 - torch.distributed.elastic.multiprocessing.errors.ChildFailedError
Issue -
State: closed - Opened by MonkeyTB over 1 year ago
- 1 comment
#26 - torch.distributed.elastic.multiprocessing.errors.ChildFailedError
Issue -
State: closed - Opened by MonkeyTB over 1 year ago
- 1 comment
#25 - ValueError: ChatGLMForConditionalGeneration does not support gradient checkpointing.
Issue -
State: open - Opened by deepeye over 1 year ago
- 7 comments
#25 - ValueError: ChatGLMForConditionalGeneration does not support gradient checkpointing.
Issue -
State: open - Opened by deepeye over 1 year ago
- 7 comments
#24 - 预测时,torch.set_default_tensor_type(torch.cuda.HalfTensor)的问题
Issue -
State: closed - Opened by reborm over 1 year ago
#24 - 预测时,torch.set_default_tensor_type(torch.cuda.HalfTensor)的问题
Issue -
State: closed - Opened by reborm over 1 year ago
#23 - datasets.builder.InvalidConfigName: Bad characters from black list '<>:/\|?*' found in 'data/belle_data.json'. They could create issues when creating a directory for this config on Windows filesystem.
Issue -
State: open - Opened by deepeye over 1 year ago
- 1 comment
#23 - datasets.builder.InvalidConfigName: Bad characters from black list '<>:/\|?*' found in 'data/belle_data.json'. They could create issues when creating a directory for this config on Windows filesystem.
Issue -
State: open - Opened by deepeye over 1 year ago
- 1 comment
#22 - RuntimeError: torch.cat(): expected a non-empty list of Tensors
Issue -
State: closed - Opened by hrdxwandg over 1 year ago
- 7 comments
#22 - RuntimeError: torch.cat(): expected a non-empty list of Tensors
Issue -
State: closed - Opened by hrdxwandg over 1 year ago
- 7 comments
#21 - 4张32G的可以吗,作者可以用你写的其他开源数据集finetune看看效果吗,再放出转换和训练代码
Issue -
State: open - Opened by hangzeli08 over 1 year ago
- 2 comments
#21 - 4张32G的可以吗,作者可以用你写的其他开源数据集finetune看看效果吗,再放出转换和训练代码
Issue -
State: open - Opened by hangzeli08 over 1 year ago
- 2 comments
#20 - 4张 12G的 3060能训练吗
Issue -
State: open - Opened by zlszhonglongshen over 1 year ago
- 2 comments
#20 - 4张 12G的 3060能训练吗
Issue -
State: open - Opened by zlszhonglongshen over 1 year ago
- 2 comments
#19 - 运行web_demo_alpaca_lora.py报错,是单纯的显存不够嘛
Issue -
State: open - Opened by tianmala over 1 year ago
- 1 comment
#19 - 运行web_demo_alpaca_lora.py报错,是单纯的显存不够嘛
Issue -
State: open - Opened by tianmala over 1 year ago
- 1 comment
#18 - 训练python train_lora.py的时候显示 ModuleNotFoundError: No module named 'configuration_chatglm'
Issue -
State: open - Opened by dragononly over 1 year ago
- 3 comments
#18 - 训练python train_lora.py的时候显示 ModuleNotFoundError: No module named 'configuration_chatglm'
Issue -
State: open - Opened by dragononly over 1 year ago
- 3 comments
#17 - 测试数据打不开https://huggingface.co/datasets/BelleGroup/generated_train_0.5M_CN
Issue -
State: open - Opened by dragononly over 1 year ago
- 2 comments
#17 - 测试数据打不开https://huggingface.co/datasets/BelleGroup/generated_train_0.5M_CN
Issue -
State: open - Opened by dragononly over 1 year ago
- 2 comments
#16 - 微调2:BELLE中文指令数据的问题
Issue -
State: open - Opened by czhcc over 1 year ago
- 1 comment
#16 - 微调2:BELLE中文指令数据的问题
Issue -
State: open - Opened by czhcc over 1 year ago
- 1 comment
#15 - Lora+DeepSpeed多机多卡的问题
Issue -
State: closed - Opened by zyds over 1 year ago
#15 - Lora+DeepSpeed多机多卡的问题
Issue -
State: closed - Opened by zyds over 1 year ago
#14 - 怎么能把lora参数merge回原始模型呢?
Issue -
State: closed - Opened by AItechnology over 1 year ago
- 1 comment
#14 - 怎么能把lora参数merge回原始模型呢?
Issue -
State: closed - Opened by AItechnology over 1 year ago
- 1 comment
#13 - 运行 finetune.py 遇到问题:OSError: /data/pretrained-chatglm-6b/ does not appear to have a file named config.json
Issue -
State: open - Opened by xubuvd over 1 year ago
- 1 comment
#13 - 运行 finetune.py 遇到问题:OSError: /data/pretrained-chatglm-6b/ does not appear to have a file named config.json
Issue -
State: open - Opened by xubuvd over 1 year ago
- 1 comment
#12 - 最新update的代码中,web_demo推理时报错
Issue -
State: open - Opened by feyxong over 1 year ago
- 2 comments
#12 - 最新update的代码中,web_demo推理时报错
Issue -
State: open - Opened by feyxong over 1 year ago
- 2 comments
#11 - 关于训练完成后,生成的答案总是带一些莫名奇妙的Q,A数据,真的不造是哪里出了问题,还望大佬赐教!谢谢!
Issue -
State: open - Opened by UMU689 over 1 year ago
- 1 comment
#11 - 关于训练完成后,生成的答案总是带一些莫名奇妙的Q,A数据,真的不造是哪里出了问题,还望大佬赐教!谢谢!
Issue -
State: open - Opened by UMU689 over 1 year ago
- 1 comment
#10 - 修改README.md中的错别字
Pull Request -
State: closed - Opened by SunYanCN over 1 year ago
#10 - 修改README.md中的错别字
Pull Request -
State: closed - Opened by SunYanCN over 1 year ago
#9 - 关于多轮对话的疑问
Issue -
State: open - Opened by ZeyuTeng96 over 1 year ago
#8 - web_demo_belle生成结果时有大段重复的问题
Issue -
State: open - Opened by JiayiFu over 1 year ago
- 10 comments
#7 - 请问支持多卡吗,怎么改造?
Issue -
State: closed - Opened by hjyMM2018 over 1 year ago
- 2 comments
#6 - RuntimeError: expected scalar type Half but found Float
Issue -
State: closed - Opened by fulQuan over 1 year ago
- 1 comment
#5 - ValueError: 150000 is not in list
Issue -
State: open - Opened by superhg over 1 year ago
- 5 comments
#4 - ValueError: Please specify `target_modules` in `peft_config`
Issue -
State: closed - Opened by MrInouye over 1 year ago
- 3 comments
#3 - 24G显存的3090可以训练吗?
Issue -
State: closed - Opened by franklyd over 1 year ago
- 1 comment
#2 - 请问有训练好的权重可以下载吗?
Issue -
State: closed - Opened by EagleChen over 1 year ago
- 1 comment
#1 - 基于原始chatglm-6b训练效果好还是基于alpaca的lora继续微调效果好呢?
Issue -
State: closed - Opened by suc16 over 1 year ago
- 4 comments