shibing624/MedicalGPT issues and pull requests

#62 - 预训练哪些参数能有效节省显存使用呀

Issue - State: closed - Opened by starphantom666 over 1 year ago - 2 comments
Labels: question, wontfix

#61 - pretrain data format is a little bit similar to the sft stage

Issue - State: closed - Opened by chlinfeng1997 over 1 year ago - 12 comments
Labels: question

#60 - 多卡chatglm2 sft报错RuntimeError: expected scalar type Half but found Float

Issue - State: closed - Opened by zhr0313 over 1 year ago - 8 comments
Labels: bug

#59 - chatglm2-6b sft报错

Issue - State: closed - Opened by zhangatao over 1 year ago - 2 comments
Labels: bug

#58 - run_rm使用llama模型看代码做了set pad_token_id = 0还是报错

Issue - State: closed - Opened by charryshi over 1 year ago - 6 comments
Labels: question

#57 - 恢复预训练时报OOM.

Issue - State: closed - Opened by Halflifefa over 1 year ago - 4 comments
Labels: question

#56 - Prompt设置

Issue - State: closed - Opened by Ricardokevins over 1 year ago - 1 comment
Labels: question

#55 - lora训练完的chatglm2-6b，adapter model怎么和base model合并成新的model？

Issue - State: closed - Opened by AaronZLT over 1 year ago - 3 comments
Labels: question

#54 - 按run_training_pipeline.ipynb流程跑完为什么效果很差

Issue - State: closed - Opened by yangzhipeng1108 over 1 year ago - 1 comment
Labels: question

#53 - 单机多卡(A100 80GB)，torchrun 数据并行报错

Issue - State: closed - Opened by AllenYkl over 1 year ago - 9 comments
Labels: question

#52 - RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0! (when checking argument for argument index in method wrapper__index_select)

Issue - State: closed - Opened by dage0127 over 1 year ago - 1 comment
Labels: bug

#51 - run_pt run_sft run_rm run_rl 这个四步没有串行有什么意义

Issue - State: closed - Opened by yangzhipeng1108 over 1 year ago
Labels: question

#50 - 在调用 run_sft.sh 时报错。

Issue - State: closed - Opened by boxter007 over 1 year ago - 4 comments
Labels: question

#49 - rm模型训练过程

Issue - State: closed - Opened by Vincent131499 over 1 year ago - 4 comments
Labels: bug

#48 - run_rl.sh chatglm2-6b Unrecognized configuration class <class 'transformers_modules.chatglm2-6b.configuration_chatglm.ChatGLMConfig'>

Issue - State: closed - Opened by yangzhipeng1108 over 1 year ago - 4 comments
Labels: bug

#47 - Expected all tensors to be on the same device, but found at least two devices, cuda:3 and cuda:0! 错误

Issue - State: closed - Opened by dingxianzhong over 1 year ago - 1 comment
Labels: bug

#46 - 大佬好，我看到您使用了deepspeed.config 想问下训练参数支持deepspeed的ZERO参数吗？

Issue - State: closed - Opened by valkryhx over 1 year ago - 28 comments
Labels: question

#45 - 似乎训练程度有点不够？

Issue - State: closed - Opened by bash99 over 1 year ago - 6 comments

#44 - lora模型权重合并到chatglm2-6b

Issue - State: closed - Opened by richard880502 over 1 year ago - 2 comments
Labels: bug

#43 - rlhf实验效果

Issue - State: closed - Opened by zxgineng over 1 year ago - 4 comments
Labels: bug

#42 - chatglm2-6b

Issue - State: closed - Opened by Vincent131499 over 1 year ago - 2 comments
Labels: enhancement

#41 - 请教一下base_model的问题

Issue - State: closed - Opened by panliang5020 over 1 year ago - 1 comment
Labels: question

#40 - run chatglm

Issue - State: closed - Opened by yangzhipeng1108 over 1 year ago - 2 comments
Labels: question

#39 - sft脚本，preprocess_function函数，chatglm输入问题

Issue - State: closed - Opened by songt96 over 1 year ago - 2 comments
Labels: question

#38 - 为啥我这数据集老是有问题呢

Issue - State: closed - Opened by SolarKnight1 over 1 year ago - 1 comment
Labels: bug

#37 - 奖励模型数据构造

Issue - State: closed - Opened by ZTurboX over 1 year ago - 6 comments
Labels: enhancement

#36 - 如果Stage1,2选用ChatGLM-6B作为基座model，Stage3训练奖励模型这里怎么设置呢？

Issue - State: closed - Opened by xuanxixi over 1 year ago - 4 comments
Labels: question

#35 - Stage 3: Reward Modeling 报错：ValueError: weight is on the meta device, we need a `value` to put in on 1.

Issue - State: closed - Opened by dage0127 over 1 year ago - 4 comments
Labels: question

#34 - ziya-llama-13b + lora推理结果异常

Issue - State: closed - Opened by kyang888 over 1 year ago - 9 comments
Labels: question

#33 - 从peft加载LoraConfig报错

Issue - State: closed - Opened by xiaohengDa over 1 year ago - 3 comments
Labels: bug

#32 - 对pretraining阶段的数据加工有点疑问

Issue - State: closed - Opened by hongyix over 1 year ago - 1 comment
Labels: question

#31 - run_sft.sh 的 eval_loss一直不降都是同一个值

Issue - State: closed - Opened by xuanxixi over 1 year ago - 1 comment
Labels: bug

#30 - gradio_demo.py

Issue - State: closed - Opened by gaojing8500 over 1 year ago - 1 comment
Labels: bug

#29 - ziya-llama-13b-medical-lora 量化推理怎么使用？

Issue - State: closed - Opened by Nisoka over 1 year ago - 7 comments
Labels: question

#28 - chatglm现在的reward model模型缺失吗？

Issue - State: closed - Opened by ymyjl over 1 year ago - 4 comments
Labels: enhancement

#27 - reward_baseline

Issue - State: closed - Opened by yangliuIOC over 1 year ago - 1 comment
Labels: question

#26 - 关于原始百川的infer

Issue - State: closed - Opened by nuoma over 1 year ago - 5 comments
Labels: question

#25 - 训练进程卡住

Issue - State: closed - Opened by aichifandefan over 1 year ago - 1 comment
Labels: enhancement

#23 - 运行run_rm.sh报错 RuntimeError: CUDA error: device-side assert triggered

Issue - State: closed - Opened by Candy555 over 1 year ago - 1 comment
Labels: bug

#22 - 最后两步的一些疑问

Issue - State: closed - Opened by skepsun over 1 year ago - 1 comment
Labels: bug

#21 - 基于chatglm训练reward model是使用AutoModelForSequenceClassification加载模型吗

Issue - State: closed - Opened by wang9702 over 1 year ago - 4 comments
Labels: bug

#20 - rl_training和reward_modeling中Tokenizer新增Pad token，Model不需要resize吗？

Issue - State: closed - Opened by baibaiw5 over 1 year ago - 1 comment
Labels: question

#19 - 使用merge_peft_adapter.py进行merge的时候，词表映射出现了问题

Issue - State: closed - Opened by nlper-hou over 1 year ago - 3 comments
Labels: question

#18 - RLHH

Issue - State: closed - Opened by yangliuIOC over 1 year ago - 3 comments
Labels: question

#17 - 使用run_pt.sh对llama-13B在医疗数据上增量训练，跑了5个epoch，可是loss不下降是怎么回事？一直是10.25附近波动

Issue - State: closed - Opened by nlper-hou over 1 year ago - 4 comments
Labels: question

#16 - 加载bloom 13B模型报错

Issue - State: closed - Opened by 1615070057 over 1 year ago - 1 comment
Labels: bug

#15 - 单机多卡运行卡死

Issue - State: closed - Opened by zhangxinxin0428 over 1 year ago - 8 comments
Labels: question

#14 - 关于预训练完成后合并模型及SFT的问题

Issue - State: closed - Opened by charryshi over 1 year ago - 10 comments
Labels: question

#13 - supervised_finetuning.py的preprocess_function函数是否有问题

Issue - State: closed - Opened by baibaiw5 over 1 year ago - 1 comment
Labels: question

#12 - loss is 0 when turn off use_peft

Issue - State: closed - Opened by baibaiw5 over 1 year ago - 2 comments
Labels: question

#11 - ValueError: 130004 is not in list

Issue - State: closed - Opened by sexan over 1 year ago - 16 comments
Labels: question

#10 - 跑增量预训练是中断后恢复不能继续

Issue - State: closed - Opened by charryshi over 1 year ago - 7 comments
Labels: question

#9 - 为啥这个代码里都是bug 跑chatglm的微调代码根本跑不起来

Issue - State: closed - Opened by yankuo111 over 1 year ago - 1 comment
Labels: bug

#8 - 请问第一阶段的增量预训练需要的显存大小

Issue - State: closed - Opened by charryshi over 1 year ago - 1 comment
Labels: question

#7 - 如何测试reward model

Issue - State: closed - Opened by lianzhaoy over 1 year ago - 5 comments
Labels: question

#6 - 单机多卡跑gradio推理时，报CUDA的错误

Issue - State: closed - Opened by chelovek21 over 1 year ago
Labels: question

#5 - 直接运行时会出现tokenizer长度错误

Issue - State: closed - Opened by chelovek21 over 1 year ago - 4 comments
Labels: question

#4 - 单机多卡预训练ChatGLM报错：

Issue - State: closed - Opened by zzzhaoguziji over 1 year ago - 8 comments
Labels: question

#3 - DistributedDataParallel device_ids and output_device arguments only work with single-device/multiple-device GPU modules or CPU modules, but got device_ids [0], output_device 0, and module parameters {device(type='cuda', index=0), device(type='cuda', index=1), device(type='cuda', index=2), device(type='cuda', index=3)}.

Issue - State: closed - Opened by gaojing8500 over 1 year ago - 1 comment
Labels: question

#2 - 直接运行run_rm.sh，产生关于计算图的RuntimeError

Issue - State: closed - Opened by zhpmatrix over 1 year ago - 4 comments
Labels: question

#1 - 请问增量预训练大概需要几块GPU呢？

Issue - State: closed - Opened by jason7323 over 1 year ago - 3 comments
Labels: question

GitHub / shibing624/MedicalGPT issues and pull requests