Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / tsinghuaai/cpm-2-finetune issues and pull requests

#50 - Cpm2 open

Issue - State: open - Opened by Suleymanlizad 4 months ago

#50 - Cpm2 open

Issue - State: open - Opened by Suleymanlizad 4 months ago

#49 - change mp size 后,训练会出现 size missmatch 的错误

Issue - State: open - Opened by samulew over 1 year ago - 2 comments

#49 - change mp size 后,训练会出现 size missmatch 的错误

Issue - State: open - Opened by samulew over 1 year ago - 2 comments

#48 - 训好的模型如何转化成huggingface的模型格式呢

Issue - State: open - Opened by Tron1994 almost 2 years ago

#48 - 训好的模型如何转化成huggingface的模型格式呢

Issue - State: open - Opened by Tron1994 almost 2 years ago

#45 - CPM2Datasets.py 中的T5Dataset报错

Issue - State: closed - Opened by dingtine almost 2 years ago - 1 comment

#45 - CPM2Datasets.py 中的T5Dataset报错

Issue - State: closed - Opened by dingtine almost 2 years ago - 1 comment

#44 - 在A100上加载FusedAdam报错

Issue - State: open - Opened by giter000 about 2 years ago - 1 comment

#44 - 在A100上加载FusedAdam报错

Issue - State: open - Opened by giter000 about 2 years ago - 1 comment

#41 - promt adgen文件缺失

Issue - State: closed - Opened by zhu1090093659 over 2 years ago - 3 comments

#41 - promt adgen文件缺失

Issue - State: closed - Opened by zhu1090093659 over 2 years ago - 3 comments

#39 - CPM2如何做few-shot的文本生成任务

Issue - State: closed - Opened by zhihao-chen over 2 years ago

#39 - CPM2如何做few-shot的文本生成任务

Issue - State: closed - Opened by zhihao-chen over 2 years ago

#38 - 请教:使用中英文双语模型报了一下的错误:

Issue - State: closed - Opened by Chunhui-Zou over 2 years ago - 1 comment

#38 - 请教:使用中英文双语模型报了一下的错误:

Issue - State: closed - Opened by Chunhui-Zou over 2 years ago - 1 comment

#36 - 救助:模型支持的最长输入序列是多少

Issue - State: closed - Opened by Chunhui-Zou over 2 years ago - 2 comments

#36 - 救助:模型支持的最长输入序列是多少

Issue - State: closed - Opened by Chunhui-Zou over 2 years ago - 2 comments

#35 - 想看模型生成的结果,该修改代码那一块

Issue - State: closed - Opened by Chunhui-Zou over 2 years ago

#35 - 想看模型生成的结果,该修改代码那一块

Issue - State: closed - Opened by Chunhui-Zou over 2 years ago

#34 - Math23K 没有公开test_private.json 文件吗?

Issue - State: closed - Opened by XiaoqingNLP almost 3 years ago - 3 comments

#34 - Math23K 没有公开test_private.json 文件吗?

Issue - State: closed - Opened by XiaoqingNLP almost 3 years ago - 3 comments

#33 - RuntimeError: Unable to proceed, no GPU resources available

Issue - State: open - Opened by louxingrui almost 3 years ago - 2 comments

#33 - RuntimeError: Unable to proceed, no GPU resources available

Issue - State: open - Opened by louxingrui almost 3 years ago - 2 comments

#32 - 数据集怎么处理,我下载了LSCTS数据集,运行程序后报错。

Issue - State: closed - Opened by Chunhui-Zou almost 3 years ago - 2 comments

#32 - 数据集怎么处理,我下载了LSCTS数据集,运行程序后报错。

Issue - State: closed - Opened by Chunhui-Zou almost 3 years ago - 2 comments

#31 - Create finetune_cpm2_sogou-log.sh

Pull Request - State: closed - Opened by xcjthu almost 3 years ago

#31 - Create finetune_cpm2_sogou-log.sh

Pull Request - State: closed - Opened by xcjthu almost 3 years ago

#30 - CPM2模型推理代码

Issue - State: closed - Opened by Bournet almost 3 years ago - 1 comment

#30 - CPM2模型推理代码

Issue - State: closed - Opened by Bournet almost 3 years ago - 1 comment

#29 - CPM2在生成任务上的微调策略

Issue - State: closed - Opened by XiaoqingNLP almost 3 years ago - 3 comments

#29 - CPM2在生成任务上的微调策略

Issue - State: closed - Opened by XiaoqingNLP almost 3 years ago - 3 comments

#28 - 怎么加入新词再finetune

Issue - State: closed - Opened by LinglingGreat almost 3 years ago - 3 comments

#28 - 怎么加入新词再finetune

Issue - State: closed - Opened by LinglingGreat almost 3 years ago - 3 comments

#27 - A100-8卡环境cublas报错

Issue - State: closed - Opened by linjianz almost 3 years ago - 1 comment

#27 - A100-8卡环境cublas报错

Issue - State: closed - Opened by linjianz almost 3 years ago - 1 comment

#25 - How to use BMInf to inference 100000.tar 11B model?

Issue - State: closed - Opened by linjianz about 3 years ago - 1 comment

#25 - How to use BMInf to inference 100000.tar 11B model?

Issue - State: closed - Opened by linjianz about 3 years ago - 1 comment

#24 - 模型并行度修改的切割问题

Issue - State: closed - Opened by leelinglin about 3 years ago - 1 comment

#24 - 模型并行度修改的切割问题

Issue - State: closed - Opened by leelinglin about 3 years ago - 1 comment

#23 - 模型fine-tune显存溢出

Issue - State: closed - Opened by leelinglin about 3 years ago - 4 comments

#23 - 模型fine-tune显存溢出

Issue - State: closed - Opened by leelinglin about 3 years ago - 4 comments

#22 - how to use 32000.tar?

Issue - State: closed - Opened by AdamBear about 3 years ago

#22 - how to use 32000.tar?

Issue - State: closed - Opened by AdamBear about 3 years ago

#21 - 双机8卡分布式训练

Issue - State: closed - Opened by forrestbing about 3 years ago - 2 comments

#21 - 双机8卡分布式训练

Issue - State: closed - Opened by forrestbing about 3 years ago - 2 comments

#20 - the decoder input in evaluate_gen()

Issue - State: closed - Opened by GMago-LeWay about 3 years ago - 1 comment

#20 - the decoder input in evaluate_gen()

Issue - State: closed - Opened by GMago-LeWay about 3 years ago - 1 comment

#19 - 显存占用

Issue - State: closed - Opened by 2020zyc about 3 years ago - 2 comments

#19 - 显存占用

Issue - State: closed - Opened by 2020zyc about 3 years ago - 2 comments

#18 - 请问可以不用deepspeed吗

Issue - State: closed - Opened by 2020zyc about 3 years ago - 7 comments

#18 - 请问可以不用deepspeed吗

Issue - State: closed - Opened by 2020zyc about 3 years ago - 7 comments

#17 - bug? save_zero

Issue - State: closed - Opened by 2020zyc about 3 years ago - 1 comment

#17 - bug? save_zero

Issue - State: closed - Opened by 2020zyc about 3 years ago - 1 comment

#16 - attention.dense.weight not found when prompt fine tuning

Issue - State: closed - Opened by luotongml about 3 years ago - 17 comments

#16 - attention.dense.weight not found when prompt fine tuning

Issue - State: closed - Opened by luotongml about 3 years ago - 17 comments

#15 - 请问2张A100-40G能跑吗

Issue - State: closed - Opened by 2020zyc about 3 years ago - 11 comments

#15 - 请问2张A100-40G能跑吗

Issue - State: closed - Opened by 2020zyc about 3 years ago - 11 comments

#14 - prompt tunning问题

Issue - State: closed - Opened by zirui about 3 years ago - 4 comments

#14 - prompt tunning问题

Issue - State: closed - Opened by zirui about 3 years ago - 4 comments

#13 - 请问sentinel id的作用什么?

Issue - State: closed - Opened by wakafengfan over 3 years ago - 1 comment

#13 - 请问sentinel id的作用什么?

Issue - State: closed - Opened by wakafengfan over 3 years ago - 1 comment

#12 - Can CPM-2 run in playground model, any prompt hint?

Issue - State: closed - Opened by qhduan over 3 years ago - 1 comment

#12 - Can CPM-2 run in playground model, any prompt hint?

Issue - State: closed - Opened by qhduan over 3 years ago - 1 comment

#11 - Finetune loss and acc is pool

Issue - State: closed - Opened by k15201363625 over 3 years ago - 10 comments

#11 - Finetune loss and acc is pool

Issue - State: closed - Opened by k15201363625 over 3 years ago - 10 comments

#10 - docker 运行失败

Issue - State: closed - Opened by windflee over 3 years ago - 1 comment

#10 - docker 运行失败

Issue - State: closed - Opened by windflee over 3 years ago - 1 comment

#9 - MoE Finetune

Issue - State: closed - Opened by lizy14 over 3 years ago - 2 comments

#8 - 用两个机器,启动的时候,报错

Issue - State: closed - Opened by lonelydancer over 3 years ago - 3 comments

#7 - deepspeed init hang住

Issue - State: closed - Opened by lonelydancer over 3 years ago - 3 comments

#6 - CPM-2-Finetuning的推理速度有多快(在V100上)

Issue - State: closed - Opened by lonelydancer over 3 years ago - 2 comments

#5 - adgen数据集

Issue - State: closed - Opened by zhenhao-huang over 3 years ago - 4 comments

#4 - finetune的最小配置是?

Issue - State: closed - Opened by eshaoliu over 3 years ago - 10 comments

#3 - V100单卡能inference吗

Issue - State: closed - Opened by zuowang over 3 years ago - 1 comment

#2 - 报错

Issue - State: closed - Opened by superqing001 over 3 years ago

#1 - 是否可以提供docker运行的脚本参考?

Issue - State: closed - Opened by superqing001 over 3 years ago - 2 comments