Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / THUDM/GLM issues and pull requests
#209 - How to configure pipeline parallelism in the ds_finetune_superglue.sh script
Issue -
State: open - Opened by dreamstick 3 months ago
#208 - Is there a huggingface version of the 110M model?
Issue -
State: open - Opened by WzjCoder 5 months ago
#207 - Suggestion: publish on ollama
Issue -
State: open - Opened by heimy2000 5 months ago
#206 - ms-swift now supports fine-tuning (finetune) of the glm-4v-9b multimodal large model 🚀😊
Issue -
State: closed - Opened by Jintao-Huang 5 months ago
- 2 comments
#205 - The model's tokenization logic
Issue -
State: open - Opened by loki-keroro 5 months ago
#204 - Add special_token
Issue -
State: open - Opened by chenzebiaohub 7 months ago
#203 - Few-shot tests on GLM-10B
Issue -
State: open - Opened by Vispstar-V 8 months ago
#202 - What is the license of Pretrained Models?
Issue -
State: open - Opened by phuchm 10 months ago
#201 - bug report! rouge-1 = 0.0000 rouge-2 = 0.0000 rouge-l = 0.0000
Issue -
State: closed - Opened by runningabcd 10 months ago
- 1 comment
#200 - When fine-tuning the glm-chinese-large version, does the related configuration need to be changed?
Issue -
State: closed - Opened by runningabcd 11 months ago
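#199 - Question for everyone: are there any usable inference acceleration methods for glm0.3b? My inference task currently takes 3 seconds per item, which is too slow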
Issue -
State: open - Opened by mechigonft 11 months ago
- 1 comment
#198 - Error when running bash scripts/generate_block.sh config_tasks/model_blocklm_10B_chinese.sh
Issue -
State: open - Opened by XiaozhuLove 12 months ago
- 1 comment
#197 - mpi4py library
Issue -
State: closed - Opened by XiaozhuLove 12 months ago
#196 - Fine-tuning glm-large-chinese on a classification task
Issue -
State: open - Opened by mechigonft about 1 year ago
#195 - Can deepspeed not be used when fine-tuning glm-large-chinese?
Issue -
State: open - Opened by mechigonft about 1 year ago
#194 - Error when fine-tuning glm-large-chinese on a classification task
Issue -
State: open - Opened by mechigonft about 1 year ago
#193 - Create README_zh.md
Pull Request -
State: open - Opened by great-wind about 1 year ago
#192 - Meaningless content generated during inference with GLM-2b
Issue -
State: open - Opened by ChristLBUPT about 1 year ago
#191 - What is the minimum configuration for running GLM-10B?
Issue -
State: open - Opened by nguyenvanhoangphuc about 1 year ago
- 1 comment
#190 - Error when using Zero-1 + cpu_offload=true?
Issue -
State: open - Opened by SkrDrag over 1 year ago
- 2 comments
#189 - ImportError: cannot import name 'container_abcs' from 'torch._six' (/root/anaconda3/envs/lss/lib/python3.8/site-packages/torch/_six.py)
Issue -
State: open - Opened by LssTry over 1 year ago
- 1 comment
#188 - How to convert a model continue-pretrained with MP_size greater than 1 into a transformer model for testing
Issue -
State: open - Opened by wangerxiao001 over 1 year ago
#187 - Calling the generate method with glm-10b-chinese sometimes fails
Issue -
State: open - Opened by adzhua over 1 year ago
- 1 comment
#186 - Bug in modeling_glm.py when calling the glm model: device setting omitted in attention_mask initialization
Issue -
State: open - Opened by luo-li-ba-suo over 1 year ago
- 1 comment
#185 - Has anyone successfully run Continual Pre-training with GLM?
Issue -
State: open - Opened by wjn1996 over 1 year ago
#184 - Inference error with the original glm-10b-chinese model
Issue -
State: open - Opened by Mryangkaitong over 1 year ago
#183 - Has anyone evaluated the glm-10b-chinese model?
Issue -
State: open - Opened by hegang1-tal over 1 year ago
- 1 comment
#182 - How to wrap GLM10B as a conversational API
Issue -
State: open - Opened by yihuaxiang over 1 year ago
- 3 comments
#181 - With the transformers package, AutoTokenizer cannot be loaded after downloading the files locally
Issue -
State: open - Opened by PolarisRisingWar over 1 year ago
#180 - glm-10b / tokenization_glm.py
Issue -
State: open - Opened by chenhaoenen over 1 year ago
#179 - Could you give an example of the pretraining data format? The data itself need not be shown; I just want to see the format
Issue -
State: open - Opened by gyh123wqe over 1 year ago
- 1 comment
#178 - Very poor output from glm-2b when following the example in the readme
Issue -
State: open - Opened by leekum2018 over 1 year ago
- 2 comments
#177 - What is the minimum configuration required for glm-10b-chinese inference?
Issue -
State: open - Opened by TianYangCai over 1 year ago
#176 - The block_lm_ratio parameter
Issue -
State: closed - Opened by chenhaoenen over 1 year ago
- 1 comment
#175 - Where can I find reference material for learning how to fine-tune the model?
Issue -
State: open - Opened by thurdaypeng over 1 year ago
#174 - Eligibility for Commercial Use
Issue -
State: open - Opened by Hegelim over 1 year ago
- 1 comment
#173 - Can glm-large be trained without InfiniBand?
Issue -
State: open - Opened by allendred over 1 year ago
- 3 comments
#172 - Extraction fails after downloading the pretrained weights of the Chinese version of GLM-10B
Issue -
State: open - Opened by echosyy over 1 year ago
#171 - What is the dataset format? Can an entire 10,000-character document be fed in for training? Also, what are the GPU requirements?
Issue -
State: open - Opened by dizhenx over 1 year ago
#170 - Error "no valid `self._rcvd_idx` is found" during pretraining
Issue -
State: open - Opened by yt7589 over 1 year ago
- 3 comments
#169 - parameter SCB
Issue -
State: open - Opened by zhaoqf123 over 1 year ago
- 1 comment
#168 - What is the initial loss of the glm-10b-chinese model? Mine is around 1.7; is that reasonable?
Issue -
State: open - Opened by shouwangzhe over 1 year ago
#167 - add comments and add ds_block_tiny for testing
Pull Request -
State: open - Opened by LucienShui over 1 year ago
#166 - Amount of pretraining data for the glm-10b-chinese model
Issue -
State: open - Opened by wlike over 1 year ago
#164 - Environment issues: the python version versus the version numbers in requirements.txt, and some dependencies
Issue -
State: open - Opened by LucienShui over 1 year ago
#163 - Differences between the GLM 10B and ChatGLM 6B model architectures
Issue -
State: open - Opened by ccsquare over 1 year ago
- 3 comments
#162 - Error when using "THUDM/glm-10b-chinese" for a classification task
Issue -
State: closed - Opened by 18335100284 over 1 year ago
- 2 comments
#161 - GLM-10B-Chinese model file is too large to extract
Issue -
State: closed - Opened by wusi1590 over 1 year ago
- 1 comment
#160 - Continued training from the 10B model: loss only drops from 11 to 5
Issue -
State: open - Opened by TccccD over 1 year ago
- 6 comments
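#159 - Please release a small-parameter version of ChatGLM that shares the Tokenizer with ChatGLM-6B, so that PPO, the final step of RLHF, can be sped up as much as possible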
Issue -
State: open - Opened by yynil over 1 year ago
- 4 comments
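#158 - Two questions about GLM: 1. Why is no linear layer added at predict time to map to the vocabulary dimension, with the output instead multiplied directly by word_embeddings? 2. Why is GLM loaded with AutoModelForSeq2SeqLM rather than AutoModelForCausualLM?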
Issue -
State: open - Opened by macheng6 over 1 year ago
#157 - Continued pretraining: error that zero_pp_rank_0_mp_rank_00_optim_states.pt cannot be found when loading the split 2b model
Issue -
State: open - Opened by shuangt over 1 year ago
#156 - How do I control the number of training epochs in pretrain?
Issue -
State: open - Opened by wusi1590 over 1 year ago
- 7 comments
#155 - self.num_samples = 1000 * self.ds_len
Issue -
State: open - Opened by superhg over 1 year ago
#154 - glm-large-chinese-335M generates repetitive output
Issue -
State: open - Opened by zh25714 over 1 year ago
- 3 comments
#153 - Launching distributed training with kubeflow
Issue -
State: open - Opened by EthanChen1234 over 1 year ago
#152 - Where can the pretraining corpus for GLM_large_chinese be found?
Issue -
State: open - Opened by zhangzai666 over 1 year ago
#151 - Why is there no vocabulary in the 10B-chinese model files?
Issue -
State: open - Opened by ggjge over 1 year ago
- 1 comment
#150 - Accelerate support for GLM
Issue -
State: open - Opened by larrylawl over 1 year ago
#149 - After splitting the GLM-10B-chinese model with MP_SIZE=8 and fine-tuning a seq2seq task, an IndexError is raised in the eval stage. I suspect eval is not running with MP_SIZE=8
Issue -
State: open - Opened by webYFDT over 1 year ago
- 7 comments
#148 - Hello, while fine-tuning glm-10-chinese on my own dataset, training got stuck at the 1000th iteration
Issue -
State: open - Opened by 694344851 over 1 year ago
- 5 comments
#147 - Running the seq2seq finetune script with the GLM-10B-Chinese model reports wrong word_embeddings.weight dimensions
Issue -
State: closed - Opened by webYFDT over 1 year ago
- 1 comment
#146 - Can a single machine with 8 v100 32G GPUs run seq2seq fine tune? When I run it, I hit work = _default_pg.barrier()
Issue -
State: closed - Opened by webYFDT over 1 year ago
#145 - --continuous-prompt when using p-tuning to finetune the glm-large-chinese model
Issue -
State: closed - Opened by Chenchenwei over 1 year ago
- 3 comments
#144 - RuntimeError: expand(torch.HalfTensor{[1025, 4096]}, size=[1]): the number of sizes provided (1) must be greater or equal to the number of dimensions in the tensor (2)
Issue -
State: closed - Opened by pilipala818 over 1 year ago
- 2 comments
#143 - How to get last_hidden_states from a model loaded via huggingface?
Issue -
State: closed - Opened by superhg over 1 year ago
#142 - accelerate cannot find the model
Issue -
State: open - Opened by yata0 over 1 year ago
- 7 comments
Labels: bug
#141 - GLM-10B model efficiency issue
Issue -
State: open - Opened by tqjack over 1 year ago
#140 - BUG: GLM-10B-Chinese model generate " ⁇".
Issue -
State: open - Opened by Tebmer over 1 year ago
- 3 comments
#139 - How to fine-tune the model on a Prompt dataset?
Issue -
State: open - Opened by xyjsjruiliu over 1 year ago
#138 - How to set hyperparameters during pretraining glm_doc?
Issue -
State: open - Opened by ymr12 over 1 year ago
#137 - After model-parallel training finishes, how do I merge the multiple model files into one?
Issue -
State: closed - Opened by sduhkn over 1 year ago
- 2 comments
#136 - Continued pretraining from the 10B model fails due to a world size mismatch
Issue -
State: open - Opened by JinmingZhao over 1 year ago
- 2 comments
#135 - Results on the cmrc dataset: all predictions are empty
Issue -
State: open - Opened by yxk9810 over 1 year ago
#134 - chatglm-6b
Issue -
State: open - Opened by mx8435 over 1 year ago
- 1 comment
#133 - Pretraining the chinese-large model on a single GPU
Issue -
State: closed - Opened by yxk9810 over 1 year ago
#132 - implement by megengine
Issue -
State: open - Opened by chenqy4933 over 1 year ago
#131 - Problem with the 10b-chinese model from the hugging face repository. Data-parallel fine-tuning with the Trainer API raises an OOM error; is there a way to optimize memory?
Issue -
State: open - Opened by taofennanhai over 1 year ago
- 3 comments
#130 - GLM 10B model zero-shot results cannot be matched
Issue -
State: open - Opened by LemonNoel over 1 year ago
#129 - Update README.md
Pull Request -
State: open - Opened by eltociear over 1 year ago
#128 - "The attention mask and the pad token id were not set" problem
Issue -
State: open - Opened by taofennanhai over 1 year ago
- 1 comment
#127 - Correct a typo in layers.py
Pull Request -
State: open - Opened by felixonmars over 1 year ago
#126 - Does this model support temperature and repetition_penalty?
Issue -
State: open - Opened by sgsdxzy over 1 year ago
#125 - GPT2Dataset and BlockDataset
Issue -
State: open - Opened by jiangix-paper over 1 year ago
#124 - rouge is 0 when finetuning large-chinese on a small dataset
Issue -
State: open - Opened by yxk9810 over 1 year ago
- 2 comments
#123 - Finetuning GLM-10-chinese in an environment with 160G RAM, two 24G 3090 GPUs, and an 800G disk
Issue -
State: open - Opened by 694344851 over 1 year ago
- 3 comments
#122 - Error with token id 50035
Issue -
State: open - Opened by Mryangkaitong over 1 year ago
- 1 comment
#120 - Usage of GLMForSequenceClassification
Issue -
State: open - Opened by Mryangkaitong over 1 year ago
- 2 comments
#119 - AutoModelForCausalLM
Issue -
State: open - Opened by beautifull4frank over 1 year ago
- 1 comment
#101 - key error "dev-0" when running ds_finetune_superglue.sh
Issue -
State: closed - Opened by lulia0228 over 1 year ago
- 1 comment
#100 - Could you provide some examples of multi-machine parallelism?
Issue -
State: closed - Opened by lulia0228 over 1 year ago
- 3 comments
#99 - Can the GLM model generate long sentences? I get dimension mismatch errors during both inference and fine-tuning
Issue -
State: closed - Opened by taofennanhai over 1 year ago
- 3 comments
#98 - WudaoCorpus-Dialog
Issue -
State: closed - Opened by jiangix-paper over 1 year ago
- 1 comment
#97 - where I can check the source code of model.generate()
Issue -
State: closed - Opened by yuanjames over 1 year ago
- 1 comment
#96 - How to use model parallelism with GLM-10B
Issue -
State: open - Opened by Yiran-Zhu over 1 year ago
- 2 comments
#95 - How to run glm-10b-chinese inference with an offline model downloaded from huggingface?
Issue -
State: open - Opened by vicwer over 1 year ago
- 2 comments
#94 - Abnormal model generation after using ds_pretrain_nvidia.sh
Issue -
State: closed - Opened by yipintiancheng over 1 year ago
- 3 comments
#93 - Does the GLM-10B-Chinese tokenizer support adding custom tokens? If so, roughly how is it done? Thank you very much!
Issue -
State: closed - Opened by maojinyang over 1 year ago
#92 - Why does the model occupy less GPU memory after quantization, but the inference speed is slower?
Issue -
State: open - Opened by Ant0082 over 1 year ago
- 1 comment