Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / THUDM/GLM-130B issues and pull requests
#220 - Could you offer a download link with Chinese mainland mirror?
Issue -
State: open - Opened by GreekPanda 5 months ago
#219 - error about the GLM-130B’s model checkpoint
Issue -
State: open - Opened by sunpian1 6 months ago
- 1 comment
#218 - Download stalls halfway and cannot be resumed
Issue -
State: open - Opened by HaHaLiang666 7 months ago
#217 - Help needed: I want to deploy this model locally; how do I deploy it on Windows?
Issue -
State: open - Opened by kangkangkangkkkk 8 months ago
#216 - Has anyone gotten the model running in the TensorRT-LLM Docker environment? Help needed...
Issue -
State: open - Opened by dahaobenhao 9 months ago
#215 - Clarification Request on GLM-130B Model Architecture and Licensing for Commercial Use
Issue -
State: open - Opened by JayLiangs 11 months ago
#214 - Error when running bash scripts/generate.sh --input-source interactive. Please help!
Issue -
State: open - Opened by Eternal-Yan 11 months ago
- 1 comment
#213 - 8-GPU FasterTransformer inference fails with RuntimeError: [FT][ERROR] Assertion fail: /home/young.ruan/FasterTransformer/src/fastertransformer/th_op/glm/GlmOp.h:539
Issue -
State: open - Opened by rGitcy 12 months ago
#212 - RuntimeError: probability tensor contains either `inf`, `nan` or element < 0 answers, answers_with_style, blanks = fill_blanks(raw_text, model, tokenizer, strategy)
Issue -
State: open - Opened by rGitcy about 1 year ago
#210 - Where is the course link?
Issue -
State: open - Opened by Stonesusu about 1 year ago
- 1 comment
#209 - glm2-130B will it be made?
Issue -
State: closed - Opened by yhyu13 about 1 year ago
- 1 comment
#208 - Can FasterTransformer support Glm6B?
Issue -
State: open - Opened by sym19991125 about 1 year ago
#207 - The model download links received in the application email have all expired
Issue -
State: open - Opened by bixyz about 1 year ago
- 5 comments
#206 - Are there plans to open-source a chat version based on 130B?
Issue -
State: open - Opened by ricosr about 1 year ago
#205 - Cannot submit the application on the model application page
Issue -
State: open - Opened by VSRacer about 1 year ago
- 1 comment
#204 - Can GLM output the sources of cited content along with its output?
Issue -
State: open - Opened by mike-2020 over 1 year ago
#203 - How to set up a model-parallel cluster
Issue -
State: open - Opened by ChenBinfighting1 over 1 year ago
#202 - The GLM-130B docs say the model weights need 260G of GPU memory, but the test demo actually uses about 240G in total; what is the reason?
Issue -
State: open - Opened by zxs789 over 1 year ago
#201 - Per-token latency varies in a pulse-like pattern
Issue -
State: open - Opened by wangheqi987 over 1 year ago
#200 - Question about the FT inference benchmark data
Issue -
State: open - Opened by frankxyy over 1 year ago
#199 - Training objective
Issue -
State: open - Opened by shuangshuangguo over 1 year ago
#198 - Question about the image in docs/quantization.md
Issue -
State: open - Opened by M3Dade over 1 year ago
#196 - Question about GLM-130B model architecture hyperparameters
Issue -
State: open - Opened by peiyingxin over 1 year ago
#195 - [Question] Does the GLM-130B model have a vocab file?
Issue -
State: open - Opened by starkhu over 1 year ago
- 1 comment
#194 - 6 cards inference
Issue -
State: open - Opened by wangheqi987 over 1 year ago
- 1 comment
#193 - Does FasterTransformer support bf16 inference?
Issue -
State: open - Opened by benyang0506 over 1 year ago
#192 - The closed-beta ChatGLM (https://chatglm.cn) feels worse in use than a locally deployed quantized chatGLM-6B model; why is that?
Issue -
State: open - Opened by zhaochuninhefei over 1 year ago
- 1 comment
#191 - Where is Embedding Layer Gradient Shrink implemented?
Issue -
State: open - Opened by jiezhangGt over 1 year ago
- 1 comment
#190 - How to fine-tune GLM-130B with LoRA
Issue -
State: open - Opened by ShaunHeNJU over 1 year ago
#189 - Is there a tutorial for deploying GLM-130B on DCU?
Issue -
State: open - Opened by guoxiaoyue111111 over 1 year ago
#188 - NVLink communication
Issue -
State: open - Opened by wangheqi987 over 1 year ago
#187 - aria2 reports errors with http_proxy and https_proxy
Issue -
State: open - Opened by Timaos123 over 1 year ago
- 1 comment
#186 - The model performs very poorly; what is the reason?
Issue -
State: open - Opened by rchanggogogo over 1 year ago
- 6 comments
#185 - Error loading the int4 model
Issue -
State: closed - Opened by wudajun7509 over 1 year ago
- 3 comments
#184 - Error when running bash scripts/generate.sh --input-source interactive
Issue -
State: closed - Opened by wudajun7509 over 1 year ago
- 4 comments
#183 - It seems ChatGLM-130B has not been open-sourced yet? Only the 6B has; the 130B one is not Chat
Issue -
State: closed - Opened by guotong1988 over 1 year ago
- 1 comment
#182 - How to adapt my own model to FasterTransformer
Issue -
State: open - Opened by ming-shy over 1 year ago
- 1 comment
#181 - RuntimeError: CUDA error: invalid device ordinal
Issue -
State: open - Opened by TranscenderNing over 1 year ago
- 1 comment
#180 - A question about bf16 in the paper
Issue -
State: open - Opened by Saggressive over 1 year ago
#179 - [HELP] Can anyone share an already-quantized int4 version of the model?
Issue -
State: closed - Opened by rchanggogogo over 1 year ago
#178 - Are there still many unresolved problems between chatglm and this open-source GLM-130b model?
Issue -
State: open - Opened by applepieiris over 1 year ago
- 2 comments
#177 - [ERROR] `bash scripts/generate.sh --input-source interactive` fails with an error
Issue -
State: open - Opened by SniperM99 over 1 year ago
- 7 comments
#176 - Model download address within mainland China
Issue -
State: open - Opened by wangheqi987 over 1 year ago
- 2 comments
#175 - question: what does token mean here ?
Issue -
State: open - Opened by jiangying000 over 1 year ago
#174 - 4*4090gpu for int4 model inference error
Issue -
State: open - Opened by sukibean163 over 1 year ago
- 1 comment
#172 - Question for the authors: why does the model size not change after quantizing to int4 or int8? Both are 240g
Issue -
State: closed - Opened by GXKIM over 1 year ago
- 15 comments
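#171 - https://tianqi.aminer.cn/ The CAPTCHA for partnership inquiries on the Tianqi official site does not load; how can I get in touch about commercial use?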
Issue -
State: open - Opened by sjtuzhaoxh over 1 year ago
- 1 comment
#170 - Why is there no Chinese documentation?
Issue -
State: closed - Opened by fsy1215 over 1 year ago
- 3 comments
#169 - Update requirements.txt
Pull Request -
State: open - Opened by yihuaxiang over 1 year ago
- 1 comment
#168 - Error when running on V100 (8 * 32G)
Issue -
State: open - Opened by yihuaxiang over 1 year ago
- 14 comments
#167 - Error after deployment: size mismatch for transformer.word_embeddings.weight: copying a param with shape torch.Size([18816, 12288]) from checkpoint, the shape in current model is torch.Size([150528, 12288]).
Issue -
State: open - Opened by yihuaxiang over 1 year ago
- 5 comments
#166 - Problem with torch run
Issue -
State: open - Opened by GXKIM over 1 year ago
- 4 comments
#165 - About the FasterTransformer inference program
Issue -
State: open - Opened by benyang0506 over 1 year ago
#164 - Question about P-Tuning
Issue -
State: open - Opened by Joey404 over 1 year ago
#163 - Problem with the generation code
Issue -
State: open - Opened by Ezra-Yu over 1 year ago
#162 - A100 GPU inference throughput
Issue -
State: open - Opened by RosieYC over 1 year ago
#161 - ValueError: could not find the metadata file ckpt/glm-130b-sat/49300/latest, please check --load
Issue -
State: open - Opened by bolongliu over 1 year ago
- 4 comments
#160 - Problems encountered with int4 quantization
Issue -
State: open - Opened by chensiyao12 over 1 year ago
- 10 comments
#159 - How do I fine-tune?
Issue -
State: open - Opened by kaixinjiuhao123 over 1 year ago
#158 - Does GLM-130B support pretraining on Huawei Ascend 910A NPU servers or other Chinese-made servers?
Issue -
State: open - Opened by gptcod over 1 year ago
- 1 comment
#157 - RuntimeError: CUDA Error: no kernel image is available for execution on the device
Issue -
State: open - Opened by wei-potato over 1 year ago
- 1 comment
#155 - Compat with numpy>=1.24.3
Pull Request -
State: closed - Opened by nrailg over 1 year ago
#154 - Cannot download the complete data; some parts fail to download
Issue -
State: open - Opened by GXKIM over 1 year ago
#153 - Error during inference on V100
Issue -
State: open - Opened by zhang992253635 over 1 year ago
- 1 comment
#152 - does not get proper answer. should i change my prompt?
Issue -
State: open - Opened by liuslevis over 1 year ago
#151 - Are Huawei Ascend 910A NPU servers supported?
Issue -
State: open - Opened by 15220036003 over 1 year ago
#150 - Error when run generate.sh
Issue -
State: closed - Opened by SnakeHacker over 1 year ago
- 2 comments
#149 - The blog is broken
Issue -
State: closed - Opened by xiaoyaolangzhi over 1 year ago
#148 - Request for a FasterTransformer version of chatglm-6b
Issue -
State: open - Opened by sc-lj over 1 year ago
- 1 comment
#147 - ValueError: could not find the metadata file
Issue -
State: closed - Opened by ferrymo over 1 year ago
#146 - What is the MFU of GLM-130B during training?
Issue -
State: open - Opened by nullnonenilNULL over 1 year ago
#145 - Request for the training code
Issue -
State: open - Opened by LucienShui over 1 year ago
#144 - How can I quickly and easily load the glm-base model with huggingface transformers?
Issue -
State: open - Opened by mzh1996 over 1 year ago
- 1 comment
#143 - The device parameter of BaseTransformer in SwissArmyTransformer\model\base_model.py defaults to cpu
Issue -
State: open - Opened by sea-of-freedom over 1 year ago
#141 - Create 111
Pull Request -
State: open - Opened by lishichao666 over 1 year ago
#140 - convert cost too much memory(260GB)
Issue -
State: closed - Opened by SnakeHacker over 1 year ago
- 1 comment
#138 - Perplexity about the number of 130b parameters
Issue -
State: closed - Opened by sea-of-freedom over 1 year ago
- 1 comment
#137 - Model INT4 quantization gets Killed
Issue -
State: open - Opened by Lunamoon-flow over 1 year ago
- 2 comments
#133 - GLM-130B training
Issue -
State: open - Opened by aph-asic over 1 year ago
- 4 comments
#132 - NCCL RuntimeError
Issue -
State: open - Opened by edwardelric1202 over 1 year ago
- 1 comment
#130 - How many 4090 GPUs are needed?
Issue -
State: open - Opened by g-wellsa over 1 year ago
- 4 comments
#127 - Minimum hardware requirements?
Issue -
State: open - Opened by oleotiger over 1 year ago
- 5 comments
#125 - Not receiving the model download link by email
Issue -
State: open - Opened by zhangxiangchn over 1 year ago
- 2 comments
#122 - Downloaded 60 files but they total only 239GB
Issue -
State: open - Opened by leoozy over 1 year ago
- 10 comments
#121 - Does it come with a web page and an API?
Issue -
State: open - Opened by PolarPeak over 1 year ago
- 4 comments
#120 - int4 docker env: RuntimeError: shape '[24, 3, 128]' is invalid for input of size 4608
Issue -
State: open - Opened by lhray over 1 year ago
- 1 comment
#117 - Distributed training error; advice wanted from anyone who has gotten it running
Issue -
State: open - Opened by xiaoweiweixiao over 1 year ago
- 6 comments
#108 - How long does it take to load the GLM-130B model onto GPUs (8*A100 40G) for inference?
Issue -
State: open - Opened by TestNLP over 1 year ago
- 6 comments
#105 - Could you release a locally runnable version like the 6B one?
Issue -
State: open - Opened by ZanoZ over 1 year ago
- 1 comment
#104 - For personal study, no edu email; requesting a download link
Issue -
State: closed - Opened by maxadc over 1 year ago
#103 - Cannot run offline on a single machine; fails with [errno 11001]getaddrinfo failed
Issue -
State: open - Opened by gsxy456 over 1 year ago
#102 - What is the data format for each fine-tuning task?
Issue -
State: open - Opened by SevenMpp over 1 year ago
- 2 comments
#101 - The evaluation datasets cannot seem to be downloaded
Issue -
State: open - Opened by cingtiye over 1 year ago
#100 - Error when running bash scripts/generate.sh --input-source interactive --sequential-initialization
Issue -
State: closed - Opened by Maxhyl over 1 year ago
- 5 comments
#99 - machine specification for pretraining
Issue -
State: open - Opened by wlike over 1 year ago
#98 - Hello, big-bench does not seem to support pytorch; how can I test big-bench?
Issue -
State: open - Opened by haiqizhang over 1 year ago
#97 - TensorBoard logging is invalid and inaccessible
Issue -
State: closed - Opened by pluiez over 1 year ago
#96 - Infer time increases dramatically when start two server
Issue -
State: open - Opened by shaochangxu over 1 year ago
- 2 comments
#95 - Inference with FasterTransformer
Issue -
State: open - Opened by justfuwei over 1 year ago
- 5 comments