Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / alibaba/Megatron-LLaMA issues and pull requests

#67 - No module named 'megatron.tokenizer.file_utils'

Issue - State: closed - Opened by yuzhiguo07 2 months ago

#66 - 如何断点续训

Issue - State: open - Opened by MAxx8371 3 months ago

#65 - No update for a long time

Issue - State: open - Opened by dong-liuliu 3 months ago

#64 - Has llama2 GQA been supported yet?

Issue - State: open - Opened by JiwenJ 4 months ago

#62 - Llama 3 Support

Issue - State: open - Opened by john-theo 5 months ago

#61 - About batch_size

Issue - State: open - Opened by tszslovewanpu 7 months ago

#60 - sh LLaMA2_7B_standalone.sh

Issue - State: open - Opened by yangzhipeng1108 7 months ago

#59 - 请问是否支持从0训练一个小规模的LLaMA模型,如:1B

Issue - State: open - Opened by liubo12 7 months ago - 1 comment

#58 - 注意力权重转换问题

Issue - State: open - Opened by noob-ctrl 8 months ago - 2 comments

#56 - 使用distributed optimzer时grad_norm计算准确度的疑问

Issue - State: open - Opened by chivychao 9 months ago - 1 comment

#54 - add alibi position embedding and support baichuan

Pull Request - State: open - Opened by qyccc 10 months ago - 3 comments

#53 - LLaMAModel._causal_lm_process中的labels和logits对齐方法疑问

Issue - State: open - Opened by chivychao 10 months ago - 3 comments

#52 - Megatron-LM权重转hf格式

Issue - State: open - Opened by Yang-QW 10 months ago - 1 comment

#52 - Megatron-LM权重转hf格式

Issue - State: open - Opened by Yang-QW 10 months ago - 4 comments

#51 - Unable to import Megatron

Issue - State: closed - Opened by fyf2016 11 months ago - 8 comments

#50 - llama中decoder layer层里面的MLP问题

Issue - State: closed - Opened by yuanzhoulvpi2017 11 months ago - 4 comments

#48 - 求一份Serving的教程代码

Issue - State: open - Opened by xealml 11 months ago - 1 comment

#47 - hf权重转换代码小bug

Issue - State: open - Opened by yuanzhoulvpi2017 11 months ago

#47 - hf权重转换代码小bug

Issue - State: open - Opened by yuanzhoulvpi2017 11 months ago

#46 - INT4 量化的模型可以被Megatron-LLaMA支持吗?

Issue - State: open - Opened by Jeff123z 12 months ago - 1 comment

#45 - 请问目前Megatron-LLaMA支持LLaMA2-70B的训练吗?

Issue - State: open - Opened by 13416157913 12 months ago - 1 comment

#44 - 是否兼容sequence parallel

Issue - State: closed - Opened by jingjie01ai 12 months ago - 2 comments

#43 - CUDA_DEVICE_MAX_CONNECTIONS 设置问题

Issue - State: closed - Opened by Richie-yan 12 months ago

#43 - CUDA_DEVICE_MAX_CONNECTIONS 设置问题

Issue - State: closed - Opened by Richie-yan 12 months ago

#42 - 每次GA的backward都需要做通信

Issue - State: closed - Opened by jingjie01ai 12 months ago - 5 comments

#41 - fp16的支持问题

Issue - State: open - Opened by XUWeijiang 12 months ago - 1 comment

#41 - fp16的支持问题

Issue - State: open - Opened by XUWeijiang 12 months ago - 1 comment

#39 - Supporting overlapping AG with forward computation

Pull Request - State: closed - Opened by li-yi-dong 12 months ago

#39 - Supporting overlapping AG with forward computation

Pull Request - State: closed - Opened by li-yi-dong 12 months ago

#38 - Adopt OverlappedDistributedOptimizer to PP

Pull Request - State: closed - Opened by li-yi-dong 12 months ago - 1 comment

#33 - solve the RuntimeError: Tensors must be CUDA and dense

Pull Request - State: open - Opened by 13416157913 about 1 year ago - 2 comments

#33 - solve the RuntimeError: Tensors must be CUDA and dense

Pull Request - State: open - Opened by 13416157913 about 1 year ago - 2 comments

#31 - Loss对齐

Issue - State: open - Opened by wuziyou199217 about 1 year ago - 3 comments

#25 - 请问ParameterSchedule实际上有作用吗?

Issue - State: closed - Opened by yinzhijian about 1 year ago - 1 comment

#23 - NGC22.08 环境报错。

Issue - State: closed - Opened by EthanChen1234 about 1 year ago - 2 comments

#21 - llama2-34b shape不匹配

Issue - State: closed - Opened by cdj0311 about 1 year ago - 4 comments

#20 - 训练完后,将保存的Megatron格式转成HF格式报错

Issue - State: closed - Opened by 13416157913 about 1 year ago - 7 comments

#19 - deepspeed+megatron+llama,请问作者有试过吗

Issue - State: open - Opened by Chandler-Bing about 1 year ago - 1 comment

#18 - add Megatron-LLaMA/examples/LLaMA/LLaMA2_7B_standalone.sh file

Pull Request - State: closed - Opened by 13416157913 about 1 year ago - 4 comments

#17 - nccl通信边界问题?

Issue - State: open - Opened by Baibaifan about 1 year ago - 10 comments

#17 - nccl通信边界问题?

Issue - State: open - Opened by Baibaifan about 1 year ago - 10 comments

#16 - 2节点训练13B LLaMA模型效率只能达到840 token/sec/GPU

Issue - State: open - Opened by YaboSun about 1 year ago - 13 comments

#15 - RuntimeError: CUDA error: device-side assert triggered

Issue - State: closed - Opened by Double-bear about 1 year ago - 2 comments

#14 - 单机训练跑不了,CUDA报错

Issue - State: closed - Opened by XUWeijiang about 1 year ago - 11 comments

#14 - 单机训练跑不了,CUDA报错

Issue - State: closed - Opened by XUWeijiang about 1 year ago - 12 comments

#13 - 请问支持Qwen模型的训练吗?

Issue - State: open - Opened by sxthunder about 1 year ago - 2 comments

#12 - DistributedOptimizer 梯度聚合,疑问

Issue - State: closed - Opened by EthanChen1234 about 1 year ago - 3 comments

#12 - DistributedOptimizer 梯度聚合,疑问

Issue - State: closed - Opened by EthanChen1234 about 1 year ago - 3 comments

#10 - hf转megatron shape错误

Issue - State: open - Opened by Double-bear about 1 year ago - 10 comments

#9 - 请问什么时候出一个傻瓜教程?比如跑通7B完整训练流程

Issue - State: open - Opened by iMountTai about 1 year ago - 11 comments

#9 - 请问什么时候出一个傻瓜教程?比如跑通7B完整训练流程

Issue - State: open - Opened by iMountTai about 1 year ago - 11 comments

#8 - llama2分布式训练脚本有没有不用容器部署方式的脚本?

Issue - State: open - Opened by 13416157913 about 1 year ago - 1 comment

#8 - llama2分布式训练脚本有没有不用容器部署方式的脚本?

Issue - State: open - Opened by 13416157913 about 1 year ago - 1 comment

#7 - 实验结果的网络带宽数据能说明一下吗

Issue - State: closed - Opened by donghucey about 1 year ago - 1 comment

#6 - Compability with Huggingface

Issue - State: open - Opened by YuanLiuuuuuu about 1 year ago - 1 comment

#6 - Compability with Huggingface

Issue - State: open - Opened by YuanLiuuuuuu about 1 year ago - 1 comment

#4 - 执行LLaMA_13_standalone.sh脚本,没有训练过程很快就结束

Issue - State: open - Opened by 13416157913 about 1 year ago - 2 comments

#4 - 执行LLaMA_13_standalone.sh脚本,没有训练过程很快就结束

Issue - State: open - Opened by 13416157913 about 1 year ago - 2 comments