Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / alibaba/Pai-Megatron-Patch issues and pull requests
#66 - 使用huggingface + deepspeed stage 3训练codellama 报错,stage2 可以,这是为什么?
Issue -
State: open - Opened by weilanzhikong about 1 year ago
- 1 comment
#66 - 使用huggingface + deepspeed stage 3训练codellama 报错,stage2 可以,这是为什么?
Issue -
State: open - Opened by weilanzhikong about 1 year ago
- 1 comment
#65 - 模型转换脚本从megatron转为huggingface格式存在不匹配
Issue -
State: open - Opened by ctomx837 about 1 year ago
- 1 comment
#64 - Add Mistral 7B and refactor data module
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#64 - Add Mistral 7B and refactor data module
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#63 - Fix llama2/qwen pretrain with original dataset
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#63 - Fix llama2/qwen pretrain with original dataset
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#62 - Fix llama2/qwen pretrain with original dataset
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#61 - Add LLaVA and Mistral, Fix Finetune llama
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#61 - Add LLaVA and Mistral, Fix Finetune llama
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#60 - Training baichuan2 13b, when pp>1, it will nanloss when flash attention is turned off.
Issue -
State: open - Opened by uygnef about 1 year ago
#60 - Training baichuan2 13b, when pp>1, it will nanloss when flash attention is turned off.
Issue -
State: open - Opened by uygnef about 1 year ago
#59 - Baichuan2 13B预训练过程的TP相关报错
Issue -
State: closed - Opened by gllary about 1 year ago
#58 - Codellama
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
- 1 comment
#58 - Codellama
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
- 1 comment
#57 - Codellama mg2hf for 34b
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
- 1 comment
#56 - adapt codellama to latest Megatron
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
- 1 comment
#56 - adapt codellama to latest Megatron
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
- 1 comment
#55 - fix model convert assertion
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#55 - fix model convert assertion
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#54 - 百川2 的max-z loss,为啥代码里只有前向部分,没有反向部分?
Issue -
State: open - Opened by flower-with-safe about 1 year ago
- 3 comments
#54 - 百川2 的max-z loss,为啥代码里只有前向部分,没有反向部分?
Issue -
State: open - Opened by flower-with-safe about 1 year ago
- 3 comments
#53 - 多机多卡相关
Issue -
State: closed - Opened by gllary about 1 year ago
- 2 comments
#53 - 多机多卡相关
Issue -
State: closed - Opened by gllary about 1 year ago
- 1 comment
#52 - Transformer Engine for Baichuan2 and Qwen
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#52 - Transformer Engine for Baichuan2 and Qwen
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#51 - Support Llama-2-70B with GQA
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#51 - Support Llama-2-70B with GQA
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#50 - Using git submodule to manage patch and megatron version match
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#50 - Using git submodule to manage patch and megatron version match
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#49 - Using git submodule to manage patch and megatron version match
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#49 - Using git submodule to manage patch and megatron version match
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#48 - Add Megatron-LM as git submodule
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#48 - Add Megatron-LM as git submodule
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#47 - support mg2hf for qwen/baichuan2/llama2 after train
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#47 - support mg2hf for qwen/baichuan2/llama2 after train
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#46 - LLama-2 Support Transformer Engine
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#46 - LLama-2 Support Transformer Engine
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#45 - Co-Exist LLama2 ROPE and Megatron ROPE
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#44 - AttributeError: 'torch._C._distributed_c10d.Options' object has no attribute 'config'
Issue -
State: closed - Opened by jiejie1993 about 1 year ago
- 1 comment
#44 - AttributeError: 'torch._C._distributed_c10d.Options' object has no attribute 'config'
Issue -
State: closed - Opened by jiejie1993 about 1 year ago
- 1 comment
#43 - Generation
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#43 - Generation
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#42 - support generation for baichuan2, llama2, qwen
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#42 - support generation for baichuan2, llama2, qwen
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#41 - Update Baichuan-2/Qwen model with latest Megatron LM
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#40 - Update Baichuan-2 model with latest Megatron LM
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#39 - Update LLama2 model with latest Megatron LM
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#38 - add .gitignore
Pull Request -
State: closed - Opened by MengLeebin about 1 year ago
- 1 comment
#37 - Enhance Qwen-14b running scripts
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#36 - add mg-hf convert for qwen14b
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#36 - add mg-hf convert for qwen14b
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#35 - Qwen-14b mg-hf
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#35 - Qwen-14b mg-hf
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#34 - add mg2hf for qwen-7b
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#33 - Fix Alibi Mask when TP>1 for baichuan2 13B
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#32 - Qwen-7b mg2hf
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#32 - Qwen-7b mg2hf
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#31 - Fix Alibi Mask when TP>1 for baichuan2 13B
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#30 - fix: model parameter init and flash attention logging in deepspeed llama training
Pull Request -
State: closed - Opened by ZebornDuan about 1 year ago
- 1 comment
#30 - fix: model parameter init and flash attention logging in deepspeed llama training
Pull Request -
State: closed - Opened by ZebornDuan about 1 year ago
- 1 comment
#29 - Baichuan2 13B pretrain alibi_attn_mask TP=2 pp=4
Issue -
State: closed - Opened by cwszz about 1 year ago
- 5 comments
#29 - Baichuan2 13B pretrain alibi_attn_mask TP=2 pp=4
Issue -
State: closed - Opened by cwszz about 1 year ago
- 5 comments
#28 - Pai-Megatron-Patch和Megatron-LM分别应该选择什么版本?
Issue -
State: closed - Opened by wangbluo about 1 year ago
- 6 comments
#28 - Pai-Megatron-Patch和Megatron-LM分别应该选择什么版本?
Issue -
State: closed - Opened by wangbluo about 1 year ago
- 6 comments
#27 - Add NormHead for Baichuan-2 for 7B/13B model
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#26 - add zloss for baichuan2
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#25 - Add Baichuan-2 for 7B/13B model
Pull Request -
State: closed - Opened by jerryli1981 about 1 year ago
- 1 comment
#23 - Needs requirements.txt
Issue -
State: closed - Opened by miangangzhen about 1 year ago
- 3 comments
#19 - add mg2hf for baichuan1
Pull Request -
State: closed - Opened by lwmlyy about 1 year ago
#16 - 您好,请问什么时候可以支持baichuan2呀~
Issue -
State: closed - Opened by zhangbin1997 about 1 year ago
- 2 comments
#16 - 您好,请问什么时候可以支持baichuan2呀~
Issue -
State: closed - Opened by zhangbin1997 about 1 year ago
- 2 comments
#15 - 支持rwkv
Issue -
State: closed - Opened by BBuf about 1 year ago
- 3 comments
#15 - 支持rwkv
Issue -
State: closed - Opened by BBuf about 1 year ago
- 3 comments