Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / alibaba/Pai-Megatron-Patch issues and pull requests

#65 - Model conversion script produces a mismatch when converting from Megatron to Hugging Face format

Issue - State: open - Opened by ctomx837 about 1 year ago - 1 comment

#64 - Add Mistral 7B and refactor data module

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#63 - Fix llama2/qwen pretrain with original dataset

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#62 - Fix llama2/qwen pretrain with original dataset

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#61 - Add LLaVA and Mistral, Fix Finetune llama

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#59 - TP-related error during Baichuan2 13B pretraining

Issue - State: closed - Opened by gllary about 1 year ago

#58 - Codellama

Pull Request - State: closed - Opened by lwmlyy about 1 year ago - 1 comment

#57 - Codellama mg2hf for 34b

Pull Request - State: closed - Opened by lwmlyy about 1 year ago - 1 comment

#56 - adapt codellama to latest Megatron

Pull Request - State: closed - Opened by lwmlyy about 1 year ago - 1 comment

#55 - fix model convert assertion

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#53 - Multi-node, multi-GPU related question

Issue - State: closed - Opened by gllary about 1 year ago - 2 comments

#53 - Multi-node, multi-GPU related question

Issue - State: closed - Opened by gllary about 1 year ago - 1 comment

#52 - Transformer Engine for Baichuan2 and Qwen

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#51 - Support Llama-2-70B with GQA

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#50 - Using git submodule to manage patch and megatron version match

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#49 - Using git submodule to manage patch and megatron version match

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#48 - Add Megatron-LM as git submodule

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#47 - support mg2hf for qwen/baichuan2/llama2 after train

Pull Request - State: closed - Opened by lwmlyy about 1 year ago

#46 - LLama-2 Support Transformer Engine

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#45 - Co-Exist LLama2 ROPE and Megatron ROPE

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#43 - Generation

Pull Request - State: closed - Opened by lwmlyy about 1 year ago

#42 - support generation for baichuan2, llama2, qwen

Pull Request - State: closed - Opened by lwmlyy about 1 year ago

#41 - Update Baichuan-2/Qwen model with latest Megatron LM

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#40 - Update Baichuan-2 model with latest Megatron LM

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#39 - Update LLama2 model with latest Megatron LM

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#38 - add .gitignore

Pull Request - State: closed - Opened by MengLeebin about 1 year ago - 1 comment

#37 - Enhance Qwen-14b running scripts

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#36 - add mg-hf convert for qwen14b

Pull Request - State: closed - Opened by lwmlyy about 1 year ago

#35 - Qwen-14b mg-hf

Pull Request - State: closed - Opened by lwmlyy about 1 year ago

#34 - add mg2hf for qwen-7b

Pull Request - State: closed - Opened by lwmlyy about 1 year ago

#33 - Fix Alibi Mask when TP>1 for baichuan2 13B

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#32 - Qwen-7b mg2hf

Pull Request - State: closed - Opened by lwmlyy about 1 year ago

#31 - Fix Alibi Mask when TP>1 for baichuan2 13B

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#30 - fix: model parameter init and flash attention logging in deepspeed llama training

Pull Request - State: closed - Opened by ZebornDuan about 1 year ago - 1 comment

#29 - Baichuan2 13B pretrain alibi_attn_mask TP=2 pp=4

Issue - State: closed - Opened by cwszz about 1 year ago - 5 comments

#28 - Which versions of Pai-Megatron-Patch and Megatron-LM should be used?

Issue - State: closed - Opened by wangbluo about 1 year ago - 6 comments

#27 - Add NormHead for Baichuan-2 for 7B/13B model

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#26 - add zloss for baichuan2

Pull Request - State: closed - Opened by lwmlyy about 1 year ago

#25 - Add Baichuan-2 for 7B/13B model

Pull Request - State: closed - Opened by jerryli1981 about 1 year ago - 1 comment

#23 - Needs requirements.txt

Issue - State: closed - Opened by miangangzhen about 1 year ago - 3 comments

#19 - add mg2hf for baichuan1

Pull Request - State: closed - Opened by lwmlyy about 1 year ago

#16 - Hello, when will baichuan2 be supported?

Issue - State: closed - Opened by zhangbin1997 about 1 year ago - 2 comments

#15 - Support rwkv

Issue - State: closed - Opened by BBuf about 1 year ago - 3 comments
