Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / mlfoundations/open_lm issues and pull requests
#50 - Adding unites for grad_accum (with and without fsdp)
Pull Request -
State: closed - Opened by jfisher52 about 1 year ago
#50 - Adding unites for grad_accum (with and without fsdp)
Pull Request -
State: closed - Opened by jfisher52 about 1 year ago
#49 - Multinode Working Tokenize shuffle
Pull Request -
State: closed - Opened by Vaishaal about 1 year ago
- 1 comment
#49 - Multinode Working Tokenize shuffle
Pull Request -
State: closed - Opened by Vaishaal about 1 year ago
- 1 comment
#48 - Use no_sync when doing gradient accumulation
Issue -
State: closed - Opened by achalddave about 1 year ago
#48 - Use no_sync when doing gradient accumulation
Issue -
State: closed - Opened by achalddave about 1 year ago
#47 - Add accumulation test and minor fix
Pull Request -
State: closed - Opened by achalddave about 1 year ago
- 1 comment
#47 - Add accumulation test and minor fix
Pull Request -
State: closed - Opened by achalddave about 1 year ago
- 1 comment
#46 - Log and skip errors in map_dict
Pull Request -
State: closed - Opened by achalddave about 1 year ago
- 6 comments
#46 - Log and skip errors in map_dict
Pull Request -
State: closed - Opened by achalddave about 1 year ago
- 6 comments
#45 - Convert LLaMA weights to open_lm compatible weights
Pull Request -
State: closed - Opened by yuhui-zh15 about 1 year ago
- 1 comment
#45 - Convert LLaMA weights to open_lm compatible weights
Pull Request -
State: closed - Opened by yuhui-zh15 about 1 year ago
- 1 comment
#44 - Black format openlm
Issue -
State: closed - Opened by achalddave about 1 year ago
#44 - Black format openlm
Issue -
State: closed - Opened by achalddave about 1 year ago
#43 - Speed up loading remote checkpoints
Issue -
State: closed - Opened by achalddave about 1 year ago
Labels: good first issue
#43 - Speed up loading remote checkpoints
Issue -
State: closed - Opened by achalddave about 1 year ago
Labels: good first issue
#42 - Fix optim_state_dict_to_load call for latest pytorch versions
Pull Request -
State: closed - Opened by achalddave about 1 year ago
- 1 comment
#42 - Fix optim_state_dict_to_load call for latest pytorch versions
Pull Request -
State: closed - Opened by achalddave about 1 year ago
- 1 comment
#41 - Documentation: competing frameworks
Issue -
State: open - Opened by borgr about 1 year ago
- 3 comments
#41 - Documentation: competing frameworks
Issue -
State: open - Opened by borgr about 1 year ago
- 3 comments
#40 - Fused RMSNorm
Pull Request -
State: open - Opened by sagadre about 1 year ago
- 2 comments
#40 - Fused RMSNorm
Pull Request -
State: open - Opened by sagadre about 1 year ago
- 2 comments
#39 - LLaMA2 weight loading
Pull Request -
State: closed - Opened by jmercat about 1 year ago
- 2 comments
#39 - LLaMA2 weight loading
Pull Request -
State: closed - Opened by jmercat about 1 year ago
- 2 comments
#38 - Overloading --model param to support passing a json model config
Pull Request -
State: closed - Opened by sagadre about 1 year ago
#38 - Overloading --model param to support passing a json model config
Pull Request -
State: closed - Opened by sagadre about 1 year ago
#37 - fp8 training on h100s
Issue -
State: open - Opened by mitchellnw about 1 year ago
#37 - fp8 training on h100s
Issue -
State: open - Opened by mitchellnw about 1 year ago
#36 - Support LLaMA-2 with Backward Compatibility
Pull Request -
State: closed - Opened by yuhui-zh15 about 1 year ago
#36 - Support LLaMA-2 with Backward Compatibility
Pull Request -
State: closed - Opened by yuhui-zh15 about 1 year ago
#35 - Weird memory usage for 11m vs 160m: similar batch size fits in memory...
Issue -
State: open - Opened by alexjc about 1 year ago
- 14 comments
#35 - Weird memory usage for 11m vs 160m: similar batch size fits in memory...
Issue -
State: open - Opened by alexjc about 1 year ago
- 14 comments
#34 - Add bootstrap CIs to val perplexity calculation
Issue -
State: closed - Opened by achalddave about 1 year ago
#34 - Add bootstrap CIs to val perplexity calculation
Issue -
State: closed - Opened by achalddave about 1 year ago
#33 - [DON'T MERGE] Support llama-2
Pull Request -
State: closed - Opened by yuhui-zh15 about 1 year ago
- 1 comment
#33 - [DON'T MERGE] Support llama-2
Pull Request -
State: closed - Opened by yuhui-zh15 about 1 year ago
- 1 comment
#32 - Remove fused cross-entropy
Pull Request -
State: closed - Opened by achalddave about 1 year ago
#32 - Remove fused cross-entropy
Pull Request -
State: closed - Opened by achalddave about 1 year ago
#31 - Ablate on initialization
Issue -
State: closed - Opened by mitchellnw about 1 year ago
- 2 comments
#31 - Ablate on initialization
Issue -
State: closed - Opened by mitchellnw about 1 year ago
- 2 comments
#30 - LLaMA weight loading
Issue -
State: closed - Opened by ludwigschmidt about 1 year ago
- 2 comments
#30 - LLaMA weight loading
Issue -
State: closed - Opened by ludwigschmidt about 1 year ago
- 2 comments
#29 - Training speed ups: +10-15% tokens/second/gpu
Pull Request -
State: closed - Opened by achalddave about 1 year ago
- 2 comments
#29 - Training speed ups: +10-15% tokens/second/gpu
Pull Request -
State: closed - Opened by achalddave about 1 year ago
- 2 comments
#28 - Fixed up generate.py, added to readme
Pull Request -
State: closed - Opened by revbucket about 1 year ago
#28 - Fixed up generate.py, added to readme
Pull Request -
State: closed - Opened by revbucket about 1 year ago
#27 - Consider low precision normalization
Issue -
State: closed - Opened by mitchellnw about 1 year ago
#27 - Consider low precision normalization
Issue -
State: closed - Opened by mitchellnw about 1 year ago
#25 - mup simple
Pull Request -
State: closed - Opened by sagadre about 1 year ago
- 1 comment
#25 - mup simple
Pull Request -
State: closed - Opened by sagadre about 1 year ago
- 1 comment
#24 - Consolidate eval arguments with train arguments
Pull Request -
State: closed - Opened by achalddave about 1 year ago
- 3 comments
#24 - Consolidate eval arguments with train arguments
Pull Request -
State: closed - Opened by achalddave about 1 year ago
- 3 comments
#23 - Add support for sparse mixture of experts (MoE)
Issue -
State: closed - Opened by sagadre about 1 year ago
- 1 comment
#23 - Add support for sparse mixture of experts (MoE)
Issue -
State: closed - Opened by sagadre about 1 year ago
- 1 comment
#22 - add fused cross entropy
Issue -
State: closed - Opened by kernelmachine about 1 year ago
#22 - add fused cross entropy
Issue -
State: closed - Opened by kernelmachine about 1 year ago
#21 - Benchmark tok/sec with other libs
Issue -
State: open - Opened by kernelmachine about 1 year ago
- 4 comments
#21 - Benchmark tok/sec with other libs
Issue -
State: open - Opened by kernelmachine about 1 year ago
- 4 comments
#20 - Add HF mixin
Pull Request -
State: closed - Opened by NielsRogge about 1 year ago
- 7 comments
#20 - Add HF mixin
Pull Request -
State: closed - Opened by NielsRogge about 1 year ago
- 7 comments
#19 - Release the intermediate checkpoints?
Issue -
State: open - Opened by NathanGodey about 1 year ago
- 2 comments
#19 - Release the intermediate checkpoints?
Issue -
State: open - Opened by NathanGodey about 1 year ago
- 2 comments
#18 - Add doc+support for evaluating with rotary-old
Pull Request -
State: closed - Opened by achalddave about 1 year ago
#18 - Add doc+support for evaluating with rotary-old
Pull Request -
State: closed - Opened by achalddave about 1 year ago
#17 - Vq gpt adaptation
Pull Request -
State: closed - Opened by iejMac about 1 year ago
#17 - Vq gpt adaptation
Pull Request -
State: closed - Opened by iejMac about 1 year ago
#16 - main.py: use data_key in get_wds_dataset
Pull Request -
State: closed - Opened by iejMac about 1 year ago
#16 - main.py: use data_key in get_wds_dataset
Pull Request -
State: closed - Opened by iejMac about 1 year ago
#15 - datapreprocesss: make_2048.py zero-pad sample ID
Pull Request -
State: closed - Opened by iejMac about 1 year ago
#15 - datapreprocesss: make_2048.py zero-pad sample ID
Pull Request -
State: closed - Opened by iejMac about 1 year ago
#14 - VQ-GPT
Pull Request -
State: closed - Opened by iejMac about 1 year ago
- 1 comment
#14 - VQ-GPT
Pull Request -
State: closed - Opened by iejMac about 1 year ago
- 1 comment
#13 - Replaced pile -> wikipedia in quickstart instructions
Pull Request -
State: closed - Opened by revbucket about 1 year ago
#13 - Replaced pile -> wikipedia in quickstart instructions
Pull Request -
State: closed - Opened by revbucket about 1 year ago
#12 - Add generation scripts
Pull Request -
State: closed - Opened by achalddave about 1 year ago
#12 - Add generation scripts
Pull Request -
State: closed - Opened by achalddave about 1 year ago
#11 - Revamp make_2048.py script
Issue -
State: closed - Opened by sagadre about 1 year ago
- 3 comments
#11 - Revamp make_2048.py script
Issue -
State: closed - Opened by sagadre about 1 year ago
- 3 comments
#10 - train.py: sample index in chunk when chunk_size >= seq_len
Pull Request -
State: closed - Opened by iejMac about 1 year ago
#10 - train.py: sample index in chunk when chunk_size >= seq_len
Pull Request -
State: closed - Opened by iejMac about 1 year ago
#9 - Unit test for grad accum
Issue -
State: closed - Opened by mitchellnw about 1 year ago
- 2 comments
#9 - Unit test for grad accum
Issue -
State: closed - Opened by mitchellnw about 1 year ago
- 2 comments
#8 - Improved dataloading with multiple sources without resampling.
Pull Request -
State: closed - Opened by GeorgiosSmyrnis about 1 year ago
- 4 comments
#8 - Improved dataloading with multiple sources without resampling.
Pull Request -
State: closed - Opened by GeorgiosSmyrnis about 1 year ago
- 4 comments
#7 - Exit on NaN loss
Pull Request -
State: closed - Opened by sagadre about 1 year ago
- 3 comments
#7 - Exit on NaN loss
Pull Request -
State: closed - Opened by sagadre about 1 year ago
- 3 comments
#6 - Hybrid sharding change
Pull Request -
State: closed - Opened by GeorgiosSmyrnis about 1 year ago
#5 - Positional embedding xformers fix
Pull Request -
State: closed - Opened by sagadre about 1 year ago
- 2 comments
#4 - Problem in position embedding
Issue -
State: closed - Opened by jmercat about 1 year ago
- 8 comments
#3 - Add a fused RMSNorm operation
Issue -
State: closed - Opened by mitchellnw about 1 year ago
- 4 comments
#2 - Update README.md
Pull Request -
State: closed - Opened by revbucket about 1 year ago
#1 - pip installable
Pull Request -
State: closed - Opened by sagadre about 1 year ago