Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / facebookresearch/lingua issues and pull requests
#80 - Could You Share the Llama 7B 200B DCLM Token Checkpoint
Issue -
State: open - Opened by jasonkrone 4 days ago
#79 - final PR
Pull Request -
State: closed - Opened by prasannamayil 11 days ago
- 1 comment
#78 - Bug Report: Abnormal Validation Loss When Resuming Training from DCP Checkpoint
Issue -
State: open - Opened by Hannibal046 14 days ago
- 1 comment
#77 - Unable to upload model to HuggingFace for usage with AutoClasses
Issue -
State: closed - Opened by miraumlauf 14 days ago
#76 - Missing set function in explicit model loading path
Issue -
State: closed - Opened by Hannibal046 19 days ago
- 2 comments
#75 - Weight tying fix and probe cleanup in train.py
Pull Request -
State: closed - Opened by mathuvu 22 days ago
- 3 comments
Labels: CLA Signed
#74 - How to correctly handle embedding tying?
Issue -
State: closed - Opened by xinghaow99 27 days ago
- 3 comments
#73 - fixing wsd learning rate scheduler
Pull Request -
State: closed - Opened by mathuvu 27 days ago
Labels: CLA Signed
#70 - fixing dp_shard and deactivating gradient sync when using gradient accumulation
Pull Request -
State: closed - Opened by mathuvu about 1 month ago
Labels: CLA Signed
#69 - Specifying shell executable.
Pull Request -
State: closed - Opened by Nilabhra about 1 month ago
- 2 comments
Labels: CLA Signed
#68 - Pr 59
Pull Request -
State: closed - Opened by mathuvu about 1 month ago
Labels: CLA Signed
#67 - Pr 41
Pull Request -
State: closed - Opened by mathuvu about 1 month ago
Labels: CLA Signed
#66 - Potential Problems in dp_rank Calculation and Gradient Synchronization Efficiency
Issue -
State: closed - Opened by Hannibal046 about 1 month ago
- 2 comments
#65 - Make fp8 compatible with tensor parallelism
Pull Request -
State: open - Opened by lw about 2 months ago
- 1 comment
Labels: CLA Signed
#64 - How to export the trained model to Huggingface format?
Issue -
State: closed - Opened by macabdul9 2 months ago
- 7 comments
#63 - Update float8 recipe
Pull Request -
State: closed - Opened by lw 2 months ago
- 2 comments
Labels: CLA Signed
#62 - [BUGFIX] Issue with TransformerBlock parallel plan and residual connections.
Pull Request -
State: open - Opened by sirluk 3 months ago
- 1 comment
Labels: CLA Signed
#61 - Dataloader loops over a small set of chunks so may not use all data - is this intentional?
Issue -
State: closed - Opened by tanishqkumar 3 months ago
- 1 comment
#60 - Multi-modal support
Issue -
State: closed - Opened by Zengyi-Qin 3 months ago
- 1 comment
#59 - fix gradient clipping w/ gradient accumulation
Pull Request -
State: closed - Opened by hjlee1371 3 months ago
- 2 comments
Labels: CLA Signed
#58 - Setting dp_shard > 1 get incorrect number of dp_rank
Issue -
State: closed - Opened by new5558 3 months ago
- 1 comment
#57 - Does it have flash-attention 2?
Issue -
State: closed - Opened by wangyu-ustc 3 months ago
- 1 comment
#56 - act checkpointing OOM, float8 causes CUDA memory allocation retries
Issue -
State: open - Opened by Niccolo-Ajroldi 3 months ago
- 20 comments
#55 - How data is sampled?
Issue -
State: closed - Opened by macabdul9 3 months ago
- 5 comments
#54 - Exporting trained models to vLLM?
Issue -
State: closed - Opened by ryoungj 3 months ago
- 1 comment
#53 - Better Documentation on Resuming
Issue -
State: closed - Opened by Hprairie 3 months ago
- 2 comments
#52 - Grad-Norm spike on transformer depth change
Issue -
State: closed - Opened by akhauriyash 3 months ago
- 3 comments
#51 - Wandb resuming on restart
Pull Request -
State: closed - Opened by VivienCabannes 3 months ago
Labels: CLA Signed
#50 - CLI metrics viz script
Issue -
State: closed - Opened by tginart 3 months ago
- 1 comment
#49 - Multi-Node Distributed Issues
Issue -
State: closed - Opened by Hprairie 4 months ago
- 2 comments
#48 - Adds new line after each file of dclm. (Untested)
Pull Request -
State: open - Opened by BadrYoubiIdrissi 4 months ago
Labels: CLA Signed
#47 - various fixes
Pull Request -
State: open - Opened by mathuvu 4 months ago
Labels: CLA Signed
#47 - various fixes
Pull Request -
State: closed - Opened by mathuvu 4 months ago
Labels: CLA Signed
#46 - lm-eval-harness WikiText bug
Issue -
State: open - Opened by akhauriyash 4 months ago
- 15 comments
#45 - Mamba config has extra argument
Issue -
State: closed - Opened by Hprairie 4 months ago
- 1 comment
#44 - Potential bug in main generate.py
Issue -
State: closed - Opened by akhauriyash 4 months ago
- 1 comment
#43 - better formatting
Pull Request -
State: closed - Opened by VivienCabannes 4 months ago
Labels: CLA Signed
#42 - Potential data-processing issue
Issue -
State: open - Opened by akhauriyash 4 months ago
- 5 comments
#41 - Fix some logging value
Pull Request -
State: open - Opened by eliebak 4 months ago
- 2 comments
Labels: CLA Signed
#41 - Fix some logging value
Pull Request -
State: closed - Opened by eliebak 4 months ago
- 2 comments
Labels: CLA Signed
#40 - Update debug.yaml
Pull Request -
State: closed - Opened by VivienCabannes 4 months ago
Labels: CLA Signed
#39 - global depth init std factor seems incorrect
Issue -
State: closed - Opened by SeunghyunSEO 4 months ago
- 2 comments
#38 - Initialize from pretrained checkpoints
Issue -
State: closed - Opened by ryoungj 4 months ago
- 2 comments
#37 - Distributed Shampoo
Issue -
State: open - Opened by Ryu1845 4 months ago
- 2 comments
#37 - Distributed Shampoo
Issue -
State: closed - Opened by Ryu1845 4 months ago
- 3 comments
#36 - Loading from consolidated checkpoint
Issue -
State: closed - Opened by SpirinEgor 4 months ago
- 3 comments
#35 - small fix on fp8
Pull Request -
State: closed - Opened by mathuvu 4 months ago
Labels: CLA Signed
#34 - Got error while trying `float8`
Issue -
State: closed - Opened by tiendung 4 months ago
- 2 comments
#33 - adding option in download prepare
Pull Request -
State: closed - Opened by mathuvu 4 months ago
Labels: CLA Signed
#32 - Support for HuggingFace tokenizer
Issue -
State: closed - Opened by zhengyang-wang 4 months ago
- 3 comments
#32 - Support for HuggingFace tokenizer
Issue -
State: closed - Opened by zhengyang-wang 4 months ago
- 2 comments
#31 - train.log is rank0 only but actual stdout is all ranks
Issue -
State: closed - Opened by 152334H 4 months ago
- 3 comments
#30 - Overview of seeds used?
Issue -
State: closed - Opened by 152334H 4 months ago
- 1 comment
#29 - Val loss log
Issue -
State: closed - Opened by zhengyang-wang 4 months ago
- 4 comments
#29 - Val loss log
Issue -
State: open - Opened by zhengyang-wang 4 months ago
- 3 comments
#28 - mmlu evaluation not working
Issue -
State: open - Opened by zhengyang-wang 4 months ago
- 6 comments
#28 - mmlu evaluation not working
Issue -
State: closed - Opened by zhengyang-wang 4 months ago
- 8 comments
#27 - Adds huggingface tokenizers for easier download
Pull Request -
State: closed - Opened by BadrYoubiIdrissi 4 months ago
Labels: CLA Signed
#26 - Find conda if `CONDA_ROOT` does not exist
Pull Request -
State: closed - Opened by gregjohnso 4 months ago
- 3 comments
Labels: CLA Signed
#26 - Find conda if `CONDA_ROOT` does not exist
Pull Request -
State: open - Opened by gregjohnso 4 months ago
- 2 comments
Labels: CLA Signed
#25 - avoid hardcoding `/scratch/`
Pull Request -
State: open - Opened by 152334H 4 months ago
- 5 comments
Labels: CLA Signed
#25 - avoid hardcoding `/scratch/`
Pull Request -
State: closed - Opened by 152334H 4 months ago
- 5 comments
Labels: CLA Signed
#24 - Fixing instructions for a fresh install
Pull Request -
State: closed - Opened by wesbz 4 months ago
- 3 comments
Labels: CLA Signed
#23 - can't download tokenizer
Issue -
State: closed - Opened by ath3great 4 months ago
- 1 comment
#22 - Update requirements.txt
Pull Request -
State: closed - Opened by dinhphong81 4 months ago
- 2 comments
#21 - Delete apps directory
Pull Request -
State: closed - Opened by Cmac89621 4 months ago
- 1 comment
#20 - https://github.com/facebookresearch/lingua.git
Issue -
State: closed - Opened by Ivanlinsousa 4 months ago
- 1 comment
#19 - Failed to Build Wheels for xformers and Compatibility
Issue -
State: closed - Opened by nhtlongcs 4 months ago
- 6 comments
#19 - Failed to Build Wheels for xformers and Compatibility
Issue -
State: closed - Opened by nhtlongcs 4 months ago
- 5 comments
#18 - Missing requirements, Readme Updates, ParquetReader Support, Conda shell
Pull Request -
State: closed - Opened by yukiman76 4 months ago
- 3 comments
Labels: CLA Signed
#17 - where can i find a mtp generate block ? (Self-speculative decoding) for learning purpose.
Issue -
State: closed - Opened by manmay-nakhashi 4 months ago
- 1 comment
#15 - Does this liberary contain context parallel to train long-context models?
Issue -
State: closed - Opened by ZetangForward 4 months ago
- 2 comments
#13 - lingua or touchtune or torchtitan?
Issue -
State: closed - Opened by youngsheen 4 months ago
- 5 comments
#12 - Very small Readme adjustment
Pull Request -
State: closed - Opened by MekkCyber 4 months ago
- 2 comments
Labels: CLA Signed
#11 - Are there plans to open source the model weights of llama squared relu?
Issue -
State: closed - Opened by YixinSong-e 4 months ago
- 1 comment
#10 - stool not working due to sinfo schema compatibility?
Issue -
State: closed - Opened by zkx06111 4 months ago
- 2 comments
#9 - Fix super tiny little typo
Pull Request -
State: closed - Opened by fzyzcjy 4 months ago
- 2 comments
Labels: CLA Signed