Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / facebookresearch/lingua issues and pull requests

#79 - final PR

Pull Request - State: closed - Opened by prasannamayil 11 days ago - 1 comment

#76 - Missing set function in explicit model loading path

Issue - State: closed - Opened by Hannibal046 19 days ago - 2 comments

#75 - Weight tying fix and probe cleanup in train.py

Pull Request - State: closed - Opened by mathuvu 22 days ago - 3 comments
Labels: CLA Signed

#74 - How to correctly handle embedding tying?

Issue - State: closed - Opened by xinghaow99 27 days ago - 3 comments

#73 - fixing wsd learning rate scheduler

Pull Request - State: closed - Opened by mathuvu 27 days ago
Labels: CLA Signed

#70 - fixing dp_shard and deactivating gradient sync when using gradient accumulation

Pull Request - State: closed - Opened by mathuvu about 1 month ago
Labels: CLA Signed

#69 - Specifying shell executable.

Pull Request - State: closed - Opened by Nilabhra about 1 month ago - 2 comments
Labels: CLA Signed

#68 - Pr 59

Pull Request - State: closed - Opened by mathuvu about 1 month ago
Labels: CLA Signed

#67 - Pr 41

Pull Request - State: closed - Opened by mathuvu about 1 month ago
Labels: CLA Signed

#65 - Make fp8 compatible with tensor parallelism

Pull Request - State: open - Opened by lw about 2 months ago - 1 comment
Labels: CLA Signed

#64 - How to export the trained model to Huggingface format?

Issue - State: closed - Opened by macabdul9 2 months ago - 7 comments

#63 - Update float8 recipe

Pull Request - State: closed - Opened by lw 2 months ago - 2 comments
Labels: CLA Signed

#62 - [BUGFIX] Issue with TransformerBlock parallel plan and residual connections.

Pull Request - State: open - Opened by sirluk 3 months ago - 1 comment
Labels: CLA Signed

#60 - Multi-modal support

Issue - State: closed - Opened by Zengyi-Qin 3 months ago - 1 comment

#59 - fix gradient clipping w/ gradient accumulation

Pull Request - State: closed - Opened by hjlee1371 3 months ago - 2 comments
Labels: CLA Signed

#58 - Setting dp_shard > 1 get incorrect number of dp_rank

Issue - State: closed - Opened by new5558 3 months ago - 1 comment

#57 - Does it have flash-attention 2?

Issue - State: closed - Opened by wangyu-ustc 3 months ago - 1 comment

#56 - act checkpointing OOM, float8 causes CUDA memory allocation retries

Issue - State: open - Opened by Niccolo-Ajroldi 3 months ago - 20 comments

#55 - How data is sampled?

Issue - State: closed - Opened by macabdul9 3 months ago - 5 comments

#54 - Exporting trained models to vLLM?

Issue - State: closed - Opened by ryoungj 3 months ago - 1 comment

#53 - Better Documentation on Resuming

Issue - State: closed - Opened by Hprairie 3 months ago - 2 comments

#52 - Grad-Norm spike on transformer depth change

Issue - State: closed - Opened by akhauriyash 3 months ago - 3 comments

#51 - Wandb resuming on restart

Pull Request - State: closed - Opened by VivienCabannes 3 months ago
Labels: CLA Signed

#50 - CLI metrics viz script

Issue - State: closed - Opened by tginart 3 months ago - 1 comment

#49 - Multi-Node Distributed Issues

Issue - State: closed - Opened by Hprairie 4 months ago - 2 comments

#49 - Multi-Node Distributed Issues

Issue - State: closed - Opened by Hprairie 4 months ago - 2 comments

#48 - Adds new line after each file of dclm. (Untested)

Pull Request - State: open - Opened by BadrYoubiIdrissi 4 months ago
Labels: CLA Signed

#47 - various fixes

Pull Request - State: open - Opened by mathuvu 4 months ago
Labels: CLA Signed

#47 - various fixes

Pull Request - State: closed - Opened by mathuvu 4 months ago
Labels: CLA Signed

#46 - lm-eval-harness WikiText bug

Issue - State: open - Opened by akhauriyash 4 months ago - 15 comments

#45 - Mamba config has extra argument

Issue - State: closed - Opened by Hprairie 4 months ago - 1 comment

#45 - Mamba config has extra argument

Issue - State: closed - Opened by Hprairie 4 months ago - 1 comment

#44 - Potential bug in main generate.py

Issue - State: closed - Opened by akhauriyash 4 months ago - 1 comment

#44 - Potential bug in main generate.py

Issue - State: closed - Opened by akhauriyash 4 months ago - 1 comment

#43 - better formatting

Pull Request - State: closed - Opened by VivienCabannes 4 months ago
Labels: CLA Signed

#43 - better formatting

Pull Request - State: closed - Opened by VivienCabannes 4 months ago
Labels: CLA Signed

#42 - Potential data-processing issue

Issue - State: open - Opened by akhauriyash 4 months ago - 5 comments

#42 - Potential data-processing issue

Issue - State: open - Opened by akhauriyash 4 months ago - 5 comments

#41 - Fix some logging value

Pull Request - State: open - Opened by eliebak 4 months ago - 2 comments
Labels: CLA Signed

#41 - Fix some logging value

Pull Request - State: closed - Opened by eliebak 4 months ago - 2 comments
Labels: CLA Signed

#40 - Update debug.yaml

Pull Request - State: closed - Opened by VivienCabannes 4 months ago
Labels: CLA Signed

#40 - Update debug.yaml

Pull Request - State: closed - Opened by VivienCabannes 4 months ago
Labels: CLA Signed

#39 - global depth init std factor seems incorrect

Issue - State: closed - Opened by SeunghyunSEO 4 months ago - 2 comments

#39 - global depth init std factor seems incorrect

Issue - State: closed - Opened by SeunghyunSEO 4 months ago - 2 comments

#38 - Initialize from pretrained checkpoints

Issue - State: closed - Opened by ryoungj 4 months ago - 2 comments

#38 - Initialize from pretrained checkpoints

Issue - State: closed - Opened by ryoungj 4 months ago - 2 comments

#37 - Distributed Shampoo

Issue - State: open - Opened by Ryu1845 4 months ago - 2 comments

#37 - Distributed Shampoo

Issue - State: closed - Opened by Ryu1845 4 months ago - 3 comments

#36 - Loading from consolidated checkpoint

Issue - State: closed - Opened by SpirinEgor 4 months ago - 3 comments

#36 - Loading from consolidated checkpoint

Issue - State: closed - Opened by SpirinEgor 4 months ago - 3 comments

#35 - small fix on fp8

Pull Request - State: closed - Opened by mathuvu 4 months ago
Labels: CLA Signed

#35 - small fix on fp8

Pull Request - State: closed - Opened by mathuvu 4 months ago
Labels: CLA Signed

#34 - Got error while trying `float8`

Issue - State: closed - Opened by tiendung 4 months ago - 2 comments

#34 - Got error while trying `float8`

Issue - State: closed - Opened by tiendung 4 months ago - 2 comments

#33 - adding option in download prepare

Pull Request - State: closed - Opened by mathuvu 4 months ago
Labels: CLA Signed

#32 - Support for HuggingFace tokenizer

Issue - State: closed - Opened by zhengyang-wang 4 months ago - 3 comments

#32 - Support for HuggingFace tokenizer

Issue - State: closed - Opened by zhengyang-wang 4 months ago - 2 comments

#31 - train.log is rank0 only but actual stdout is all ranks

Issue - State: closed - Opened by 152334H 4 months ago - 3 comments

#30 - Overview of seeds used?

Issue - State: closed - Opened by 152334H 4 months ago - 1 comment

#29 - Val loss log

Issue - State: closed - Opened by zhengyang-wang 4 months ago - 4 comments

#29 - Val loss log

Issue - State: open - Opened by zhengyang-wang 4 months ago - 3 comments

#28 - mmlu evaluation not working

Issue - State: open - Opened by zhengyang-wang 4 months ago - 6 comments

#28 - mmlu evaluation not working

Issue - State: closed - Opened by zhengyang-wang 4 months ago - 8 comments

#27 - Adds huggingface tokenizers for easier download

Pull Request - State: closed - Opened by BadrYoubiIdrissi 4 months ago
Labels: CLA Signed

#27 - Adds huggingface tokenizers for easier download

Pull Request - State: closed - Opened by BadrYoubiIdrissi 4 months ago
Labels: CLA Signed

#26 - Find conda if `CONDA_ROOT` does not exist

Pull Request - State: closed - Opened by gregjohnso 4 months ago - 3 comments
Labels: CLA Signed

#26 - Find conda if `CONDA_ROOT` does not exist

Pull Request - State: open - Opened by gregjohnso 4 months ago - 2 comments
Labels: CLA Signed

#25 - avoid hardcoding `/scratch/`

Pull Request - State: open - Opened by 152334H 4 months ago - 5 comments
Labels: CLA Signed

#25 - avoid hardcoding `/scratch/`

Pull Request - State: closed - Opened by 152334H 4 months ago - 5 comments
Labels: CLA Signed

#24 - Fixing instructions for a fresh install

Pull Request - State: closed - Opened by wesbz 4 months ago - 3 comments
Labels: CLA Signed

#23 - can't download tokenizer

Issue - State: closed - Opened by ath3great 4 months ago - 1 comment

#23 - can't download tokenizer

Issue - State: closed - Opened by ath3great 4 months ago - 1 comment

#22 - Update requirements.txt

Pull Request - State: closed - Opened by dinhphong81 4 months ago - 2 comments

#22 - Update requirements.txt

Pull Request - State: closed - Opened by dinhphong81 4 months ago - 2 comments

#21 - Delete apps directory

Pull Request - State: closed - Opened by Cmac89621 4 months ago - 1 comment

#21 - Delete apps directory

Pull Request - State: closed - Opened by Cmac89621 4 months ago - 1 comment

#20 - https://github.com/facebookresearch/lingua.git

Issue - State: closed - Opened by Ivanlinsousa 4 months ago - 1 comment

#20 - https://github.com/facebookresearch/lingua.git

Issue - State: closed - Opened by Ivanlinsousa 4 months ago - 1 comment

#19 - Failed to Build Wheels for xformers and Compatibility

Issue - State: closed - Opened by nhtlongcs 4 months ago - 6 comments

#19 - Failed to Build Wheels for xformers and Compatibility

Issue - State: closed - Opened by nhtlongcs 4 months ago - 5 comments

#18 - Missing requirements, Readme Updates, ParquetReader Support, Conda shell

Pull Request - State: closed - Opened by yukiman76 4 months ago - 3 comments
Labels: CLA Signed

#18 - Missing requirements, Readme Updates, ParquetReader Support, Conda shell

Pull Request - State: closed - Opened by yukiman76 4 months ago - 3 comments
Labels: CLA Signed

#13 - lingua or touchtune or torchtitan?

Issue - State: closed - Opened by youngsheen 4 months ago - 5 comments

#13 - lingua or touchtune or torchtitan?

Issue - State: closed - Opened by youngsheen 4 months ago - 5 comments

#12 - Very small Readme adjustment

Pull Request - State: closed - Opened by MekkCyber 4 months ago - 2 comments
Labels: CLA Signed

#12 - Very small Readme adjustment

Pull Request - State: closed - Opened by MekkCyber 4 months ago - 2 comments
Labels: CLA Signed

#11 - Are there plans to open source the model weights of llama squared relu?

Issue - State: closed - Opened by YixinSong-e 4 months ago - 1 comment

#11 - Are there plans to open source the model weights of llama squared relu?

Issue - State: closed - Opened by YixinSong-e 4 months ago - 1 comment

#10 - stool not working due to sinfo schema compatibility?

Issue - State: closed - Opened by zkx06111 4 months ago - 2 comments

#10 - stool not working due to sinfo schema compatibility?

Issue - State: closed - Opened by zkx06111 4 months ago - 2 comments

#9 - Fix super tiny little typo

Pull Request - State: closed - Opened by fzyzcjy 4 months ago - 2 comments
Labels: CLA Signed