Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / EleutherAI/gpt-neox issues and pull requests
#39 - Add improved data downloading class / pipeline
Pull Request -
State: closed - Opened by sdtblck almost 4 years ago
- 2 comments
#38 - Openwebtext2 dataset checks for presence of tar.gz file to assess whether to auto-download rather than extracted dataset
Issue -
State: closed - Opened by sdtblck almost 4 years ago
#37 - Dataset downloads <number of GPUs> times when running deepspeed train.py
Issue -
State: closed - Opened by sdtblck almost 4 years ago
#36 - Update requirements.txt
Pull Request -
State: closed - Opened by sdtblck almost 4 years ago
#35 - Fix error in extracting OWT2 dataset
Pull Request -
State: closed - Opened by steven-mi almost 4 years ago
- 1 comment
#34 - feedforward GLU on by default
Pull Request -
State: closed - Opened by lucidrains almost 4 years ago
- 1 comment
#33 - Automatically download owt2
Pull Request -
State: closed - Opened by steven-mi almost 4 years ago
#32 - Fix depreciated code
Issue -
State: closed - Opened by StellaAthena almost 4 years ago
- 1 comment
Labels: bug
#31 - disable reduce for loss calculation and calculate mean separately
Pull Request -
State: closed - Opened by anthony-dipofi almost 4 years ago
- 2 comments
#30 - untie classifier weights by default
Pull Request -
State: closed - Opened by lucidrains almost 4 years ago
#29 - add linear warmup over 5000 steps and gradient clipping
Pull Request -
State: closed - Opened by lucidrains almost 4 years ago
#28 - ftfy used in create_tfrecords.py but not listed in requirements.txt
Issue -
State: closed - Opened by anthony-dipofi almost 4 years ago
- 1 comment
Labels: bug
#27 - Update documentation
Issue -
State: closed - Opened by StellaAthena almost 4 years ago
- 1 comment
Labels: documentation
#26 - Hardcoded paths in gpt3_small.json
Issue -
State: closed - Opened by anthony-dipofi almost 4 years ago
Labels: bug
#25 - make mask value smaller by factor of 2
Pull Request -
State: closed - Opened by lucidrains almost 4 years ago
- 1 comment
#24 - GPT-3 Small Works
Pull Request -
State: closed - Opened by StellaAthena almost 4 years ago
#23 - fix small bug where sequence length is not passed into attention clas…
Pull Request -
State: closed - Opened by StellaAthena almost 4 years ago
#22 - Can't install Triton
Issue -
State: closed - Opened by StellaAthena almost 4 years ago
- 2 comments
Labels: bug
#21 - fix small bug where sequence length is not passed into attention class
Pull Request -
State: closed - Opened by lucidrains almost 4 years ago
#20 - Integrate ZeRO-Powered Data Parallelism
Issue -
State: closed - Opened by StellaAthena almost 4 years ago
- 1 comment
Labels: feature request
#19 - Integrate the full power of ZeRo into the code
Issue -
State: closed - Opened by StellaAthena almost 4 years ago
- 1 comment
Labels: feature request
#18 - add ability to use fused layer norm with use_fused_layernorm=True flag
Pull Request -
State: closed - Opened by lucidrains almost 4 years ago
#17 - add tfrecords dataset and make minor changes to configs/train script
Pull Request -
State: closed - Opened by sdtblck almost 4 years ago
#16 - add sparse attention support, with ability to specify at which layers…
Pull Request -
State: closed - Opened by lucidrains almost 4 years ago
#15 - add tensorboard logging
Pull Request -
State: closed - Opened by sdtblck almost 4 years ago
#14 - Add params & remove gpu_monitor
Pull Request -
State: closed - Opened by sdtblck almost 4 years ago
- 2 comments
#13 - remove enwik8 data from repository
Pull Request -
State: closed - Opened by lucidrains almost 4 years ago
#12 - Create train.sh
Pull Request -
State: closed - Opened by sdtblck almost 4 years ago
- 1 comment
#11 - cleanup deepspeed training scripts
Pull Request -
State: closed - Opened by lucidrains almost 4 years ago
- 1 comment
#10 - get rid of test file
Pull Request -
State: closed - Opened by sdtblck almost 4 years ago
- 2 comments
#9 - PR for Deepspeed Integration
Pull Request -
State: closed - Opened by trisongz almost 4 years ago
#8 - Create CODEOWNERS
Pull Request -
State: closed - Opened by StellaAthena almost 4 years ago
#7 - Create experiment runners
Issue -
State: closed - Opened by StellaAthena almost 4 years ago
- 11 comments
Labels: feature request, good first issue
#6 - Allow for alternative architectures
Issue -
State: closed - Opened by StellaAthena almost 4 years ago
Labels: feature request
#5 - Build a Tensorboard
Issue -
State: closed - Opened by StellaAthena almost 4 years ago
Labels: feature request
#4 - Integrate DeepSpeed
Issue -
State: closed - Opened by StellaAthena almost 4 years ago
Labels: feature request
#3 - Data loading
Issue -
State: closed - Opened by StellaAthena almost 4 years ago
- 1 comment
Labels: feature request
#2 - working minimal GPT
Pull Request -
State: closed - Opened by lucidrains almost 4 years ago
- 3 comments
#1 - test
Pull Request -
State: closed - Opened by lucidrains almost 4 years ago