Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / EleutherAI/gpt-neox issues and pull requests

#39 - Add improved data downloading class / pipeline

Pull Request - State: closed - Opened by sdtblck almost 4 years ago - 2 comments

#36 - Update requirements.txt

Pull Request - State: closed - Opened by sdtblck almost 4 years ago

#35 - Fix error in extracting OWT2 dataset

Pull Request - State: closed - Opened by steven-mi almost 4 years ago - 1 comment

#34 - feedforward GLU on by default

Pull Request - State: closed - Opened by lucidrains almost 4 years ago - 1 comment

#33 - Automatically download owt2

Pull Request - State: closed - Opened by steven-mi almost 4 years ago

#32 - Fix depreciated code

Issue - State: closed - Opened by StellaAthena almost 4 years ago - 1 comment
Labels: bug

#31 - disable reduce for loss calculation and calculate mean separately

Pull Request - State: closed - Opened by anthony-dipofi almost 4 years ago - 2 comments

#30 - untie classifier weights by default

Pull Request - State: closed - Opened by lucidrains almost 4 years ago

#29 - add linear warmup over 5000 steps and gradient clipping

Pull Request - State: closed - Opened by lucidrains almost 4 years ago

#28 - ftfy used in create_tfrecords.py but not listed in requirements.txt

Issue - State: closed - Opened by anthony-dipofi almost 4 years ago - 1 comment
Labels: bug

#27 - Update documentation

Issue - State: closed - Opened by StellaAthena almost 4 years ago - 1 comment
Labels: documentation

#26 - Hardcoded paths in gpt3_small.json

Issue - State: closed - Opened by anthony-dipofi almost 4 years ago
Labels: bug

#25 - make mask value smaller by factor of 2

Pull Request - State: closed - Opened by lucidrains almost 4 years ago - 1 comment

#24 - GPT-3 Small Works

Pull Request - State: closed - Opened by StellaAthena almost 4 years ago

#22 - Can't install Triton

Issue - State: closed - Opened by StellaAthena almost 4 years ago - 2 comments
Labels: bug

#21 - fix small bug where sequence length is not passed into attention class

Pull Request - State: closed - Opened by lucidrains almost 4 years ago

#20 - Integrate ZeRO-Powered Data Parallelism

Issue - State: closed - Opened by StellaAthena almost 4 years ago - 1 comment
Labels: feature request

#19 - Integrate the full power of ZeRo into the code

Issue - State: closed - Opened by StellaAthena almost 4 years ago - 1 comment
Labels: feature request

#18 - add ability to use fused layer norm with use_fused_layernorm=True flag

Pull Request - State: closed - Opened by lucidrains almost 4 years ago

#17 - add tfrecords dataset and make minor changes to configs/train script

Pull Request - State: closed - Opened by sdtblck almost 4 years ago

#16 - add sparse attention support, with ability to specify at which layers…

Pull Request - State: closed - Opened by lucidrains almost 4 years ago

#15 - add tensorboard logging

Pull Request - State: closed - Opened by sdtblck almost 4 years ago

#14 - Add params & remove gpu_monitor

Pull Request - State: closed - Opened by sdtblck almost 4 years ago - 2 comments

#13 - remove enwik8 data from repository

Pull Request - State: closed - Opened by lucidrains almost 4 years ago

#12 - Create train.sh

Pull Request - State: closed - Opened by sdtblck almost 4 years ago - 1 comment

#11 - cleanup deepspeed training scripts

Pull Request - State: closed - Opened by lucidrains almost 4 years ago - 1 comment

#10 - get rid of test file

Pull Request - State: closed - Opened by sdtblck almost 4 years ago - 2 comments

#9 - PR for Deepspeed Integration

Pull Request - State: closed - Opened by trisongz almost 4 years ago

#8 - Create CODEOWNERS

Pull Request - State: closed - Opened by StellaAthena almost 4 years ago

#7 - Create experiment runners

Issue - State: closed - Opened by StellaAthena almost 4 years ago - 11 comments
Labels: feature request, good first issue

#6 - Allow for alternative architectures

Issue - State: closed - Opened by StellaAthena almost 4 years ago
Labels: feature request

#5 - Build a Tensorboard

Issue - State: closed - Opened by StellaAthena almost 4 years ago
Labels: feature request

#4 - Integrate DeepSpeed

Issue - State: closed - Opened by StellaAthena almost 4 years ago
Labels: feature request

#3 - Data loading

Issue - State: closed - Opened by StellaAthena almost 4 years ago - 1 comment
Labels: feature request

#2 - working minimal GPT

Pull Request - State: closed - Opened by lucidrains almost 4 years ago - 3 comments

#1 - test

Pull Request - State: closed - Opened by lucidrains almost 4 years ago