Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / EleutherAI/gpt-neox issues and pull requests
#1288 - Reinforce PR
Pull Request -
State: open - Opened by dmahan93 3 days ago
#1287 - Remove the remaining two hanging wandb config fields
Pull Request -
State: closed - Opened by Quentin-Anthony 4 days ago
#1286 - Make monitors consistent
Pull Request -
State: closed - Opened by Quentin-Anthony 4 days ago
#1285 - Fix off by 1 error on masked tokens for RM training
Pull Request -
State: closed - Opened by dmahan93 4 days ago
#1284 - Update Comet integration instructions
Pull Request -
State: closed - Opened by Lothiraldan 4 days ago
#1283 - Automatically compute train_iters when train_epochs is specified.
Pull Request -
State: open - Opened by AI-WAIFU 5 days ago
#1282 - TransformerEngine Integration
Pull Request -
State: open - Opened by aurelion-source 6 days ago
- 1 comment
#1281 - Add model parallel group to reduce scatter
Pull Request -
State: closed - Opened by bclyang 7 days ago
#1280 - Do not fail when git is not installed
Pull Request -
State: open - Opened by gcaillaut 9 days ago
- 1 comment
#1279 - fix the imports needed for comet integration
Pull Request -
State: closed - Opened by Quentin-Anthony 11 days ago
#1278 - fix gpt-j residual bias assumption
Pull Request -
State: closed - Opened by dmahan93 11 days ago
#1277 - Post training examples
Pull Request -
State: closed - Opened by dmahan93 11 days ago
- 3 comments
#1276 - Hotfix llama models
Pull Request -
State: closed - Opened by dmahan93 12 days ago
- 1 comment
#1275 - Add more informative checks for ZeRO incompatibility.
Pull Request -
State: closed - Opened by AI-WAIFU 12 days ago
#1274 - Fix weight decay module check
Pull Request -
State: closed - Opened by aurelion-source 12 days ago
#1273 - Expand Docstring
Pull Request -
State: closed - Opened by AI-WAIFU 12 days ago
#1272 - TE Import Hotfix
Pull Request -
State: closed - Opened by Quentin-Anthony 13 days ago
- 1 comment
#1271 - Hotfix Activation Typo
Pull Request -
State: closed - Opened by Quentin-Anthony 13 days ago
#1270 - Formatting and Fix Mamba Config
Pull Request -
State: closed - Opened by Quentin-Anthony 13 days ago
#1269 - LayerNorm Refactor
Pull Request -
State: closed - Opened by aurelion-source 15 days ago
- 3 comments
#1268 - Allow training without knowing num_iters
Issue -
State: open - Opened by StellaAthena 15 days ago
- 1 comment
Labels: feature request
#1267 - Add assert to check for missing tokenizer_type in config. [#1231]
Pull Request -
State: closed - Opened by AI-WAIFU 17 days ago
- 1 comment
#1266 - Add initial ring flash attention support
Pull Request -
State: open - Opened by dmahan93 17 days ago
- 1 comment
#1265 - add Apex fused RMS norm
Pull Request -
State: closed - Opened by dmahan93 23 days ago
- 1 comment
#1264 - Frontier
Pull Request -
State: closed - Opened by jahatef 28 days ago
- 1 comment
#1263 - Improve performance of sequence parallel gather, scatter, and reduce
Pull Request -
State: closed - Opened by bclyang about 1 month ago
#1262 - mamba fixes and cleaning
Pull Request -
State: closed - Opened by jahatef about 1 month ago
- 2 comments
#1261 - Comet integration
Pull Request -
State: closed - Opened by jverre about 1 month ago
- 2 comments
#1260 - Fix gather and reduce scatter ops on sequence dimension
Pull Request -
State: closed - Opened by bclyang about 1 month ago
#1259 - Fix LayerNorm all reduce gradient hook
Pull Request -
State: closed - Opened by bclyang about 1 month ago
- 1 comment
#1258 - bugfix: chat turns instead of repeating the conversation in preprocess_data_with_chat_template.py
Pull Request -
State: closed - Opened by dmahan93 about 1 month ago
#1257 - Megatron-LM style Sequence Parallel
Pull Request -
State: closed - Opened by haileyschoelkopf about 1 month ago
- 3 comments
#1256 - GitHub actions fix
Pull Request -
State: closed - Opened by jahatef about 2 months ago
#1255 - Add new cites
Pull Request -
State: closed - Opened by StellaAthena about 2 months ago
- 1 comment
#1254 - How to Load Model from pytorch_model.bin into Trained Model for Text Generation?
Issue -
State: open - Opened by lieh1203 2 months ago
Labels: feature request
#1253 - what's the biggest dataset you've tried?
Issue -
State: open - Opened by exnx 2 months ago
Labels: bug
#1252 - too many .bin files for dataloader, crashed
Issue -
State: closed - Opened by exnx 2 months ago
Labels: bug
#1251 - Assertion Error when Setting pipe_parallel_size or model_parallel_size in GPT-NeoX
Issue -
State: open - Opened by lieh1203 2 months ago
- 3 comments
Labels: bug
#1250 - For nucleus sampling, top-p sampling appears to happen on the softmax-normalized top-k logits
Issue -
State: closed - Opened by j-frei 3 months ago
- 3 comments
Labels: bug
#1248 - batch_input and elapsed time per iteration suddenly slow down during model training
Issue -
State: open - Opened by Yuhanleeee 3 months ago
- 4 comments
Labels: bug
#1247 - Add hf llama to neox conversion
Pull Request -
State: closed - Opened by dmahan93 3 months ago
- 1 comment
#1246 - Add Reward Model training
Pull Request -
State: closed - Opened by dmahan93 3 months ago
#1245 - Conversion for CI from self-hosted hardware
Pull Request -
State: closed - Opened by jaimemcc-intel 3 months ago
#1244 - Add KTO training
Pull Request -
State: closed - Opened by dmahan93 3 months ago
#1243 - Replace unsafe `pyyaml` loader with `SafeLoader` (#2)
Pull Request -
State: closed - Opened by pixeeai 3 months ago
- 1 comment
#1242 - Add DPO training
Pull Request -
State: closed - Opened by dmahan93 3 months ago
- 1 comment
#1241 - Fix paper reference in init_functions.py
Pull Request -
State: closed - Opened by rasbt 3 months ago
- 2 comments
#1240 - SFT improvements (labeling fixes, different packing implementations)
Pull Request -
State: closed - Opened by dmahan93 3 months ago
#1239 - Add a chat data preprocessing script
Pull Request -
State: closed - Opened by dmahan93 3 months ago
#1238 - Pr1212
Pull Request -
State: closed - Opened by jahatef 3 months ago
#1237 - Add tensor parallelism for RWKV
Pull Request -
State: open - Opened by jahatef 3 months ago
#1236 - Ville dev
Pull Request -
State: closed - Opened by Vmjkom 3 months ago
- 1 comment
#1235 - Add Transformer Engine's version of RMSNorm and LayerNorm
Pull Request -
State: closed - Opened by lintangsutawika 3 months ago
- 2 comments
#1234 - fix python version and pytest install
Pull Request -
State: closed - Opened by jahatef 4 months ago
- 5 comments
#1233 - add workflow_dispatch to gh actions pr so we can run on command
Pull Request -
State: closed - Opened by jahatef 4 months ago
#1232 - init changes to README
Pull Request -
State: closed - Opened by jaimemcc-intel 4 months ago
#1231 - Cannot convert neox model to HF
Issue -
State: open - Opened by srivassid 4 months ago
- 2 comments
Labels: bug
#1230 - How to set the ffn hidden size parameter in gpt neox
Issue -
State: closed - Opened by IronMan-WangJinxi 4 months ago
- 2 comments
Labels: feature request
#1228 - Cannot perform inference, be it unconditional. input-file or interactive
Issue -
State: closed - Opened by srivassid 4 months ago
- 1 comment
Labels: bug
#1227 - The results of running eval show only 1 digit after decimal point for acc on all tested tasks
Issue -
State: closed - Opened by lernerjenny 4 months ago
- 2 comments
Labels: bug
#1226 - Add Torch Profiler Support
Pull Request -
State: closed - Opened by DayOfThePenguin 4 months ago
#1225 - Add lora support
Pull Request -
State: open - Opened by mkerin 4 months ago
#1224 - fixed fused_rope naming in JIT + Readme
Pull Request -
State: closed - Opened by R0n12 4 months ago
#1223 - Change python invocation syntax
Pull Request -
State: closed - Opened by jaimemcc-intel 4 months ago
#1222 - Small tidying
Pull Request -
State: closed - Opened by yang 4 months ago
#1221 - Rwkv pipeline parallelism
Pull Request -
State: closed - Opened by jahatef 4 months ago
- 1 comment
#1220 - fix conversion of hf -> neox for pythia in model parallel
Pull Request -
State: closed - Opened by dmahan93 4 months ago
#1219 - Fix changed behavior of pipe_parallel
Pull Request -
State: closed - Opened by yang 4 months ago
#1218 - Conversion script bugfixes
Pull Request -
State: closed - Opened by haileyschoelkopf 4 months ago
- 3 comments
#1217 - Fix markdown formatting error
Pull Request -
State: closed - Opened by StellaAthena 4 months ago
#1216 - Run document update again
Pull Request -
State: closed - Opened by jahatef 4 months ago
#1215 - fixed typos
Pull Request -
State: closed - Opened by jahatef 4 months ago
#1214 - fix pipeline parallelism detection
Pull Request -
State: closed - Opened by dmahan93 4 months ago
- 2 comments
#1213 - Add Transformer Engine
Pull Request -
State: open - Opened by Quentin-Anthony 4 months ago
- 1 comment
#1212 - Add `intermediate_size` to GPT-NeoX models
Pull Request -
State: closed - Opened by dtamayo-nlp 4 months ago
- 5 comments
#1211 - Bump jinja2 from 3.1.3 to 3.1.4 in /requirements
Pull Request -
State: closed - Opened by dependabot[bot] 5 months ago
Labels: dependencies
#1210 - Dmoe integration
Pull Request -
State: open - Opened by DayOfThePenguin 5 months ago
#1209 - Fix bug in tools/ckpts/convert_neox_to_hf.py for setting intermediate_size
Pull Request -
State: closed - Opened by jvendrow 5 months ago
- 2 comments
#1208 - 'intermediate_size' not set in tools/ckpts/convert_neox_to_hf.py for neox model architecture
Issue -
State: closed - Opened by jvendrow 5 months ago
- 3 comments
Labels: bug
#1203 - My servers used for multi-node training do not have ssh. How can I launch multi-node training using the torchrun command?
Issue -
State: open - Opened by dingning97 5 months ago
- 2 comments
Labels: feature request
#1201 - Create cmake-multi-platform.yml
Pull Request -
State: closed - Opened by Romario242003 5 months ago
- 2 comments
#1198 - add rwkv support
Pull Request -
State: closed - Opened by jahatef 6 months ago
- 2 comments
Labels: merge-queue
#1197 - Megablocks-based MoE
Pull Request -
State: closed - Opened by DayOfThePenguin 6 months ago
- 1 comment
#1194 - Added infinite lr schedules
Pull Request -
State: open - Opened by kshitijkg 6 months ago
Labels: merge-queue
#1192 - Add megablocks dropless MoE
Pull Request -
State: closed - Opened by yang 6 months ago
#1191 - [ZeRO-3] Ensured passing neox deepspeed_config when using partitioned init
Pull Request -
State: closed - Opened by R0n12 6 months ago
#1188 - [AMD] Supporting fused kernels build using JIT
Pull Request -
State: closed - Opened by R0n12 6 months ago
- 2 comments
#1177 - Remove unused requirements-sparseattention
Pull Request -
State: closed - Opened by segyges 7 months ago
- 2 comments
#1167 - Add Basic RWKV Block to GPT-NeoX
Issue -
State: closed - Opened by Quentin-Anthony 7 months ago
- 1 comment
Labels: feature request
#1156 - Fused kernel support for AMD (using JIT)
Pull Request -
State: closed - Opened by R0n12 7 months ago
- 3 comments
#1139 - Better run_eval_harness import
Pull Request -
State: closed - Opened by R0n12 8 months ago
- 1 comment
#1119 - Create Singularity Container
Issue -
State: open - Opened by Quentin-Anthony 8 months ago
- 3 comments
Labels: feature request, good first issue, help wanted
#1088 - Finetune
Issue -
State: closed - Opened by liuxinxin123 10 months ago
- 4 comments
Labels: feature request
#1084 - Support for DeepSpeed Ulysses (SP)
Pull Request -
State: closed - Opened by Quentin-Anthony 10 months ago
- 1 comment
#1078 - Port DeepSpeed Ulysses
Issue -
State: closed - Opened by Quentin-Anthony 10 months ago
- 2 comments
Labels: feature request
#979 - Dataload fix
Pull Request -
State: closed - Opened by jahatef over 1 year ago
- 2 comments
#878 - Deepspeed benchmarking
Pull Request -
State: open - Opened by cr458 over 1 year ago
- 1 comment
#812 - Add support for sequence parallelism
Issue -
State: closed - Opened by Quentin-Anthony over 1 year ago
- 12 comments
Labels: feature request, help wanted
#677 - MoE Support
Pull Request -
State: closed - Opened by Quentin-Anthony about 2 years ago
- 1 comment
#645 - RuntimeError: Error(s) in loading state_dict for EmbeddingPipe: size mismatch for word_embeddings.weight
Issue -
State: open - Opened by mcao516 about 2 years ago
- 9 comments
Labels: bug, good first issue, help wanted