Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / EleutherAI/gpt-neox issues and pull requests

#1288 - Reinforce PR

Pull Request - State: open - Opened by dmahan93 3 days ago

#1287 - Remove the remaining two hanging wandb config fields

Pull Request - State: closed - Opened by Quentin-Anthony 4 days ago

#1286 - Make monitors consistent

Pull Request - State: closed - Opened by Quentin-Anthony 4 days ago

#1285 - Fix off by 1 error on masked tokens for RM training

Pull Request - State: closed - Opened by dmahan93 4 days ago

#1284 - Update Comet integration instructions

Pull Request - State: closed - Opened by Lothiraldan 4 days ago

#1282 - TransformerEngine Integration

Pull Request - State: open - Opened by aurelion-source 6 days ago - 1 comment

#1281 - Add model parallel group to reduce scatter

Pull Request - State: closed - Opened by bclyang 7 days ago

#1280 - Do not fail when git is not installed

Pull Request - State: open - Opened by gcaillaut 9 days ago - 1 comment

#1279 - fix the imports needed for comet integration

Pull Request - State: closed - Opened by Quentin-Anthony 11 days ago

#1278 - fix gpt-j residual bias assumption

Pull Request - State: closed - Opened by dmahan93 11 days ago

#1277 - Post training examples

Pull Request - State: closed - Opened by dmahan93 11 days ago - 3 comments

#1276 - Hotfix llama models

Pull Request - State: closed - Opened by dmahan93 12 days ago - 1 comment

#1275 - Add more informative checks for ZeRO incompatibility.

Pull Request - State: closed - Opened by AI-WAIFU 12 days ago

#1274 - Fix weight decay module check

Pull Request - State: closed - Opened by aurelion-source 12 days ago

#1273 - Expand Docstring

Pull Request - State: closed - Opened by AI-WAIFU 12 days ago

#1272 - TE Import Hotfix

Pull Request - State: closed - Opened by Quentin-Anthony 13 days ago - 1 comment

#1271 - Hotfix Activation Typo

Pull Request - State: closed - Opened by Quentin-Anthony 13 days ago

#1270 - Formatting and Fix Mamba Config

Pull Request - State: closed - Opened by Quentin-Anthony 13 days ago

#1269 - LayerNorm Refactor

Pull Request - State: closed - Opened by aurelion-source 15 days ago - 3 comments

#1268 - Allow training without knowing num_iters

Issue - State: open - Opened by StellaAthena 15 days ago - 1 comment
Labels: feature request

#1267 - Add assert to check for missing tokenizer_type in config. [#1231]

Pull Request - State: closed - Opened by AI-WAIFU 17 days ago - 1 comment

#1266 - Add initial ring flash attention support

Pull Request - State: open - Opened by dmahan93 17 days ago - 1 comment

#1265 - add Apex fused RMS norm

Pull Request - State: closed - Opened by dmahan93 23 days ago - 1 comment

#1264 - Frontier

Pull Request - State: closed - Opened by jahatef 28 days ago - 1 comment

#1263 - Improve performance of sequence parallel gather, scatter, and reduce

Pull Request - State: closed - Opened by bclyang about 1 month ago

#1262 - mamba fixes and cleaning

Pull Request - State: closed - Opened by jahatef about 1 month ago - 2 comments

#1261 - Comet integration

Pull Request - State: closed - Opened by jverre about 1 month ago - 2 comments

#1260 - Fix gather and reduce scatter ops on sequence dimension

Pull Request - State: closed - Opened by bclyang about 1 month ago

#1259 - Fix LayerNorm all reduce gradient hook

Pull Request - State: closed - Opened by bclyang about 1 month ago - 1 comment

#1257 - Megatron-LM style Sequence Parallel

Pull Request - State: closed - Opened by haileyschoelkopf about 1 month ago - 3 comments

#1256 - GitHub actions fix

Pull Request - State: closed - Opened by jahatef about 2 months ago

#1255 - Add new cites

Pull Request - State: closed - Opened by StellaAthena about 2 months ago - 1 comment

#1254 - How to Load Model from pytorch_model.bin into Trained Model for Text Generation?

Issue - State: open - Opened by lieh1203 2 months ago
Labels: feature request

#1253 - what's the biggest dataset you've tried?

Issue - State: open - Opened by exnx 2 months ago
Labels: bug

#1252 - too many .bin files for dataloader, crashed

Issue - State: closed - Opened by exnx 2 months ago
Labels: bug

#1251 - Assertion Error when Setting pipe_parallel_size or model_parallel_size in GPT-NeoX

Issue - State: open - Opened by lieh1203 2 months ago - 3 comments
Labels: bug

#1250 - For nucleus sampling, top-p sampling appears to happen on the softmax-normalized top-k logits

Issue - State: closed - Opened by j-frei 3 months ago - 3 comments
Labels: bug

#1248 - batch_input and elapsed time per iteration suddenly slow down during model training

Issue - State: open - Opened by Yuhanleeee 3 months ago - 4 comments
Labels: bug

#1247 - Add hf llama to neox conversion

Pull Request - State: closed - Opened by dmahan93 3 months ago - 1 comment

#1246 - Add Reward Model training

Pull Request - State: closed - Opened by dmahan93 3 months ago

#1245 - Conversion for CI from self-hosted hardware

Pull Request - State: closed - Opened by jaimemcc-intel 3 months ago

#1244 - Add KTO training

Pull Request - State: closed - Opened by dmahan93 3 months ago

#1243 - Replace unsafe `pyyaml` loader with `SafeLoader` (#2)

Pull Request - State: closed - Opened by pixeeai 3 months ago - 1 comment

#1242 - Add DPO training

Pull Request - State: closed - Opened by dmahan93 3 months ago - 1 comment

#1241 - Fix paper reference in init_functions.py

Pull Request - State: closed - Opened by rasbt 3 months ago - 2 comments

#1239 - Add a chat data preprocessing script

Pull Request - State: closed - Opened by dmahan93 3 months ago

#1238 - Pr1212

Pull Request - State: closed - Opened by jahatef 3 months ago

#1237 - Add tensor parallelism for RWKV

Pull Request - State: open - Opened by jahatef 3 months ago

#1236 - Ville dev

Pull Request - State: closed - Opened by Vmjkom 3 months ago - 1 comment

#1235 - Add Transformer Engine's version of RMSNorm and LayerNorm

Pull Request - State: closed - Opened by lintangsutawika 3 months ago - 2 comments

#1234 - fix python version and pytest install

Pull Request - State: closed - Opened by jahatef 4 months ago - 5 comments

#1233 - add workflow_dispatch to gh actions pr so we can run on command

Pull Request - State: closed - Opened by jahatef 4 months ago

#1232 - init changes to README

Pull Request - State: closed - Opened by jaimemcc-intel 4 months ago

#1231 - Cannot convert neox model to HF

Issue - State: open - Opened by srivassid 4 months ago - 2 comments
Labels: bug

#1230 - How to set the ffn hidden size parameter in gpt neox

Issue - State: closed - Opened by IronMan-WangJinxi 4 months ago - 2 comments
Labels: feature request

#1228 - Cannot perform inference, be it unconditional, input-file or interactive

Issue - State: closed - Opened by srivassid 4 months ago - 1 comment
Labels: bug

#1226 - Add Torch Profiler Support

Pull Request - State: closed - Opened by DayOfThePenguin 4 months ago

#1225 - Add lora support

Pull Request - State: open - Opened by mkerin 4 months ago

#1224 - fixed fused_rope naming in JIT + Readme

Pull Request - State: closed - Opened by R0n12 4 months ago

#1223 - Change python invocation syntax

Pull Request - State: closed - Opened by jaimemcc-intel 4 months ago

#1222 - Small tidying

Pull Request - State: closed - Opened by yang 4 months ago

#1221 - Rwkv pipeline parallelism

Pull Request - State: closed - Opened by jahatef 4 months ago - 1 comment

#1220 - fix conversion of hf -> neox for pythia in model parallel

Pull Request - State: closed - Opened by dmahan93 4 months ago

#1219 - Fix changed behavior of pipe_parallel

Pull Request - State: closed - Opened by yang 4 months ago

#1218 - Conversion script bugfixes

Pull Request - State: closed - Opened by haileyschoelkopf 4 months ago - 3 comments

#1217 - Fix markdown formatting error

Pull Request - State: closed - Opened by StellaAthena 4 months ago

#1216 - Run document update again

Pull Request - State: closed - Opened by jahatef 4 months ago

#1215 - fixed typos

Pull Request - State: closed - Opened by jahatef 4 months ago

#1214 - fix pipeline parallelism detection

Pull Request - State: closed - Opened by dmahan93 4 months ago - 2 comments

#1213 - Add Transformer Engine

Pull Request - State: open - Opened by Quentin-Anthony 4 months ago - 1 comment

#1212 - Add `intermediate_size` to GPT-NeoX models

Pull Request - State: closed - Opened by dtamayo-nlp 4 months ago - 5 comments

#1211 - Bump jinja2 from 3.1.3 to 3.1.4 in /requirements

Pull Request - State: closed - Opened by dependabot[bot] 5 months ago
Labels: dependencies

#1210 - Dmoe integration

Pull Request - State: open - Opened by DayOfThePenguin 5 months ago

#1209 - Fix bug in tools/ckpts/convert_neox_to_hf.py for setting intermediate_size

Pull Request - State: closed - Opened by jvendrow 5 months ago - 2 comments

#1201 - Create cmake-multi-platform.yml

Pull Request - State: closed - Opened by Romario242003 5 months ago - 2 comments

#1198 - add rwkv support

Pull Request - State: closed - Opened by jahatef 6 months ago - 2 comments
Labels: merge-queue

#1197 - Megablocks-based MoE

Pull Request - State: closed - Opened by DayOfThePenguin 6 months ago - 1 comment

#1194 - Added infinite lr schedules

Pull Request - State: open - Opened by kshitijkg 6 months ago
Labels: merge-queue

#1192 - Add megablocks dropless MoE

Pull Request - State: closed - Opened by yang 6 months ago

#1188 - [AMD] Supporting fused kernels build using JIT

Pull Request - State: closed - Opened by R0n12 6 months ago - 2 comments

#1177 - Remove unused requirements-sparseattention

Pull Request - State: closed - Opened by segyges 7 months ago - 2 comments

#1167 - Add Basic RWKV Block to GPT-NeoX

Issue - State: closed - Opened by Quentin-Anthony 7 months ago - 1 comment
Labels: feature request

#1156 - Fused kernel support for AMD (using JIT)

Pull Request - State: closed - Opened by R0n12 7 months ago - 3 comments

#1139 - Better run_eval_harness import

Pull Request - State: closed - Opened by R0n12 8 months ago - 1 comment

#1119 - Create Singularity Container

Issue - State: open - Opened by Quentin-Anthony 8 months ago - 3 comments
Labels: feature request, good first issue, help wanted

#1088 - Finetune

Issue - State: closed - Opened by liuxinxin123 10 months ago - 4 comments
Labels: feature request

#1084 - Support for DeepSpeed Ulysses (SP)

Pull Request - State: closed - Opened by Quentin-Anthony 10 months ago - 1 comment

#1078 - Port DeepSpeed Ulysses

Issue - State: closed - Opened by Quentin-Anthony 10 months ago - 2 comments
Labels: feature request

#979 - Dataload fix

Pull Request - State: closed - Opened by jahatef over 1 year ago - 2 comments

#878 - Deepspeed benchmarking

Pull Request - State: open - Opened by cr458 over 1 year ago - 1 comment

#812 - Add support for sequence parallelism

Issue - State: closed - Opened by Quentin-Anthony over 1 year ago - 12 comments
Labels: feature request, help wanted

#677 - MoE Support

Pull Request - State: closed - Opened by Quentin-Anthony about 2 years ago - 1 comment

#645 - RuntimeError: Error(s) in loading state_dict for EmbeddingPipe: size mismatch for word_embeddings.weight

Issue - State: open - Opened by mcao516 about 2 years ago - 9 comments
Labels: bug, good first issue, help wanted