Ecosyste.ms: Issues

An open API service for providing issue and pull request metadata for open source projects.

GitHub / EleutherAI/pythia issues and pull requests

#176 - Update batch_viewer docs to accurately reflect data indexing

Pull Request - State: open - Opened by jeffreygwang about 1 month ago - 1 comment

#175 - No EOD Tokens in EleutherAI/pile-deduped-pythia-preshuffled

Issue - State: open - Opened by markschoene about 1 month ago - 1 comment

#174 - Iclr

Pull Request - State: closed - Opened by sunnyddelight about 1 month ago - 1 comment

#172 - Questions regarding the WSC evaluation results

Issue - State: open - Opened by mutiann about 2 months ago

#171 - Clarification of Pythia Deduped Precision - bf16 or fp16?

Issue - State: closed - Opened by RylanSchaeffer 2 months ago - 1 comment

#170 - Update README.md

Pull Request - State: open - Opened by MeDott29 3 months ago - 1 comment

#168 - Refactoring

Pull Request - State: closed - Opened by sunnyddelight 4 months ago - 1 comment

#166 - make README easier to follow

Pull Request - State: open - Opened by Arvid-pku 5 months ago - 1 comment

#165 - Issue while showering NLO events with NLO

Issue - State: open - Opened by rash-eng 5 months ago - 4 comments

#163 - cache_dir cannot be the same as model name

Issue - State: open - Opened by arunasank 5 months ago

#162 - Pythia 12b flash config

Issue - State: open - Opened by jvendrow 6 months ago

#161 - Sparse

Pull Request - State: closed - Opened by sunnyddelight 6 months ago - 1 comment

#160 - how to use Pythia

Issue - State: open - Opened by gaohang 6 months ago

#159 - Convert to GGUF

Issue - State: open - Opened by yanxon 6 months ago

#158 - Reshape error in batch viewer

Issue - State: closed - Opened by activatedgeek 6 months ago - 1 comment

#157 - Update README.md

Pull Request - State: closed - Opened by borgr 7 months ago - 1 comment

#156 - tokenizer.pad_token

Issue - State: open - Opened by vincent317 7 months ago - 1 comment

#155 - instruct-tuned pythia

Issue - State: open - Opened by WilliamsToTo 8 months ago

#154 - Correct link to huggingface

Pull Request - State: closed - Opened by l-ma 8 months ago - 1 comment

#152 - Optimizer states in HF format

Issue - State: open - Opened by seyuboglu 8 months ago - 1 comment

#151 - Weird inconsistency in Tokenizer vocabulary

Issue - State: open - Opened by javirandor 8 months ago - 1 comment

#150 - Is there existing code to resume training from specific checkpoint?

Issue - State: closed - Opened by javirandor 9 months ago - 1 comment

#149 - "gas" configuration doesn't do anything

Issue - State: open - Opened by segyges 9 months ago

#148 - Adding _warmup_mmap_file function missing from MMapIndexedDataset

Pull Request - State: closed - Opened by rdiehlmartinez 9 months ago - 1 comment

#147 - Add training loss data

Pull Request - State: open - Opened by pietrolesci 10 months ago - 1 comment

#146 - Update README.md

Pull Request - State: open - Opened by speed1313 10 months ago - 1 comment

#143 - Fix ToC

Pull Request - State: closed - Opened by osanseviero 11 months ago - 1 comment

#142 - Details about "EleutherAI/pythia-160m-seed*" models

Issue - State: closed - Opened by IanMagnusson 11 months ago - 3 comments

#141 - Missing / undownloadable checkpoints on huggingface

Issue - State: closed - Opened by mirandrom 11 months ago - 3 comments

#140 - .

Issue - State: closed - Opened by ParthaKrPaul 11 months ago - 2 comments

#139 - Wrong files in eval?

Issue - State: open - Opened by borgr 12 months ago

#138 - Pytia or GPT-neox?

Issue - State: closed - Opened by borgr 12 months ago - 1 comment

#137 - Deduplicated Pile dataset with Domain Attribution

Issue - State: closed - Opened by michaelduan8 12 months ago

#136 - Replicating the Training Data Order

Issue - State: closed - Opened by prakharg24 12 months ago - 1 comment

#135 - Inconsistent init methods of pythia-6.9b model

Issue - State: open - Opened by mqyqlx 12 months ago - 2 comments

#134 - Update README.md

Pull Request - State: closed - Opened by segyges 12 months ago

#133 - Add checksum for data from huggingface

Pull Request - State: closed - Opened by segyges about 1 year ago - 1 comment

#132 - The value of weight decay

Issue - State: closed - Opened by yehuitang about 1 year ago - 1 comment

#131 - Update requirements.txt

Pull Request - State: closed - Opened by segyges about 1 year ago

#130 - Typos in readme.md

Pull Request - State: closed - Opened by segyges about 1 year ago - 1 comment

#129 - Model Initialization Question

Issue - State: closed - Opened by yanlai00 about 1 year ago - 1 comment

#128 - Update readme to load preshuffled datasets

Pull Request - State: closed - Opened by uSaiPrashanth about 1 year ago

#127 - Has the data been shuffled?

Issue - State: open - Opened by Lisennlp about 1 year ago - 2 comments

#126 - Reading data is slowly!

Issue - State: open - Opened by Lisennlp about 1 year ago - 1 comment

#125 - Automatically calculate shard size

Pull Request - State: closed - Opened by uSaiPrashanth about 1 year ago

#124 - Automatically determine shard size

Pull Request - State: closed - Opened by uSaiPrashanth about 1 year ago - 1 comment

#123 - Batch Viewer : Why Sequence Length 2049?

Issue - State: closed - Opened by prakharg24 about 1 year ago - 15 comments

#122 - The performance about pythia and LLaMA model architecture

Issue - State: closed - Opened by peiyingxin about 1 year ago - 1 comment

#121 - Any results on the validation set?

Issue - State: open - Opened by chujiezheng about 1 year ago - 1 comment

#120 - README Update

Pull Request - State: closed - Opened by StellaAthena about 1 year ago - 1 comment

#119 - Update README.md

Pull Request - State: closed - Opened by StellaAthena about 1 year ago

#118 - Mismatch about the evaluation results

Issue - State: closed - Opened by yuzc19 about 1 year ago - 11 comments

#117 - Weights tying

Issue - State: closed - Opened by link-er about 1 year ago - 1 comment

#116 - Convert the huggingface checkpoint to GPT-Neox checkpoint

Issue - State: closed - Opened by ZhiYuanZeng over 1 year ago - 2 comments

#114 - Error when running unshard_memmap.py

Issue - State: closed - Opened by ShaneeyS over 1 year ago - 2 comments

#113 - Can I provide custom data and continue training Pythia on this new data?

Issue - State: closed - Opened by GeorgiAngelov over 1 year ago - 1 comment

#112 - Difference between LFS and HuggingFace datasets?

Issue - State: closed - Opened by eric-mitchell over 1 year ago - 1 comment

#111 - Batch viewer

Pull Request - State: closed - Opened by uSaiPrashanth over 1 year ago

#109 - Update documentation for installing `batch_viewer.py` deps

Pull Request - State: closed - Opened by haileyschoelkopf over 1 year ago

#108 - Possible error in Pythia-12B-deduped step 32000

Issue - State: closed - Opened by smahdavi4 over 1 year ago - 2 comments

#107 - pythia-12b checkpoints missing on HuggingFace for step4000 and step32000

Issue - State: closed - Opened by byungdoh over 1 year ago - 2 comments

#105 - Draft new repo structure

Pull Request - State: closed - Opened by haileyschoelkopf over 1 year ago - 2 comments

#104 - Add Memorization Evals to repo

Pull Request - State: closed - Opened by uSaiPrashanth over 1 year ago - 1 comment

#103 - Added instructions for reproducing a Pythia training

Pull Request - State: closed - Opened by BaruchG over 1 year ago - 1 comment

#102 - Train/valid/test split

Issue - State: closed - Opened by choidami over 1 year ago - 1 comment

#101 - release of checkpoints of different steps

Issue - State: closed - Opened by TobiasLee over 1 year ago - 5 comments

#100 - Ensure flash attention in configs

Pull Request - State: closed - Opened by haileyschoelkopf over 1 year ago

#99 - Revamp experiment organization and migrate code when necessary

Issue - State: closed - Opened by StellaAthena over 1 year ago
Labels: documentation

#98 - Will memorization experimental codes be released?

Issue - State: closed - Opened by chujiezheng over 1 year ago - 2 comments

#97 - the loss of pythia training

Issue - State: closed - Opened by Wangpeiyi9979 over 1 year ago - 3 comments

#96 - Fine-tuning recommendations

Issue - State: closed - Opened by RainIwakura over 1 year ago - 2 comments

#95 - Update License

Pull Request - State: closed - Opened by StellaAthena over 1 year ago - 1 comment

#94 - Pythia 6.9B Model Missing Checkpoint

Issue - State: closed - Opened by chujiezheng over 1 year ago - 1 comment

#93 - Update README.md to remove work-in-progress disclaimer

Pull Request - State: closed - Opened by haileyschoelkopf over 1 year ago - 1 comment

#92 - Is there an access to the deduplicated version of the data with meta info?

Issue - State: closed - Opened by Jason3900 over 1 year ago - 6 comments

#91 - Add a citation to Readme

Pull Request - State: closed - Opened by haileyschoelkopf over 1 year ago

#90 - Cleanup old files

Pull Request - State: closed - Opened by haileyschoelkopf over 1 year ago

#89 - Fine tune for text generation on custom data

Issue - State: closed - Opened by samarthsarin over 1 year ago - 1 comment

#88 - Add paper to README

Pull Request - State: closed - Opened by Quentin-Anthony over 1 year ago

#87 - Training time or approximation of TFLOPs?

Issue - State: closed - Opened by zetian1025 over 1 year ago - 2 comments

#86 - training logs

Issue - State: closed - Opened by stjaco over 1 year ago - 1 comment

#84 - Update README.md

Pull Request - State: closed - Opened by eltociear over 1 year ago

#83 - Weights of "step0" and "step1" checkpoints are identical for all pythia models

Issue - State: closed - Opened by byungdoh over 1 year ago - 6 comments

#82 - Add changelog section to README

Pull Request - State: closed - Opened by haileyschoelkopf over 1 year ago

#81 - Mistake in readme

Issue - State: closed - Opened by zplizzi over 1 year ago - 1 comment

#80 - reorganize v1.1 eval files

Pull Request - State: closed - Opened by haileyschoelkopf over 1 year ago

#79 - Add more details for reproducing training runs

Issue - State: closed - Opened by zplizzi over 1 year ago - 5 comments

#78 - Crowspairs Old with More Steps

Pull Request - State: closed - Opened by aflah02 over 1 year ago - 2 comments