Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / mosaicml/llm-foundry issues and pull requests
#1196 - Pass FC type along for all FFN types
Pull Request -
State: closed - Opened by dakinggg about 2 months ago
#1195 - Streaming version bump to 0.7.6
Pull Request -
State: closed - Opened by snarayan21 about 2 months ago
#1194 - Log exception on inactivity callback
Pull Request -
State: closed - Opened by jjanezhang about 2 months ago
#1193 - fix eval
Pull Request -
State: closed - Opened by milocress about 2 months ago
#1192 - Add te
Pull Request -
State: closed - Opened by j316chuck about 2 months ago
#1191 - test te once more
Pull Request -
State: closed - Opened by j316chuck about 2 months ago
#1190 - Remove to_container
Pull Request -
State: closed - Opened by dakinggg about 2 months ago
#1187 - Set ft dataloader name explicitly
Pull Request -
State: closed - Opened by milocress about 2 months ago
#1184 - add callback
Pull Request -
State: closed - Opened by dakinggg about 2 months ago
#1183 - Train with attention mask
Issue -
State: open - Opened by germanjke about 2 months ago
- 1 comment
#1182 - Refactoring attention
Pull Request -
State: open - Opened by ShashankMosaicML about 2 months ago
#1181 - Bump version v0.9.0.dev0
Pull Request -
State: closed - Opened by milocress about 2 months ago
- 1 comment
#1180 - Possibility of training with hostname instead IP
Issue -
State: open - Opened by germanjke about 2 months ago
- 1 comment
#1179 - log eval dataset misconfiguration
Pull Request -
State: closed - Opened by milocress about 2 months ago
- 1 comment
#1178 - error on misconfigured icl
Pull Request -
State: closed - Opened by milocress about 2 months ago
#1177 - Fix config access for DBRX
Pull Request -
State: closed - Opened by dakinggg about 2 months ago
#1176 - Add state space attention
Pull Request -
State: closed - Opened by mvpatel2000 about 2 months ago
#1175 - Add foundry te torch 2 1
Pull Request -
State: closed - Opened by j316chuck about 2 months ago
#1174 - Add State Space Models / Mamba Layer Support
Issue -
State: closed - Opened by devin-ai-integration[bot] about 2 months ago
#1173 - [TE][Install] Test te install one more time
Pull Request -
State: closed - Opened by j316chuck about 2 months ago
#1172 - Speedup add foundry te docker no dep
Pull Request -
State: open - Opened by j316chuck about 2 months ago
#1171 - Try add te image again
Pull Request -
State: closed - Opened by j316chuck about 2 months ago
#1170 - minor fix to `llmfoundry.data.utils.get_text_collator`
Pull Request -
State: closed - Opened by ShashankMosaicML about 2 months ago
#1169 - Fix import and mocking
Pull Request -
State: closed - Opened by dakinggg about 2 months ago
#1168 - Add foundry te no deps
Pull Request -
State: open - Opened by j316chuck about 2 months ago
#1167 - Add TE Docker image
Pull Request -
State: closed - Opened by j316chuck about 2 months ago
#1166 - Migrate eval output logging to foundry
Pull Request -
State: closed - Opened by maxisawesome about 2 months ago
#1165 - refactoring dataloader into registries.
Pull Request -
State: closed - Opened by ShashankMosaicML about 2 months ago
#1164 - fix dep group in torch 2.3 ci
Pull Request -
State: closed - Opened by dakinggg about 2 months ago
#1163 - Depend on coverage
Pull Request -
State: closed - Opened by milocress about 2 months ago
#1162 - Uncomment GPU tests
Pull Request -
State: closed - Opened by milocress 2 months ago
#1161 - Add line splitting and other linting
Pull Request -
State: closed - Opened by b-chu 2 months ago
- 1 comment
#1160 - Bump composer version to 0.22.0
Pull Request -
State: closed - Opened by snarayan21 2 months ago
#1159 - retry loop on chunked encoding error
Pull Request -
State: closed - Opened by milocress 2 months ago
- 1 comment
#1158 - Bump composer version
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1157 - Move sentencepiece import
Pull Request -
State: closed - Opened by aspfohl 2 months ago
#1156 - Fix yaml lint
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1155 - Comment out 2.3 tests
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1154 - Only create remote_ud on global rank 0 in HF checkpointer
Pull Request -
State: closed - Opened by irenedea 2 months ago
- 1 comment
#1153 - Observing 1/2 the throughput on AMD MI250
Issue -
State: closed - Opened by staghado 2 months ago
- 4 comments
Labels: bug
#1152 - Bump min torch version to 2.3.0
Pull Request -
State: closed - Opened by dakinggg 2 months ago
- 1 comment
#1151 - Torch 2.3 upgrade Part 2 - CI
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1150 - Bump flash attention version
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1149 - Torch 2.3 part 1 - build the images
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1148 - Remove olmo as a dependency
Pull Request -
State: closed - Opened by snarayan21 2 months ago
#1147 - build inner model
Pull Request -
State: closed - Opened by milocress 2 months ago
#1146 - Fix typos in callbacks with configs
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1145 - Upgrade ci-testing
Pull Request -
State: closed - Opened by mvpatel2000 2 months ago
#1144 - fix DatasetConstants.splints default value to protect dictionary overwriting
Pull Request -
State: closed - Opened by ivan-kud 2 months ago
#1143 - Add new FT instructions
Pull Request -
State: closed - Opened by b-chu 2 months ago
#1142 - HF path to DBFS
Pull Request -
State: closed - Opened by KuuCi 2 months ago
- 1 comment
#1141 - Opt-3b Pretrain YAML config failing with mosaicml/llm-foundry/2.2.1_cu121_flash2-4aef5de docker
Issue -
State: closed - Opened by bhavnicksm 2 months ago
- 1 comment
Labels: bug
#1140 - Barrier immediately after initialize dist with logs
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1139 - Revert "First initialize dist with gloo (#1133)"
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1138 - Bump datasets version
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1137 - Test hang
Pull Request -
State: open - Opened by irenedea 2 months ago
- 1 comment
#1136 - Add a timeout for dataset filtering
Pull Request -
State: closed - Opened by irenedea 2 months ago
- 1 comment
#1135 - Set default start method to spawn for multiprocessing
Pull Request -
State: closed - Opened by irenedea 2 months ago
- 1 comment
#1134 - Fix saving of generation_config for Llama-3
Pull Request -
State: closed - Opened by eldarkurtic 2 months ago
#1133 - First initialize dist with gloo
Pull Request -
State: closed - Opened by dakinggg 2 months ago
- 1 comment
#1132 - Use multiprocessing to add a timeout to dataset filtering
Pull Request -
State: closed - Opened by irenedea 2 months ago
#1131 - Strict key checking for dataset
Pull Request -
State: closed - Opened by b-chu 2 months ago
#1130 - Add llama3 instructions to llama2 yaml
Pull Request -
State: closed - Opened by b-chu 2 months ago
#1129 - Fix deprecation versions
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1128 - Clean up the publicly exported API
Pull Request -
State: closed - Opened by dakinggg 2 months ago
- 4 comments
#1127 - Permit 2.3 nightlies
Pull Request -
State: closed - Opened by dakinggg 2 months ago
- 1 comment
#1126 - Change main to a dev version
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1125 - Fix HF checkpointer + mlflow bugs
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1124 - Pin mlflow
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1123 - catch misconfigured hf dataset
Pull Request -
State: closed - Opened by milocress 2 months ago
#1122 - Bump Composer to 0.21.3
Pull Request -
State: closed - Opened by b-chu 2 months ago
#1121 - Add option for subclasses to convert model and tokenizer in hf checkpointer
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1120 - Bump transformers version
Pull Request -
State: closed - Opened by b-chu 2 months ago
#1119 - Mlflow datasets
Pull Request -
State: closed - Opened by KuuCi 2 months ago
#1118 - Bump transformers to 4.40
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1117 - Update tests to not rely on mistral
Pull Request -
State: closed - Opened by dakinggg 2 months ago
#1116 - Is there a way to figure out what dependencies are installed in the docker image?
Issue -
State: closed - Opened by sc-gr 3 months ago
- 1 comment
Labels: question
#1115 - Add FreebaseQA to tasks and gauntlet
Pull Request -
State: open - Opened by moeiniamir 3 months ago
- 2 comments
#1114 - add `.json` to SUPPORTED_EXTENSIONS
Pull Request -
State: closed - Opened by eitanturok 3 months ago
- 3 comments
#1113 - Add missing init file
Pull Request -
State: closed - Opened by dakinggg 3 months ago
#1112 - rm new_group todo
Pull Request -
State: closed - Opened by vchiley 3 months ago
#1111 - Revert "Update config_moe_args.py"
Pull Request -
State: closed - Opened by vchiley 3 months ago
#1110 - Update JSONL sources in eval README
Pull Request -
State: closed - Opened by emmanuel-ferdman 3 months ago
#1109 - new python file to upload mosaic-bert to hf
Pull Request -
State: open - Opened by Patchwork53 3 months ago
#1108 - Dbrx finetune yaml requires save folder specified to enable autoresume
Pull Request -
State: closed - Opened by mvpatel2000 3 months ago
#1107 - Fix overwriting FP8 act ckpt flag in the train script
Pull Request -
State: closed - Opened by cli99 3 months ago
#1106 - Add remote code option to allow execution of DBRX tokenizer
Pull Request -
State: closed - Opened by b-chu 3 months ago
#1105 - Fine-tune dbrx-instruct on a single VM with 8 H100s
Issue -
State: open - Opened by classicboyir 3 months ago
- 1 comment
Labels: question
#1104 - Update config_moe_args.py
Pull Request -
State: closed - Opened by vchiley 3 months ago
#1103 - Updating the streaming version in setup.py
Pull Request -
State: closed - Opened by ShashankMosaicML 3 months ago
- 3 comments
#1102 - MegaBlocks release
Pull Request -
State: closed - Opened by mvpatel2000 3 months ago
#1101 - Remove torch compile from GLU
Pull Request -
State: closed - Opened by josejg 3 months ago
#1100 - fixing evaluator microbatch size
Pull Request -
State: closed - Opened by ShashankMosaicML 3 months ago
#1099 - Add sam
Pull Request -
State: closed - Opened by Joqsan 3 months ago
#1098 - Support ShareGPT chat format
Pull Request -
State: closed - Opened by samhavens 3 months ago
- 2 comments
#1097 - Update yamls for 0.7.0
Pull Request -
State: closed - Opened by dakinggg 3 months ago
#1096 - Param init registry
Pull Request -
State: closed - Opened by dakinggg 3 months ago
- 3 comments
#1095 - FFN layer registry
Pull Request -
State: closed - Opened by dakinggg 3 months ago
- 4 comments
#1094 - Attention layer registry
Pull Request -
State: closed - Opened by dakinggg 3 months ago
- 2 comments
#1093 - FC layer registry
Pull Request -
State: closed - Opened by dakinggg 3 months ago
- 2 comments