Ecosyste.ms: Issues
An open API service for providing issue and pull request metadata for open source projects.
GitHub / mosaicml/llm-foundry issues and pull requests
#1296 - Fix grad accum typing
Pull Request -
State: closed - Opened by dakinggg 3 months ago
#1295 - [Do Not Merge] Test patch
Pull Request -
State: closed - Opened by mvpatel2000 3 months ago
#1294 - Bump min composer version to 0.23.3
Pull Request -
State: closed - Opened by dakinggg 3 months ago
#1293 - Small refactor for update batch size
Pull Request -
State: closed - Opened by dakinggg 3 months ago
#1292 - Removing logging exception through update run metadata
Pull Request -
State: closed - Opened by jjanezhang 3 months ago
- 1 comment
#1291 - Unable to use self developed pre-trained model for finetuning in MosaicML
Issue -
State: open - Opened by sauravgrd 3 months ago
- 1 comment
Labels: question
#1290 - Extendability refactors
Pull Request -
State: closed - Opened by dakinggg 3 months ago
- 1 comment
#1289 - MPT training with ALiBi and Flash Attention 2
Issue -
State: open - Opened by rickgit16 3 months ago
- 3 comments
Labels: question
#1288 - Add TE to setup
Pull Request -
State: closed - Opened by j316chuck 3 months ago
#1287 - Add missing dependency group
Pull Request -
State: closed - Opened by dakinggg 4 months ago
- 1 comment
#1286 - Fix backwards compatibility for ICL arg
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1285 - Bump mlflow to 2.13.2
Pull Request -
State: closed - Opened by KuuCi 4 months ago
- 1 comment
#1283 - Try to fix eval_output_logging_callback.py
Pull Request -
State: open - Opened by sjawhar 4 months ago
- 6 comments
#1280 - Fix TE HF checkpoint saving
Pull Request -
State: closed - Opened by j316chuck 4 months ago
- 1 comment
#1277 - Fix packing + streaming + resumption
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1276 - Allow multiprocessing when preparing ICL dataset
Issue -
State: open - Opened by sanjari-orb 4 months ago
- 8 comments
Labels: enhancement
#1274 - Update Dockerfile
Pull Request -
State: open - Opened by j316chuck 4 months ago
#1273 - Update Dockerfile with TE main
Pull Request -
State: closed - Opened by j316chuck 4 months ago
- 2 comments
#1272 - Managing Timeout on Training Errors and Simultaneous Restart of All Nodes in LLM Foundry
Issue -
State: closed - Opened by germanjke 4 months ago
- 1 comment
#1271 - Why is there a warmup in hf_generate.py?
Issue -
State: open - Opened by palash04 4 months ago
Labels: question
#1270 - fix linting
Pull Request -
State: closed - Opened by milocress 4 months ago
#1269 - Bump Composer to version 0.23.2
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1268 - Revert "Bump Composer to 0.23.0 (#1259)"
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1267 - Revert to older TE version
Pull Request -
State: closed - Opened by mvpatel2000 4 months ago
- 1 comment
#1266 - Revert "Update TE Dockerfile (#1265)"
Pull Request -
State: closed - Opened by j316chuck 4 months ago
#1265 - Update TE Dockerfile
Pull Request -
State: closed - Opened by j316chuck 4 months ago
#1264 - Fill in the middle
Issue -
State: closed - Opened by germanjke 4 months ago
- 2 comments
Labels: enhancement
#1263 - Fix typo in setup.py
Pull Request -
State: closed - Opened by XiaohanZhangCMU 4 months ago
#1262 - Testing CI
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1261 - How to continue pretrain LLM fp8 with hf_causal_lm
Issue -
State: closed - Opened by YixinSong-e 4 months ago
- 2 comments
Labels: bug
#1260 - added systemMetricsMonitor callback
Pull Request -
State: closed - Opened by JackZ-db 4 months ago
- 1 comment
#1259 - Bump Composer to 0.23.0
Pull Request -
State: closed - Opened by KuuCi 4 months ago
- 1 comment
#1258 - Remove spurious warning
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1257 - Fix MPT HF conversion
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1256 - Add curriculum learning callback
Pull Request -
State: closed - Opened by b-chu 4 months ago
- 3 comments
#1255 - Bump Version to 0.10.0.dev0
Pull Request -
State: closed - Opened by KuuCi 4 months ago
- 2 comments
#1254 - Adding more token encoding types
Pull Request -
State: closed - Opened by snarayan21 4 months ago
- 1 comment
#1253 - fix signal_file_path to avoid race condition
Pull Request -
State: closed - Opened by ofivite 4 months ago
- 13 comments
#1252 - Add registry for ICL datasets
Pull Request -
State: open - Opened by sanjari-orb 4 months ago
- 5 comments
#1251 - Change TE docker image to enable te_shard_weight
Pull Request -
State: closed - Opened by j316chuck 4 months ago
#1250 - Replacing icl_task_type question_answering with generation_task_with_answers in long context eval yamls.
Pull Request -
State: closed - Opened by ShashankMosaicML 4 months ago
#1249 - Testing CI
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1248 - Update CODEOWNERS
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1247 - Add eval_drop_last flag to fix TE eval bug
Pull Request -
State: open - Opened by j316chuck 4 months ago
- 2 comments
#1246 - Wrap `FileNotFound` exceptions in the finetuning dataloader and `convert_text_to_mds`
Pull Request -
State: open - Opened by angel-ruiz7 4 months ago
- 1 comment
#1245 - [MCLOUD-4623] Add more detailed exception when user has uppercase in their example case but could potentially match the exampe type
Pull Request -
State: closed - Opened by shitaoli-db 4 months ago
- 2 comments
#1244 - Fix the error message thrown from dataloader
Pull Request -
State: open - Opened by shitaoli-db 4 months ago
#1243 - Add logging to convert_text_to_mds.py script
Pull Request -
State: closed - Opened by irenedea 4 months ago
#1242 - could you give an elaborated steps about how to run llm-foundry on AMD mi250 devices
Issue -
State: open - Opened by Alice1069 4 months ago
- 1 comment
Labels: bug
#1241 - Make HF conversion automatically add missing imports
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1240 - Chunk file reads and tokenization for text to mds conversion
Pull Request -
State: closed - Opened by irenedea 4 months ago
- 1 comment
#1239 - Make the exceptions serializable
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1238 - Add retries to downloads in convert_text_to_mds.py
Pull Request -
State: closed - Opened by irenedea 4 months ago
- 1 comment
#1237 - [WIP] use_remote_uploader_v2
Pull Request -
State: open - Opened by bigning 4 months ago
#1236 - Configurable submesh
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1235 - Fix tuple typing
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1234 - Move MLFlow dataset outside of log_config
Pull Request -
State: closed - Opened by KuuCi 4 months ago
- 1 comment
#1233 - Fix Mosaic Logger custom exception serialization
Pull Request -
State: closed - Opened by milocress 4 months ago
#1232 - Fixing the state.timestamp.batch.value issue in loss v len callback
Pull Request -
State: closed - Opened by ShashankMosaicML 4 months ago
#1231 - LLaMA PRO training resume problem
Issue -
State: open - Opened by germanjke 4 months ago
- 5 comments
#1230 - Fix attr error for attention_classes when using act ckpt
Pull Request -
State: closed - Opened by cli99 4 months ago
#1229 - Modularize backbone class and block creation
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1228 - Quick patch to check that Dataset Keys contain non-None Values
Pull Request -
State: closed - Opened by KuuCi 4 months ago
- 1 comment
#1227 - Make config/class properties on ComposerMPTForCausalLM
Pull Request -
State: closed - Opened by dakinggg 4 months ago
#1226 - Loss v len callback
Pull Request -
State: closed - Opened by ShashankMosaicML 4 months ago
#1225 - Add user error superclass
Pull Request -
State: closed - Opened by milocress 4 months ago
- 2 comments
#1224 - Modularize components of megablocks layer builder
Pull Request -
State: closed - Opened by dakinggg 4 months ago
- 3 comments
#1223 - Decompression tokens
Pull Request -
State: closed - Opened by milocress 4 months ago
#1222 - add error when chat template fails
Pull Request -
State: closed - Opened by milocress 4 months ago
#1221 - Finetuning does not work on nightly
Issue -
State: closed - Opened by eldarkurtic 5 months ago
- 2 comments
Labels: bug
#1220 - Conversion Sharded -> Monolithic checkpoint
Issue -
State: open - Opened by pretidav 5 months ago
- 1 comment
Labels: question
#1219 - Update readme to clarify flash-attn and TE installs
Pull Request -
State: closed - Opened by snarayan21 5 months ago
#1218 - Add example eval scripts for dbrx PT sizes
Pull Request -
State: closed - Opened by aspfohl 5 months ago
- 2 comments
#1217 - Add te for torch 2.4.0
Pull Request -
State: closed - Opened by j316chuck 5 months ago
#1216 - Fix dmoe tests GPU OOM
Pull Request -
State: closed - Opened by snarayan21 5 months ago
#1215 - Update Dockerfile
Pull Request -
State: closed - Opened by j316chuck 5 months ago
#1214 - Dbfs HF
Pull Request -
State: open - Opened by KuuCi 5 months ago
- 3 comments
#1213 - Removed debugging code in tests
Pull Request -
State: closed - Opened by dakinggg 5 months ago
#1212 - Adding HF source Path in for DBFS
Pull Request -
State: closed - Opened by KuuCi 5 months ago
#1211 - Using self.shift_labels instead of self.model.transformer.shift_label in the loss function.
Pull Request -
State: closed - Opened by ShashankMosaicML 5 months ago
#1210 - Added torch_dmoe defaults, bug fixes for 2D inputs
Pull Request -
State: closed - Opened by snarayan21 5 months ago
- 1 comment
#1209 - Add fc to HF export
Pull Request -
State: closed - Opened by dakinggg 5 months ago
#1208 - Update setup.py
Pull Request -
State: closed - Opened by j316chuck 5 months ago
#1207 - Chuck/gpu build te win
Pull Request -
State: closed - Opened by j316chuck 5 months ago
- 1 comment
#1206 - Build Te
Pull Request -
State: closed - Opened by j316chuck 5 months ago
#1205 - Mvpatel2000/te image stable
Pull Request -
State: closed - Opened by mvpatel2000 5 months ago
#1204 - TransformerEngine Image Build
Pull Request -
State: closed - Opened by mvpatel2000 5 months ago
#1203 - [don't merge lol] test pyhook
Pull Request -
State: closed - Opened by milocress 5 months ago
#1202 - Clearer error message for unknown example type
Pull Request -
State: closed - Opened by milocress 5 months ago
#1201 - Make `fc_type` a dict to pass fc kwargs through
Pull Request -
State: closed - Opened by snarayan21 5 months ago
- 1 comment
#1200 - commit change
Pull Request -
State: closed - Opened by j316chuck 5 months ago
#1199 - Allow EOS token for finetuning
Pull Request -
State: closed - Opened by jimwu6 5 months ago
- 3 comments
#1198 - Removing rich install
Pull Request -
State: closed - Opened by jjanezhang 5 months ago
#1197 - MoE with FSDP
Issue -
State: closed - Opened by Muennighoff 5 months ago
- 1 comment
Labels: question
#1196 - Pass FC type along for all FFN types
Pull Request -
State: closed - Opened by dakinggg 5 months ago
#1195 - Streaming version bump to 0.7.6
Pull Request -
State: closed - Opened by snarayan21 5 months ago
#1194 - Log exception on inactivity callback
Pull Request -
State: closed - Opened by jjanezhang 5 months ago
#1193 - fix eval
Pull Request -
State: closed - Opened by milocress 5 months ago
#1192 - Add te
Pull Request -
State: closed - Opened by j316chuck 5 months ago
#1191 - test te once more
Pull Request -
State: closed - Opened by j316chuck 5 months ago