pytorch/torchtune issues and pull requests

#931 - Llama-3 Inference and Uploading to Huggingface

Issue - State: closed - Opened by fabriceyhc 10 months ago - 19 comments

#930 - lm harness distributed evaluation?

Issue - State: open - Opened by monk1337 10 months ago - 3 comments
Labels: enhancement

#929 - Add recipe test for llama3

Pull Request - State: closed - Opened by SLR722 10 months ago - 2 comments
Labels: CLA Signed

#928 - [WIP] Free generation for evals

Pull Request - State: closed - Opened by joecummings 10 months ago - 1 comment
Labels: CLA Signed

#925 - add llama-70B memory and perf numbers to the README table

Issue - State: closed - Opened by soumith 10 months ago - 3 comments

#922 - How I can find all the checkpoints and merge it manually? (Lora)

Issue - State: closed - Opened by monk1337 10 months ago - 4 comments
Labels: question

#921 - Request support for Eleuther generative task

Issue - State: closed - Opened by ScottHoang 10 months ago - 3 comments

#917 - Exception: Error converting the state dict. ; KeyError: 'tok_embeddings.weight'.

Issue - State: closed - Opened by adityaarun1 10 months ago - 3 comments

#916 - Unify example dataset in configs

Issue - State: closed - Opened by joecummings 10 months ago - 4 comments

#915 - no error when lacking hugging face permissions

Issue - State: closed - Opened by dangbert 10 months ago - 4 comments

#914 - Document model tokenizers

Issue - State: closed - Opened by joecummings 10 months ago
Labels: documentation

#912 - Finish additions needed for Phi-3 Mini 4K

Pull Request - State: open - Opened by joecummings 10 months ago - 1 comment
Labels: CLA Signed

#911 - Loading mistral reward model checkpoints

Pull Request - State: closed - Opened by SalmanMohammadi 10 months ago - 1 comment
Labels: CLA Signed

#910 - New integration - CometLogger

Pull Request - State: closed - Opened by Lothiraldan 10 months ago - 9 comments
Labels: CLA Signed

#909 - enable QLoRA + FSDP2

Pull Request - State: closed - Opened by weifengpy 10 months ago - 3 comments
Labels: CLA Signed

#908 - Remove reference to 2.2.2 as latest stable in README

Pull Request - State: closed - Opened by rohan-varma 10 months ago - 1 comment
Labels: CLA Signed

#907 - Remove usage of LRU cache from peft_utils

Pull Request - State: closed - Opened by ebsmothers 10 months ago - 1 comment
Labels: CLA Signed

#906 - document integration with bitsandbytes?

Issue - State: open - Opened by Titus-von-Koeller 10 months ago - 3 comments
Labels: documentation

#905 - Test directory structure for models doesn't match corresponding implementation files

Issue - State: closed - Opened by ebsmothers 10 months ago - 1 comment
Labels: best practice

#904 - `get_adapter_params` does not free GPU memory

Issue - State: closed - Opened by Optimox 10 months ago - 2 comments

#903 - total_training_steps -> global_step

Pull Request - State: closed - Opened by tcapelle 10 months ago - 1 comment
Labels: CLA Signed

#902 - Remove non-existant objects from `__all__`

Pull Request - State: closed - Opened by vmoens 10 months ago - 1 comment
Labels: CLA Signed

#901 - [Help needed] Impact of padding on causal attention ?

Issue - State: open - Opened by Optimox 10 months ago - 7 comments
Labels: discussion

#900 - Remove recipe_state from eleuther config

Pull Request - State: closed - Opened by rohan-varma 10 months ago - 1 comment
Labels: CLA Signed

#899 - Simplify eleuther eval config

Issue - State: closed - Opened by rohan-varma 10 months ago - 1 comment

#898 - Remove unnecessary system role's index check for llama3

Pull Request - State: open - Opened by musab-mk 10 months ago - 6 comments
Labels: CLA Signed

#897 - Compute grad norm

Pull Request - State: open - Opened by tcapelle 10 months ago - 10 comments
Labels: CLA Signed

#896 - Add documentation information

Pull Request - State: closed - Opened by joecummings 10 months ago - 1 comment
Labels: CLA Signed

#895 - [Feature addition] Clearml logger integration

Pull Request - State: open - Opened by Prakyathkantharaju 10 months ago - 9 comments
Labels: CLA Signed

#894 - Feature Request : ORPO

Issue - State: closed - Opened by nivibilla 10 months ago - 4 comments
Labels: documentation

#893 - [FR] (Q)DoRA

Issue - State: closed - Opened by DreamGenX 10 months ago - 8 comments
Labels: enhancement, rfc

#892 - [FR] Sample Packing with correct attention mask

Issue - State: closed - Opened by DreamGenX 10 months ago - 3 comments

#891 - Runtime Error: BF16 unsupported on supported hardware

Issue - State: closed - Opened by slobodaapl 10 months ago - 8 comments

#890 - Support `conversation_style` of `openai` format (OpenAI API style)

Pull Request - State: closed - Opened by xingyaoww 10 months ago - 4 comments
Labels: CLA Signed

#889 - Feat: Add support of multiple datasets in config

Pull Request - State: open - Opened by EvilFreelancer 10 months ago - 14 comments
Labels: CLA Signed

#888 - Mistral testing

Pull Request - State: closed - Opened by SalmanMohammadi 10 months ago - 6 comments
Labels: CLA Signed

#887 - Compile workflows seem broken on 2.3

Issue - State: closed - Opened by rohan-varma 10 months ago - 1 comment

#886 - Remove unused imports in models/__init__.py

Pull Request - State: closed - Opened by ebsmothers 10 months ago - 1 comment
Labels: CLA Signed

#885 - Enable profiler only on rank 0

Issue - State: closed - Opened by rohan-varma 10 months ago - 1 comment
Labels: enhancement, help wanted

#884 - Add support for 8da4w quantization

Pull Request - State: closed - Opened by andrewor14 10 months ago - 3 comments
Labels: CLA Signed

#883 - Validation and early stopping during training

Issue - State: open - Opened by kinggongzilla 10 months ago - 7 comments
Labels: enhancement, high-priority, community help wanted

#882 - Decouple ModelType from checkpointer

Issue - State: closed - Opened by rohan-varma 10 months ago

#881 - refacto: expose output layer for gemma models

Pull Request - State: closed - Opened by Optimox 10 months ago - 9 comments
Labels: CLA Signed

#880 - Support for MS Phi-3, please.

Issue - State: closed - Opened by razvanab 10 months ago - 1 comment
Labels: enhancement

#879 - Got error when download llama3 via the 'tune download'

Issue - State: closed - Opened by mazzzystar 10 months ago - 1 comment

#878 - How to load ckpt files generated by `torchtune.utils.FullModelHFCheckpointer` into hf models

Issue - State: open - Opened by BMPixel 10 months ago - 2 comments

#877 - Understanding contents of the final checkpoint file

Issue - State: closed - Opened by man-shar 10 months ago - 2 comments

#876 - Add Phi3 Mini 4K Instruct Model to torchtune

Pull Request - State: closed - Opened by kartikayk 10 months ago - 1 comment
Labels: CLA Signed

#875 - Sample packing for map datasets with correct RoPE encoding and no cross-contamination

Pull Request - State: closed - Opened by RdoubleA 10 months ago - 18 comments
Labels: CLA Signed

#874 - Feature/raft fine tuning

Pull Request - State: open - Opened by efenocchi 10 months ago - 4 comments
Labels: CLA Signed

#873 - Correctly pass TORCHTUNE_VERSION_DOCS during the build

Pull Request - State: closed - Opened by svekars 10 months ago - 2 comments
Labels: CLA Signed

#872 - Update Llama capitalization in docs

Pull Request - State: closed - Opened by joecummings 10 months ago - 2 comments
Labels: CLA Signed

#871 - Support stopping on more than just eos during generation

Pull Request - State: closed - Opened by ebsmothers 10 months ago - 1 comment
Labels: CLA Signed

#870 - Can't seem to get latest version via pip

Issue - State: closed - Opened by man-shar 10 months ago - 2 comments

#869 - Understanding QLora memory consumption for inference

Issue - State: closed - Opened by Optimox 10 months ago - 6 comments

#868 - Support for unstructured text corpus datasets for CPT

Pull Request - State: closed - Opened by RdoubleA 10 months ago - 5 comments
Labels: CLA Signed

#867 - Add missing Gemma recipes to registry, update default log_every_n_steps

Pull Request - State: closed - Opened by ebsmothers 10 months ago - 1 comment
Labels: CLA Signed

#866 - Revise the implementation of set_activation_checkpointing

Pull Request - State: closed - Opened by rohan-varma 10 months ago - 2 comments
Labels: CLA Signed

#865 - FSDP Llama3 wrapping improvements for full finetune

Pull Request - State: closed - Opened by rohan-varma 10 months ago - 9 comments
Labels: CLA Signed

#864 - File issues with 70B Lora setup

Issue - State: closed - Opened by BedirT 10 months ago - 4 comments

#863 - Is there a plan for supporting full fine-tuning 70B model?

Issue - State: closed - Opened by dmammfl 10 months ago - 6 comments
Labels: question

#862 - Update README to include 70B download command

Pull Request - State: closed - Opened by rohan-varma 10 months ago - 2 comments
Labels: CLA Signed

#861 - Validate all paths before doing any expensive work

Issue - State: open - Opened by rohan-varma 10 months ago
Labels: good first issue, community help wanted, better engineering

#860 - LLama 3 : can't load the hugging face state dict

Issue - State: closed - Opened by Optimox 10 months ago - 2 comments

#859 - token

Pull Request - State: closed - Opened by stefanrattay1 10 months ago - 3 comments

#858 - Can not download llama3 via the 'tune download'

Issue - State: closed - Opened by MaxwelsDonc 10 months ago - 1 comment

#857 - Documentation: Clarify all llama3 recipes

Pull Request - State: closed - Opened by musabgultekin 10 months ago - 5 comments
Labels: CLA Signed

#856 - Context Length Increse Results in OOM

Issue - State: closed - Opened by BedirT 10 months ago - 3 comments
Labels: documentation

#855 - enable LoRA + FSDP2

Pull Request - State: closed - Opened by weifengpy 10 months ago - 4 comments
Labels: CLA Signed

#854 - Cant load llama3 8B into memory

Issue - State: closed - Opened by abpani 10 months ago - 4 comments

#853 - Add eval results to QLoRA tutorial

Issue - State: closed - Opened by rohan-varma 10 months ago

#852 - update readme.md - add command to download llama3

Pull Request - State: closed - Opened by lessw2020 10 months ago - 1 comment
Labels: CLA Signed

#851 - Default to llama3-8b-instruct

Pull Request - State: closed - Opened by ebsmothers 10 months ago - 6 comments
Labels: CLA Signed

#850 - `tune run` crashes w/ `NotImplementedError`

Issue - State: closed - Opened by optimass 10 months ago - 4 comments

#849 - `tune download` doesn't download the weights

Issue - State: closed - Opened by optimass 10 months ago - 3 comments

#848 - Testing for Mistral models

Issue - State: open - Opened by SalmanMohammadi 10 months ago

#847 - adding model builders for code-llama2 7b, 13b, and 70b

Pull Request - State: closed - Opened by SalmanMohammadi 10 months ago - 13 comments
Labels: CLA Signed

#846 - Can not import torchtune

Issue - State: closed - Opened by abpani 10 months ago - 4 comments

#845 - Can i fine tune "dolphin-2.2.1-mistral-7b.Q2_K.gguf" with torchtune ? using cpu ?

Issue - State: closed - Opened by walidbet18 10 months ago - 14 comments
Labels: documentation, question

#844 - Multi-GPU QLoRA?

Issue - State: closed - Opened by cuichenx 10 months ago - 7 comments

#843 - [FSDP1] reduce GPU memory usage from 78G instead of 23G

Pull Request - State: closed - Opened by weifengpy 10 months ago - 3 comments
Labels: CLA Signed

#842 - Update PR template

Pull Request - State: closed - Opened by ebsmothers 10 months ago - 1 comment
Labels: CLA Signed

#841 - Better dataset and data documentation

Pull Request - State: closed - Opened by joecummings 10 months ago - 2 comments
Labels: CLA Signed

#840 - Transformer classifier

Pull Request - State: closed - Opened by SalmanMohammadi 10 months ago - 12 comments
Labels: CLA Signed

#839 - vRAM usage display for 70B models

Issue - State: closed - Opened by BedirT 10 months ago - 4 comments
Labels: documentation

#833 - Update nightly build instructions

Pull Request - State: closed - Opened by joecummings 10 months ago - 1 comment
Labels: CLA Signed

#832 - How to save a trained model so it can be loaded with HF `from_pretrained()`?

Issue - State: closed - Opened by calmitchell617 10 months ago - 30 comments
Labels: enhancement

#831 - Metric logger improvements

Pull Request - State: closed - Opened by ebsmothers 10 months ago - 4 comments
Labels: CLA Signed

#830 - VRAM Usage / Training Time in comparison to Huggingface

Issue - State: closed - Opened by bdytx5 10 months ago - 5 comments

#829 - Make Linter requirements for PR more prominent

Issue - State: closed - Opened by kartikayk 10 months ago - 1 comment

#828 - Running LoRA + QLoRA on Colab Notebooks

Issue - State: open - Opened by kartikayk 10 months ago - 1 comment
Labels: enhancement

#827 - feat: added packed continued pretraining dataset functionality

Pull Request - State: open - Opened by calmitchell617 10 months ago - 5 comments
Labels: CLA Signed

#826 - Finetuning LLama 2 Code Instruct without using Hugging Face

Issue - State: closed - Opened by sgupta1007 10 months ago - 7 comments

#825 - e2e torchtune tutorial example

Issue - State: open - Opened by seyeint 10 months ago - 7 comments
Labels: enhancement

#824 - Llama3 ChatFormat?

Issue - State: closed - Opened by Broyojo 10 months ago - 14 comments

#823 - Chat data + prompt template tutorial

Pull Request - State: closed - Opened by RdoubleA 10 months ago - 1 comment
Labels: CLA Signed

#822 - Delete unused APIs

Pull Request - State: closed - Opened by rohan-varma 10 months ago - 5 comments
Labels: CLA Signed

#812 - [RFC] Proximal Policy Optimisation

Issue - State: closed - Opened by SalmanMohammadi 10 months ago - 9 comments
Labels: rfc

#811 - [Feature Request] Support multimodal LLM, e.g., llava

Issue - State: closed - Opened by StarCycle 10 months ago - 2 comments
Labels: enhancement

#809 - Seeking guidance on continuing pretraining

Issue - State: closed - Opened by calmitchell617 10 months ago - 5 comments
