Lightning-AI/litgpt issues and pull requests

#1717 - Cannot attend to 9904, block size is only 4096

Issue - State: closed - Opened by starjob42 2 months ago - 1 comment
Labels: question

#1716 - "RuntimeError: All the chunks should have been deleted." on non-Studio machine

Issue - State: open - Opened by rasbt 2 months ago - 1 comment
Labels: bug

#1715 - llm.generate issue on CPU machines

Issue - State: closed - Opened by rasbt 2 months ago - 3 comments
Labels: bug

#1714 - `llm.generate` function does not work on Mac (MPS) devices anymore

Issue - State: closed - Opened by rasbt 2 months ago
Labels: bug

#1713 - Llama 3.1 RoPE not implemented -- Finetuning will lead to spelling/grammar mistakes

Issue - State: closed - Opened by calvintwr 2 months ago - 3 comments
Labels: bug

#1712 - Data Loading bug in `pretrain` on resume over multiple epochs

Issue - State: open - Opened by fdalvi 2 months ago
Labels: bug

#1711 - Manual convert_to_litgpt for Phi-3.5-mini-instruct downloaded weights from HF

Issue - State: closed - Opened by wasifferoze 2 months ago - 1 comment
Labels: bug

#1710 - minor Readme update/typos

Pull Request - State: closed - Opened by Borda 2 months ago

#1709 - Qwen series

Issue - State: open - Opened by Godlikemandyy 2 months ago - 1 comment
Labels: question, model-weights

#1708 - Unable to train using LoRA however training with adapter seems fine

Issue - State: closed - Opened by salokr 2 months ago - 5 comments
Labels: bug

#1707 - Fix device Error in Decode Stream

Pull Request - State: closed - Opened by Motsepe-Jr 3 months ago

#1706 - zsh: illegal hardware instruction

Issue - State: closed - Opened by Suddhasatwa 3 months ago - 6 comments
Labels: bug

#1705 - Way to load quantized model on the fly through Python SDK

Issue - State: closed - Opened by sovit-123 3 months ago - 2 comments
Labels: question

#1704 - `accelerator` argument not working properly with `LitServer`

Issue - State: closed - Opened by sovit-123 3 months ago - 2 comments
Labels: bug

#1703 - Liger Kernel Integration

Issue - State: closed - Opened by ByronHsu 3 months ago - 5 comments
Labels: enhancement

#1702 - Add `batched_generate_fn()`

Pull Request - State: closed - Opened by apaz-cli 3 months ago

#1701 - bump thunder dependency to main

Pull Request - State: closed - Opened by t-vi 3 months ago - 1 comment

#1700 - add support for batched input_pos to model

Pull Request - State: closed - Opened by t-vi 3 months ago

#1699 - [BUG] LLaMA 3.1 RoPE

Issue - State: closed - Opened by zzhhjjj 3 months ago - 3 comments
Labels: bug, question

#1698 - Training not working with default script

Issue - State: closed - Opened by ByteBrigand 3 months ago - 17 comments
Labels: bug

#1697 - Ligpt for windows - triton

Issue - State: closed - Opened by seshganesh 3 months ago - 1 comment
Labels: bug

#1696 - Fix falcon prompt template

Pull Request - State: closed - Opened by rasbt 3 months ago

#1695 - Bumb version to 0.4.11

Pull Request - State: closed - Opened by rasbt 3 months ago

#1694 - Preserve eos in encoding when max_seq_length = -1

Pull Request - State: closed - Opened by sanderland 3 months ago - 3 comments

#1693 - Add `batched_next_token()` and `batched_sample()`

Pull Request - State: closed - Opened by apaz-cli 3 months ago - 8 comments

#1692 - Add `batched_next_token()` and `batched_sample()`

Pull Request - State: closed - Opened by apaz-cli 3 months ago - 1 comment

#1691 - Avoid error when executing benchmark util outside a git folder

Pull Request - State: closed - Opened by rasbt 3 months ago - 1 comment

#1690 - Make number of generated tokens consistent with CLI

Pull Request - State: closed - Opened by rasbt 3 months ago - 3 comments

#1689 - about litgpt command

Issue - State: closed - Opened by aekx 3 months ago - 3 comments
Labels: question

#1688 - Disable attention mask during new token generation

Pull Request - State: closed - Opened by Andrei-Aksionov 3 months ago

#1687 - Add Microsoft Phi 3.5 checkpoint

Pull Request - State: closed - Opened by rasbt 3 months ago

#1686 - Microsoft Phi 3.5 MoE

Issue - State: open - Opened by rasbt 3 months ago
Labels: enhancement, model-weights

#1685 - Spelling fix

Pull Request - State: closed - Opened by rasbt 3 months ago

#1684 - Update check_nvlink_connectivity

Pull Request - State: closed - Opened by sanderland 3 months ago - 3 comments

#1683 - adding a UI for the training and the finetuning

Issue - State: open - Opened by Esmail-ibraheem 3 months ago - 1 comment
Labels: enhancement

#1682 - Llama3 finetuning and generation: Double begin_of_text, no eot_id

Issue - State: open - Opened by sanderland 3 months ago - 9 comments
Labels: bug

#1681 - Added git hash to benchmark utility.

Pull Request - State: closed - Opened by apaz-cli 3 months ago - 1 comment

#1680 - Add PR benchmark util for internal use

Pull Request - State: closed - Opened by rasbt 3 months ago - 3 comments

#1679 - Improved benchmark utils

Pull Request - State: closed - Opened by rasbt 3 months ago

#1678 - Fix KV cache issue in LLM API

Pull Request - State: closed - Opened by rasbt 3 months ago

#1677 - Auto device handling in LLM API

Pull Request - State: closed - Opened by rasbt 3 months ago

#1676 - Add distribute=None to python-api.md

Pull Request - State: closed - Opened by rasbt 3 months ago

#1675 - Combine `generate()` functions

Pull Request - State: closed - Opened by apaz-cli 3 months ago - 10 comments

#1674 - Bumb version to 0.4.10 for next release

Pull Request - State: closed - Opened by rasbt 3 months ago

#1673 - Add Mistral Large 123B

Pull Request - State: closed - Opened by rasbt 3 months ago

#1672 - attention mask is incorrect when generate with softcapping

Issue - State: open - Opened by twaka 3 months ago - 8 comments
Labels: bug

#1671 - Disable KV cache option

Issue - State: open - Opened by rasbt 3 months ago
Labels: enhancement

#1670 - Multi-gpu serving

Pull Request - State: closed - Opened by rasbt 3 months ago

#1669 - Update azure-gpu-test.yml

Pull Request - State: closed - Opened by rasbt 3 months ago

#1668 - Support the refactored API in litgpt serve

Pull Request - State: closed - Opened by rasbt 3 months ago

#1667 - Make LitGPT LLM API compatible with PyTorch Lightning Trainer 1/2

Pull Request - State: closed - Opened by rasbt 3 months ago

#1666 - Swap old Llama model with Phi-3

Pull Request - State: closed - Opened by rasbt 3 months ago

#1665 - Gemma 2B weights seem to have changed

Issue - State: open - Opened by rasbt 3 months ago - 1 comment
Labels: bug

#1664 - Bumb version for 0.4.9 release

Pull Request - State: closed - Opened by rasbt 3 months ago

#1663 - Tensor parallelism generates non-sensical outputs

Issue - State: open - Opened by rasbt 3 months ago - 1 comment
Labels: bug

#1662 - Use FlexAttention

Issue - State: open - Opened by rasbt 3 months ago
Labels: enhancement, performance

#1661 - Support Tensor Parallel in Python API

Pull Request - State: closed - Opened by rasbt 3 months ago

#1660 - Optionally return benchmark info in Python API

Pull Request - State: closed - Opened by rasbt 3 months ago - 3 comments

#1659 - Issue a warning if a local folder with the same name as the litgpt package exists

Pull Request - State: closed - Opened by rasbt 3 months ago

#1658 - Fix some issues with circular and relative imports

Pull Request - State: closed - Opened by rasbt 3 months ago

#1657 - Refactor Python API to introduce new distribute method (part of a larger refactor for PTL support)

Pull Request - State: closed - Opened by rasbt 3 months ago

#1656 - Add a PyTorch Lightning example

Pull Request - State: closed - Opened by rasbt 3 months ago

#1655 - MPS test gave out gibberish result

Issue - State: closed - Opened by edifice1989 3 months ago - 3 comments
Labels: bug

#1654 - Update LitServe version and tests

Pull Request - State: closed - Opened by rasbt 4 months ago - 1 comment

#1653 - Version bumb for Gemma 2 2B release

Pull Request - State: closed - Opened by rasbt 4 months ago

#1652 - Pin litserve version

Pull Request - State: closed - Opened by rasbt 4 months ago

#1638 - Implement prompt caching to speed up inference

Issue - State: open - Opened by rasbt 4 months ago - 1 comment
Labels: enhancement

#1637 - Support for using large models in the Python API via sequential generation

Pull Request - State: closed - Opened by rasbt 4 months ago

#1628 - Support sequential generation in litgpt serve

Issue - State: closed - Opened by rasbt 4 months ago - 1 comment
Labels: enhancement

#1627 - Mistral Large 2 Checkpoints

Issue - State: closed - Opened by rasbt 4 months ago - 1 comment
Labels: enhancement, checkpoints

#1616 - Support downloading and using quantized weights (GGUF)

Issue - State: open - Opened by rasbt 4 months ago - 1 comment
Labels: enhancement

#1610 - Set kv cache size based on resource needs

Issue - State: closed - Opened by rasbt 4 months ago - 1 comment
Labels: enhancement

#1607 - Results of GPT2 (124M) reproduction with LitGPT

Issue - State: closed - Opened by awaelchli 4 months ago - 2 comments
Labels: question

#1606 - Is there any support for visual generation?

Issue - State: open - Opened by dunbar12138 4 months ago - 2 comments
Labels: enhancement, question

#1598 - Apply Sliding Window Attention to Mistral

Issue - State: closed - Opened by rasbt 4 months ago
Labels: enhancement

#1586 - Update Azure workflow to use the latest stable PyTorch version

Pull Request - State: closed - Opened by rasbt 4 months ago - 1 comment

#1578 - Add FlashAttention v3 support

Pull Request - State: closed - Opened by rasbt 4 months ago - 2 comments

#1576 - Restore previous LitData assertions in tests

Pull Request - State: closed - Opened by awaelchli 4 months ago - 4 comments

#1566 - Remove duplicated bos_token for CodeLlama

Pull Request - State: closed - Opened by alealv 4 months ago - 4 comments

#1538 - Do not wrap LoRA layers with FSDP

Pull Request - State: open - Opened by janEbert 5 months ago - 20 comments

#1522 - Tokenizer: prefer HF Tokenizer

Pull Request - State: closed - Opened by rasbt 5 months ago - 1 comment

#1498 - add support for Qwen 2

Issue - State: open - Opened by aniketmaurya 5 months ago - 5 comments
Labels: model-weights

#1483 - Recreate Mistral PR

Pull Request - State: closed - Opened by rasbt 5 months ago - 3 comments

#1443 - validation output during finetuning

Issue - State: closed - Opened by richardzhuang0412 6 months ago - 3 comments

#1431 - Mistral v0.3

Pull Request - State: closed - Opened by rasbt 6 months ago

#1430 - performing continuous pretraining and then finetuning causes error

Issue - State: open - Opened by richardzhuang0412 6 months ago - 4 comments
Labels: bug

#1377 - After some iteration in pretraining a LLM, IndexError is raised related to dataset chunking

Issue - State: open - Opened by MusulmonLolayev 7 months ago - 1 comment

#1363 - litgpt download doesn't work

Issue - State: closed - Opened by natanloterio 7 months ago - 10 comments
Labels: bug

#1220 - Can I train a model on 7900XT 4 cards?

Issue - State: open - Opened by win10ogod 8 months ago - 2 comments
Labels: question

#1202 - Finetuning run times out at evaluation step on multiple devices

Issue - State: open - Opened by ecatkins 8 months ago - 13 comments

#1192 - Introduce OptimizerArgs and add support for GaLore

Pull Request - State: closed - Opened by rasbt 8 months ago - 13 comments
Labels: breaking change

#1182 - Config URLs for pinned version

Pull Request - State: closed - Opened by awaelchli 8 months ago

#1156 - Separate out the biases

Pull Request - State: closed - Opened by rasbt 8 months ago - 6 comments

#1107 - Add MPS configs

Issue - State: open - Opened by rasbt 8 months ago - 7 comments

#1086 - Error loading converted litgpt checkpoints in `pytorch_model.bin` format using huggingface `AutoModelForCausalLM`

Issue - State: open - Opened by jwkirchenbauer 8 months ago - 7 comments

#934 - Adding DoRA (Weight-Decomposed Low-Rank Adaptation) to improve LoRA

Issue - State: open - Opened by rasbt 9 months ago - 20 comments
Labels: enhancement, fine-tuning

#796 - Harcoded incorrect (and repeated) validation example

Issue - State: closed - Opened by DavidGOrtega 12 months ago - 20 comments
Labels: enhancement

#620 - Sample packing for pretraining/fine-tuning

Issue - State: open - Opened by alitirmizi23 about 1 year ago - 16 comments
Labels: enhancement, pre-training

#491 - RuntimeError: Cannot writeback when the parameter shape changes

Issue - State: closed - Opened by ngctnnnn about 1 year ago - 11 comments
Labels: bug, fine-tuning

#100 - Add `incremental_save` to utils, use in `convert_hf_checkpoint`

Pull Request - State: closed - Opened by carmocca over 1 year ago - 2 comments

GitHub / Lightning-AI/litgpt issues and pull requests