Lightning-AI/litgpt issues and pull requests

#2095 - pin: restrict datasets version to <4.0.0 for compatibility

Pull Request - State: closed - Opened by Borda 18 days ago

#2094 - add/debug Lit CI [wip]

Pull Request - State: open - Opened by Borda 18 days ago

#2093 - doc: add comments for clarifying query / KV groups

Pull Request - State: closed - Opened by raishish 24 days ago

#2092 - doc: add `n_query_groups` to attention notation table

Pull Request - State: closed - Opened by raishish 24 days ago

#2091 - [pre-commit.ci] pre-commit suggestions

Pull Request - State: closed - Opened by pre-commit-ci[bot] 25 days ago

#2090 - Secrets exfiltration vulnerability

Issue - State: open - Opened by darryk10 25 days ago

#2089 - Submission

Pull Request - State: open - Opened by Amrlmlna 26 days ago

#2088 - Complete pending todos in testing

Pull Request - State: closed - Opened by raishish 29 days ago - 3 comments

#2087 - How to bring my tokenizer and set vocabulary size accordingly for training a model with loaded weights

Issue - State: open - Opened by zvimarko 29 days ago
Labels: question

#2086 - finetune_lora upgrades

Pull Request - State: open - Opened by ysjprojects 30 days ago - 2 comments

#2085 - build(deps): update numpy requirement from <2 to none

Pull Request - State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies

#2084 - build(deps): update transformers requirement from <4.52,>=4.51.3 to >=4.51.3,<4.54

Pull Request - State: open - Opened by dependabot[bot] about 1 month ago
Labels: dependencies

#2083 - ci: show the longest tests for improvement

Pull Request - State: closed - Opened by Borda about 1 month ago

#2082 - docs: Add documentation for OpenAI-compatible API in LitGPT deployment

Pull Request - State: closed - Opened by bhimrazy about 1 month ago

#2081 - update bug-report/issue with reproducing in Studio

Pull Request - State: closed - Opened by Borda about 1 month ago

#2080 - feat(serve.py): add api_path parameter to cli options to allow custom API endpoint configuration

Pull Request - State: open - Opened by botirk38 about 1 month ago - 5 comments

#2079 - Deferring import of torch in config to allow faster import

Pull Request - State: closed - Opened by JackUrb about 2 months ago - 1 comment

#2078 - limit PR permissions vol.2

Pull Request - State: closed - Opened by Borda about 2 months ago

#2077 - limit PR permissions

Pull Request - State: closed - Opened by Borda about 2 months ago

#2075 - debug some failing standalone tests with compiler

Pull Request - State: closed - Opened by Borda about 2 months ago

#2074 - doc: Misleading QKV shape code comments

Issue - State: closed - Opened by d-kleine about 2 months ago - 4 comments
Labels: bug

#2073 - Moving to lazy root imports to make config loading snappy

Pull Request - State: open - Opened by JackUrb about 2 months ago - 12 comments
Labels: enhancement, performance

#2072 - debug installing `torch` for Thunder

Pull Request - State: closed - Opened by Borda about 2 months ago - 1 comment

#2071 - support swanlab

Issue - State: open - Opened by guangyuli-uoe about 2 months ago
Labels: enhancement

#2070 - build(deps): update transformers requirement from <4.52,>=4.51.3 to >=4.51.3,<4.53

Pull Request - State: closed - Opened by dependabot[bot] about 2 months ago - 3 comments
Labels: dependencies

#2069 - build(deps): update bitsandbytes requirement from <0.43,>=0.42 to >=0.42,<0.47

Pull Request - State: open - Opened by dependabot[bot] about 2 months ago - 1 comment
Labels: dependencies

#2068 - build(deps): bump litdata from 0.2.45 to 0.2.49

Pull Request - State: closed - Opened by dependabot[bot] about 2 months ago - 1 comment
Labels: dependencies

#2067 - build(deps): bump the gha-updates group with 3 updates

Pull Request - State: closed - Opened by dependabot[bot] about 2 months ago - 1 comment
Labels: CI / actions

#2066 - Add Dependabot for Pip & GitHub Actions

Pull Request - State: closed - Opened by Borda about 2 months ago

#2065 - debug failing standalone tests with Thunder

Pull Request - State: closed - Opened by Borda about 2 months ago - 3 comments

#2064 - args bug in setup

Issue - State: closed - Opened by guangyuli-uoe about 2 months ago - 5 comments
Labels: bug, waiting on author

#2063 - bump: testing with PT 2.7.1

Pull Request - State: closed - Opened by Borda about 2 months ago

#2062 - Fix in `convert_hf_checkpoint` related to Gemma 3

Pull Request - State: closed - Opened by mseeger 2 months ago - 8 comments

#2061 - Refactoring of multi-head attention and support for KV caching

Pull Request - State: open - Opened by mseeger 2 months ago - 27 comments
Labels: enhancement

#2060 - Qwen3 MoE

Pull Request - State: closed - Opened by ysjprojects 2 months ago - 1 comment

#2059 - Update oom.md

Pull Request - State: closed - Opened by givemesomeoptionsforemail 2 months ago - 1 comment

#2058 - Update spacing in README.md

Pull Request - State: closed - Opened by Borda 2 months ago

#2057 - req: pin `bitsandbytes>=0.45.2,<0.45.5`

Pull Request - State: closed - Opened by Borda 2 months ago

#2056 - ci: extend testing with `ubuntu-24.04`

Pull Request - State: closed - Opened by Borda 2 months ago

#2055 - Remove litserve version constraint

Pull Request - State: closed - Opened by twsl 2 months ago - 1 comment

#2054 - ci: use Thunder dev images for testing

Pull Request - State: closed - Opened by Borda 2 months ago - 2 comments

#2053 - simplify the GPU testing flow

Pull Request - State: closed - Opened by Borda 2 months ago

#2052 - bump: update Torch to resolve failing test

Pull Request - State: closed - Opened by Borda 2 months ago

#2051 - Add pip list command after installing dependencies in GPU tests

Pull Request - State: closed - Opened by bhimrazy 2 months ago - 1 comment

#2050 - Xfail Thunder integration test due to Dynamo bug

Pull Request - State: closed - Opened by deependujha 2 months ago - 5 comments

#2049 - tests: mark `test_evaluate_script` as flaky

Pull Request - State: closed - Opened by Borda 2 months ago

#2048 - fix: Pretraining text files with recent litdata versions

Pull Request - State: closed - Opened by andyland 2 months ago

#2047 - phi-4 reasoning models

Pull Request - State: closed - Opened by ysjprojects 2 months ago

#2046 - Qwen3 MoE Preliminary: add intermediate_size argument to MLP modules

Pull Request - State: closed - Opened by ysjprojects 3 months ago

#2045 - litserve version constraint

Issue - State: closed - Opened by twsl 3 months ago - 2 comments
Labels: question

#2044 - Qwen3 Dense

Pull Request - State: closed - Opened by ysjprojects 3 months ago - 3 comments

#2043 - ci: update guardian for PRs

Pull Request - State: closed - Opened by Borda 3 months ago

#2042 - How to use fabric.clip_gradients in 16-mixed?

Issue - State: open - Opened by VJJJJJJ1 3 months ago
Labels: help wanted, question

#2041 - ci: run `pull_request` only on own PRs

Pull Request - State: closed - Opened by Borda 3 months ago

#2040 - fix: Add fallback chat template

Pull Request - State: closed - Opened by andyland 3 months ago

#2039 - update file

Pull Request - State: closed - Opened by darryk10 3 months ago

#2038 - done

Pull Request - State: closed - Opened by darryk10 3 months ago

#2037 - Update test_batch.py

Pull Request - State: closed - Opened by ahash23 3 months ago

#2036 - Add optional sys prompt

Pull Request - State: closed - Opened by twsl 3 months ago

#2035 - Add devcontainer

Pull Request - State: closed - Opened by twsl 3 months ago

#2034 - Remove dependency from config to utils

Pull Request - State: closed - Opened by lukemerrick 3 months ago - 1 comment

#2033 - Add devcontainer

Issue - State: closed - Opened by twsl 3 months ago
Labels: enhancement

#2032 - AssertionError: Rank 2 has different values for step: 49996.0 Other ranks: 49991.0

Issue - State: open - Opened by VJJJJJJ1 3 months ago

#2031 - Add support for Qwen 3

Issue - State: open - Opened by ShinnosukeUesaka 3 months ago - 1 comment
Labels: enhancement

#2030 - Transformers version bump for recent model support

Issue - State: open - Opened by KaelanDt 3 months ago

#2029 - Transformers version bump

Pull Request - State: closed - Opened by KaelanDt 3 months ago - 9 comments

#2028 - Adds Qwen3 dense models

Pull Request - State: closed - Opened by KaelanDt 3 months ago - 1 comment

#2027 - Update README.md for installation instructions

Pull Request - State: closed - Opened by prabhuteja12 3 months ago - 3 comments

#2026 - Installation of V0.5.8

Issue - State: closed - Opened by prabhuteja12 3 months ago
Labels: bug

#2025 - add testing for py3.12 & py3.13

Pull Request - State: closed - Opened by Borda 3 months ago

#2024 - bump version post 0.5.8

Pull Request - State: closed - Opened by t-vi 3 months ago

#2023 - prepare 0.5.8

Pull Request - State: closed - Opened by t-vi 3 months ago

#2022 - drop upper bounds in dependencies

Pull Request - State: closed - Opened by t-vi 3 months ago - 4 comments

#2021 - how to resume training from a lora checkpoint?

Issue - State: closed - Opened by millix19 3 months ago - 1 comment
Labels: question

#2020 - finetune_lora on gemma bug

Issue - State: open - Opened by qqandy0120 3 months ago - 1 comment
Labels: bug, help wanted

#2019 - Converting Safetensors Format Weights from Llama Model with New Tokens to LitGPT Format

Issue - State: closed - Opened by GuocunWang 3 months ago - 1 comment
Labels: question

#2018 - fix typo

Pull Request - State: closed - Opened by Lynsoo 3 months ago

#2017 - Cast tensors in KVCache only when needed

Pull Request - State: closed - Opened by Andrei-Aksionov 4 months ago

#2016 - input_pos_maxp1 as a Python integer

Pull Request - State: closed - Opened by Andrei-Aksionov 4 months ago - 2 comments

#2015 - Llama 4 support

Issue - State: open - Opened by codestar12 4 months ago - 6 comments
Labels: enhancement

#2014 - LLaMAMoE fixes

Pull Request - State: closed - Opened by ysjprojects 4 months ago - 5 comments

#2013 - Mismatch in LLaMAMoE litgpt and hf implementation for Mixtral

Issue - State: closed - Opened by ysjprojects 4 months ago - 1 comment
Labels: bug

#2012 - (WIP) DeepseekV3 (and Multi-Head Latent Attention)

Pull Request - State: open - Opened by ysjprojects 4 months ago - 2 comments

#2011 - building tutorials as mkdocs pages

Pull Request - State: closed - Opened by Borda 4 months ago

#2010 - How do I finetune with validation on my own dataset?

Issue - State: closed - Opened by tannerhs 4 months ago
Labels: question

#2009 - [pre-commit.ci] pre-commit suggestions

Pull Request - State: closed - Opened by pre-commit-ci[bot] 4 months ago

#2008 - feat: load only text weights from multimodal gemma

Pull Request - State: closed - Opened by pquadri 4 months ago - 6 comments