GitHub / Lightning-AI/litgpt issues and pull requests
#2095 - pin: restrict datasets version to <4.0.0 for compatibility
Pull Request -
State: closed - Opened by Borda 18 days ago
#2094 - add/debug Lit CI [wip]
Pull Request -
State: open - Opened by Borda 18 days ago
#2093 - doc: add comments for clarifying query / KV groups
Pull Request -
State: closed - Opened by raishish 24 days ago
#2092 - doc: add `n_query_groups` to attention notation table
Pull Request -
State: closed - Opened by raishish 24 days ago
#2091 - [pre-commit.ci] pre-commit suggestions
Pull Request -
State: closed - Opened by pre-commit-ci[bot] 25 days ago
#2090 - Secrets exfiltration vulnerability
Issue -
State: open - Opened by darryk10 25 days ago
#2089 - Submission
Pull Request -
State: open - Opened by Amrlmlna 26 days ago
#2088 - Complete pending todos in testing
Pull Request -
State: closed - Opened by raishish 29 days ago
- 3 comments
#2087 - How to bring my tokenizer and set vocabulary size accordingly for training a model with loaded weights
Issue -
State: open - Opened by zvimarko 29 days ago
Labels: question
#2086 - finetune_lora upgrades
Pull Request -
State: open - Opened by ysjprojects 30 days ago
- 2 comments
#2085 - build(deps): update numpy requirement from <2 to none
Pull Request -
State: closed - Opened by dependabot[bot] about 1 month ago
Labels: dependencies
#2084 - build(deps): update transformers requirement from <4.52,>=4.51.3 to >=4.51.3,<4.54
Pull Request -
State: open - Opened by dependabot[bot] about 1 month ago
Labels: dependencies
#2083 - ci: show the longest tests for improvement
Pull Request -
State: closed - Opened by Borda about 1 month ago
#2082 - docs: Add documentation for OpenAI-compatible API in LitGPT deployment
Pull Request -
State: closed - Opened by bhimrazy about 1 month ago
#2081 - update bug-report/issue with reproducing in Studio
Pull Request -
State: closed - Opened by Borda about 1 month ago
#2080 - feat(serve.py): add api_path parameter to cli options to allow custom API endpoint configuration
Pull Request -
State: open - Opened by botirk38 about 1 month ago
- 5 comments
#2079 - Deferring import of torch in config to allow faster import
Pull Request -
State: closed - Opened by JackUrb about 2 months ago
- 1 comment
#2078 - limit PR permissions vol.2
Pull Request -
State: closed - Opened by Borda about 2 months ago
#2077 - limit PR permissions
Pull Request -
State: closed - Opened by Borda about 2 months ago
#2075 - debug some failing standalone tests with compiler
Pull Request -
State: closed - Opened by Borda about 2 months ago
#2074 - doc: Misleading QKV shape code comments
Issue -
State: closed - Opened by d-kleine about 2 months ago
- 4 comments
Labels: bug
#2073 - Moving to lazy root imports to make config loading snappy
Pull Request -
State: open - Opened by JackUrb about 2 months ago
- 12 comments
Labels: enhancement, performance
#2072 - debug installing `torch` for Thunder
Pull Request -
State: closed - Opened by Borda about 2 months ago
- 1 comment
#2071 - support swanlab
Issue -
State: open - Opened by guangyuli-uoe about 2 months ago
Labels: enhancement
#2070 - build(deps): update transformers requirement from <4.52,>=4.51.3 to >=4.51.3,<4.53
Pull Request -
State: closed - Opened by dependabot[bot] about 2 months ago
- 3 comments
Labels: dependencies
#2069 - build(deps): update bitsandbytes requirement from <0.43,>=0.42 to >=0.42,<0.47
Pull Request -
State: open - Opened by dependabot[bot] about 2 months ago
- 1 comment
Labels: dependencies
#2068 - build(deps): bump litdata from 0.2.45 to 0.2.49
Pull Request -
State: closed - Opened by dependabot[bot] about 2 months ago
- 1 comment
Labels: dependencies
#2067 - build(deps): bump the gha-updates group with 3 updates
Pull Request -
State: closed - Opened by dependabot[bot] about 2 months ago
- 1 comment
Labels: CI / actions
#2066 - Add Dependabot for Pip & GitHub Actions
Pull Request -
State: closed - Opened by Borda about 2 months ago
#2065 - debug failing standalone tests with Thunder
Pull Request -
State: closed - Opened by Borda about 2 months ago
- 3 comments
#2064 - args bug in setup
Issue -
State: closed - Opened by guangyuli-uoe about 2 months ago
- 5 comments
Labels: bug, waiting on author
#2063 - bump: testing with PT 2.7.1
Pull Request -
State: closed - Opened by Borda about 2 months ago
#2062 - Fix in `convert_hf_checkpoint` related to Gemma 3
Pull Request -
State: closed - Opened by mseeger 2 months ago
- 8 comments
#2061 - Refactoring of multi-head attention and support for KV caching
Pull Request -
State: open - Opened by mseeger 2 months ago
- 27 comments
Labels: enhancement
#2060 - Qwen3 MoE
Pull Request -
State: closed - Opened by ysjprojects 2 months ago
- 1 comment
#2059 - Update oom.md
Pull Request -
State: closed - Opened by givemesomeoptionsforemail 2 months ago
- 1 comment
#2058 - Update spacing in README.md
Pull Request -
State: closed - Opened by Borda 2 months ago
#2057 - req: pin `bitsandbytes>=0.45.2,<0.45.5`
Pull Request -
State: closed - Opened by Borda 2 months ago
#2056 - ci: extend testing with `ubuntu-24.04`
Pull Request -
State: closed - Opened by Borda 2 months ago
#2055 - Remove litserve version constraint
Pull Request -
State: closed - Opened by twsl 2 months ago
- 1 comment
#2054 - ci: use Thunder dev images for testing
Pull Request -
State: closed - Opened by Borda 2 months ago
- 2 comments
#2053 - simplify the GPU testing flow
Pull Request -
State: closed - Opened by Borda 2 months ago
#2052 - bump: update Torch to resolve failing test
Pull Request -
State: closed - Opened by Borda 2 months ago
#2051 - Add pip list command after installing dependencies in GPU tests
Pull Request -
State: closed - Opened by bhimrazy 2 months ago
- 1 comment
#2050 - Xfail Thunder integration test due to Dynamo bug
Pull Request -
State: closed - Opened by deependujha 2 months ago
- 5 comments
#2049 - tests: mark `test_evaluate_script` as flaky
Pull Request -
State: closed - Opened by Borda 2 months ago
#2048 - fix: Pretraining text files with recent litdata versions
Pull Request -
State: closed - Opened by andyland 2 months ago
#2047 - phi-4 reasoning models
Pull Request -
State: closed - Opened by ysjprojects 2 months ago
#2046 - Qwen3 MoE Preliminary: add intermediate_size argument to MLP modules
Pull Request -
State: closed - Opened by ysjprojects 3 months ago
#2045 - litserve version constraint
Issue -
State: closed - Opened by twsl 3 months ago
- 2 comments
Labels: question
#2044 - Qwen3 Dense
Pull Request -
State: closed - Opened by ysjprojects 3 months ago
- 3 comments
#2043 - ci: update guardian for PRs
Pull Request -
State: closed - Opened by Borda 3 months ago
#2042 - How to use fabric.clip_gradients in 16-mixed?
Issue -
State: open - Opened by VJJJJJJ1 3 months ago
Labels: help wanted, question
#2041 - ci: run `pull_request` only on own PRs
Pull Request -
State: closed - Opened by Borda 3 months ago
#2040 - fix: Add fallback chat template
Pull Request -
State: closed - Opened by andyland 3 months ago
#2039 - update file
Pull Request -
State: closed - Opened by darryk10 3 months ago
#2038 - done
Pull Request -
State: closed - Opened by darryk10 3 months ago
#2037 - Update test_batch.py
Pull Request -
State: closed - Opened by ahash23 3 months ago
#2036 - Add optional sys prompt
Pull Request -
State: closed - Opened by twsl 3 months ago
#2035 - Add devcontainer
Pull Request -
State: closed - Opened by twsl 3 months ago
#2034 - Remove dependency from config to utils
Pull Request -
State: closed - Opened by lukemerrick 3 months ago
- 1 comment
#2033 - Add devcontainer
Issue -
State: closed - Opened by twsl 3 months ago
Labels: enhancement
#2032 - AssertionError: Rank 2 has different values for step: 49996.0 Other ranks: 49991.0
Issue -
State: open - Opened by VJJJJJJ1 3 months ago
#2031 - Add support for Qwen 3
Issue -
State: open - Opened by ShinnosukeUesaka 3 months ago
- 1 comment
Labels: enhancement
#2030 - Transformers version bump for recent model support
Issue -
State: open - Opened by KaelanDt 3 months ago
#2029 - Transformers version bump
Pull Request -
State: closed - Opened by KaelanDt 3 months ago
- 9 comments
#2028 - Adds Qwen3 dense models
Pull Request -
State: closed - Opened by KaelanDt 3 months ago
- 1 comment
#2027 - Update README.md for installation instructions
Pull Request -
State: closed - Opened by prabhuteja12 3 months ago
- 3 comments
#2026 - Installation of V0.5.8
Issue -
State: closed - Opened by prabhuteja12 3 months ago
Labels: bug
#2025 - add testing for py3.12 & py3.13
Pull Request -
State: closed - Opened by Borda 3 months ago
#2024 - bump version post 0.5.8
Pull Request -
State: closed - Opened by t-vi 3 months ago
#2023 - prepare 0.5.8
Pull Request -
State: closed - Opened by t-vi 3 months ago
#2022 - drop upper bounds in dependencies
Pull Request -
State: closed - Opened by t-vi 3 months ago
- 4 comments
#2021 - how to resume training from a lora checkpoint?
Issue -
State: closed - Opened by millix19 3 months ago
- 1 comment
Labels: question
#2020 - finetune_lora on gemma bug
Issue -
State: open - Opened by qqandy0120 3 months ago
- 1 comment
Labels: bug, help wanted
#2019 - Converting Safetensors Format Weights from Llama Model with New Tokens to LitGPT Format
Issue -
State: closed - Opened by GuocunWang 3 months ago
- 1 comment
Labels: question
#2018 - fix typo
Pull Request -
State: closed - Opened by Lynsoo 3 months ago
#2017 - Cast tensors in KVCache only when needed
Pull Request -
State: closed - Opened by Andrei-Aksionov 4 months ago
#2016 - input_pos_maxp1 as a Python integer
Pull Request -
State: closed - Opened by Andrei-Aksionov 4 months ago
- 2 comments
#2015 - Llama 4 support
Issue -
State: open - Opened by codestar12 4 months ago
- 6 comments
Labels: enhancement
#2014 - LLaMAMoE fixes
Pull Request -
State: closed - Opened by ysjprojects 4 months ago
- 5 comments
#2013 - Mismatch in LLaMAMoE litgpt and hf implementation for Mixtral
Issue -
State: closed - Opened by ysjprojects 4 months ago
- 1 comment
Labels: bug
#2012 - (WIP) DeepseekV3 (and Multi-Head Latent Attention)
Pull Request -
State: open - Opened by ysjprojects 4 months ago
- 2 comments
#2011 - building tutorials as mkdocs pages
Pull Request -
State: closed - Opened by Borda 4 months ago
#2010 - How do I finetune with validation on my own dataset?
Issue -
State: closed - Opened by tannerhs 4 months ago
Labels: question
#2009 - [pre-commit.ci] pre-commit suggestions
Pull Request -
State: closed - Opened by pre-commit-ci[bot] 4 months ago
#2008 - feat: load only text weights from multimodal gemma
Pull Request -
State: closed - Opened by pquadri 4 months ago
- 6 comments
#2007 - add borda as codeowner
Pull Request -
State: closed - Opened by t-vi 4 months ago
#2006 - feat: add tests for gemma3
Pull Request -
State: closed - Opened by k223kim 4 months ago
#2005 - feat: add gemma 3 in readme and tutorials
Pull Request -
State: closed - Opened by k223kim 4 months ago
#2004 - Fix/loading gemma 3 1b
Pull Request -
State: closed - Opened by pquadri 4 months ago
- 3 comments
#2003 - New `rope_indices` in `config.py`: Unintended effects?
Issue -
State: closed - Opened by mseeger 4 months ago
- 3 comments
Labels: question
#2002 - feat: add gemma-3-12b
Pull Request -
State: closed - Opened by k223kim 4 months ago
#2001 - [3/4] feat: add gemma 3 4b
Pull Request -
State: closed - Opened by k223kim 4 months ago
#2000 - [2/4] add gemma 3 1b
Pull Request -
State: closed - Opened by k223kim 4 months ago
#1999 - try pyupgrade-up py38
Pull Request -
State: closed - Opened by Borda 4 months ago
#1998 - [1/4] feat: add gemma 3 27b
Pull Request -
State: closed - Opened by k223kim 4 months ago
- 3 comments
#1997 - feat: add rope indices
Pull Request -
State: closed - Opened by k223kim 4 months ago
#1996 - test: flexible wait for serve start
Pull Request -
State: closed - Opened by Borda 4 months ago
#1995 - fix: replace sliding window configuration parameters to sliding windows indices
Pull Request -
State: closed - Opened by k223kim 4 months ago
- 1 comment