Ecosyste.ms: Issues
An open API service that provides issue and pull request metadata for open source projects.
GitHub / sgl-project/sglang issues and pull requests
#1746 - Fix sliding window attention and gemma-2 unit tests in CI
Pull Request -
State: closed - Opened by merrymercy 19 days ago
#1745 - Introducing SGLang Guru on Gurubase.io
Pull Request -
State: open - Opened by kursataktas 20 days ago
- 4 comments
#1744 - [Bug] Issue in latest sglang docker image
Issue -
State: closed - Opened by shubhamgajbhiye1994 20 days ago
- 1 comment
#1743 - Fix prefill oom
Pull Request -
State: closed - Opened by hnyls2002 20 days ago
#1742 - [Bug] when sglang received over 16 concurrency request(means i create 16 thread to call the service all the time), it will return abnormal result, and in the log, will occur NaN
Issue -
State: closed - Opened by GGBond8488 20 days ago
- 5 comments
#1741 - Maintain seq_lens_sum to make more FlashInfer operations non-blocking
Pull Request -
State: closed - Opened by merrymercy 20 days ago
#1740 - Make token mapping non-blocking in the overlapped mode
Pull Request -
State: closed - Opened by merrymercy 20 days ago
#1739 - [Bug] Prefill OOM!
Issue -
State: closed - Opened by yichuan520030910320 20 days ago
- 2 comments
#1738 - Faster overlap mode scheduler
Pull Request -
State: closed - Opened by merrymercy 20 days ago
#1737 - misc: add CODEOWNERS
Pull Request -
State: closed - Opened by zhyncs 20 days ago
#1736 - Add GLM-4 TextGeneration Model support for SGLang
Pull Request -
State: closed - Opened by sixsixcoder 20 days ago
#1735 - Simplify batch result resolution
Pull Request -
State: closed - Opened by merrymercy 20 days ago
#1734 - Simplify the usage of device
Pull Request -
State: closed - Opened by merrymercy 20 days ago
#1733 - Add documentations for Installation
Pull Request -
State: closed - Opened by zhaochenyang20 20 days ago
- 3 comments
#1732 - [Feature] Cache-aware Data Parallel Router
Issue -
State: open - Opened by ByronHsu 20 days ago
Labels: feature
#1731 - Optimize ZMQ receive operations to reduce idle CPU usage
Pull Request -
State: closed - Opened by zyearw1024 21 days ago
- 1 comment
#1730 - [Bug] 100% CPU Usage When Idle in sglang
Issue -
State: closed - Opened by zyearw1024 21 days ago
- 1 comment
#1729 - [Bug][minimal reproducible demo] High variability across batch inference runs
Issue -
State: open - Opened by FredericOdermatt 21 days ago
- 6 comments
Labels: bug
#1728 - [LoRA, Performance] Add gemm expand triton kernel for multi-LoRA
Pull Request -
State: open - Opened by Ying1123 21 days ago
#1727 - [Bugfix] qwen2vl forward_extend
Pull Request -
State: closed - Opened by yizhang2077 21 days ago
#1726 - Split the overlapped version of TpModelWorkerClient into a separate file
Pull Request -
State: closed - Opened by merrymercy 21 days ago
#1725 - Temporarily skip the test_mixed_batch for QWen2VL
Pull Request -
State: closed - Opened by merrymercy 21 days ago
- 1 comment
#1724 - Unify the memory pool api and tp worker API
Pull Request -
State: closed - Opened by merrymercy 21 days ago
#1723 - docs: fix README
Pull Request -
State: closed - Opened by zhyncs 21 days ago
- 1 comment
#1722 - Update README.md
Pull Request -
State: closed - Opened by Ying1123 21 days ago
#1721 - Support qwen2 vl model
Pull Request -
State: closed - Opened by zhyncs 21 days ago
- 5 comments
#1720 - Update vllm to 0.6.3 (#1711)
Pull Request -
State: closed - Opened by zhyncs 21 days ago
#1719 - CPU Inference
Issue -
State: closed - Opened by JocelynPanPan 21 days ago
- 1 comment
#1718 - Simplify the interface of tp_worker
Pull Request -
State: closed - Opened by merrymercy 21 days ago
#1717 - Created SECURITY.md
Pull Request -
State: closed - Opened by NishantRana07 21 days ago
#1716 - Update readme and workflow
Pull Request -
State: closed - Opened by merrymercy 21 days ago
#1715 - [Feature] Cascade attention kernels
Issue -
State: open - Opened by merrymercy 22 days ago
- 2 comments
Labels: good first issue
#1714 - Release v0.3.4
Pull Request -
State: closed - Opened by merrymercy 22 days ago
#1713 - Update README.md
Pull Request -
State: closed - Opened by merrymercy 22 days ago
#1712 - Fix the race condition in overlap mode
Pull Request -
State: closed - Opened by merrymercy 22 days ago
#1711 - Update vllm to 0.6.3
Pull Request -
State: closed - Opened by ispobock 22 days ago
- 3 comments
#1710 - Fix `is_all_ready` for overlap copy
Pull Request -
State: closed - Opened by merrymercy 22 days ago
#1709 - Simplify the nan detection and greedy check in sampler
Pull Request -
State: closed - Opened by merrymercy 22 days ago
#1708 - Does frontend language support multi-image QA?
Issue -
State: closed - Opened by joeyy5588 22 days ago
- 3 comments
#1707 - Skip unnecessary penalizer
Pull Request -
State: closed - Opened by merrymercy 22 days ago
#1706 - Add grouped free operations
Pull Request -
State: closed - Opened by merrymercy 22 days ago
#1705 - Add dtype for more operations
Pull Request -
State: closed - Opened by merrymercy 23 days ago
#1704 - Simplify flashinfer utilities
Pull Request -
State: closed - Opened by merrymercy 23 days ago
#1703 - Fix regex and logprob conflicts when chunked prefilling
Pull Request -
State: closed - Opened by hnyls2002 23 days ago
#1702 - Fix mixed batch for multi modal models
Pull Request -
State: closed - Opened by merrymercy 23 days ago
#1701 - Fix engine unit test
Pull Request -
State: closed - Opened by merrymercy 23 days ago
#1700 - Fix failed ci tests on long prompts; Better error messages for embedding models
Pull Request -
State: closed - Opened by merrymercy 24 days ago
- 1 comment
#1699 - Fix the failed unit tests
Pull Request -
State: closed - Opened by merrymercy 24 days ago
#1698 - [Bug] AttributeError in `openai.Client` Embeddings API
Issue -
State: closed - Opened by tanzelin430 24 days ago
- 4 comments
#1697 - feat: radix tree code optimize
Pull Request -
State: closed - Opened by wxsms 24 days ago
- 1 comment
#1696 - Use SGLang imports for linear layer
Pull Request -
State: closed - Opened by janimo 24 days ago
#1695 - [Router] Implement router backbone
Pull Request -
State: closed - Opened by ByronHsu 24 days ago
- 1 comment
#1694 - ORJson. Faster Json serialization
Pull Request -
State: closed - Opened by michaelfeil 24 days ago
- 1 comment
#1693 - [Bug] crash about `c10d::ProcessGroupNCCL::WorkNCCL::checkTimeout`
Issue -
State: open - Opened by zeng-zc 24 days ago
- 2 comments
#1692 - [Bug] IndexError: Inconsistent batch_size and len(image_input)
Issue -
State: closed - Opened by OBJECT907 24 days ago
- 2 comments
Labels: high priority
#1691 - [Bug] deadlock or hang on Qwen2-7B models
Issue -
State: closed - Opened by zeng-zc 24 days ago
- 17 comments
#1690 - Update the transformers version in CI
Pull Request -
State: closed - Opened by merrymercy 24 days ago
#1689 - Update README.md
Pull Request -
State: closed - Opened by merrymercy 24 days ago
#1688 - add orjson for jsonresponse
Pull Request -
State: closed - Opened by michaelfeil 24 days ago
- 4 comments
#1687 - Launch a thread to overlap CPU and GPU
Pull Request -
State: closed - Opened by merrymercy 24 days ago
- 2 comments
#1686 - [Event] Add online meetup meeting link
Pull Request -
State: closed - Opened by Ying1123 24 days ago
#1685 - Fix srt dependency
Pull Request -
State: closed - Opened by ispobock 25 days ago
#1684 - Add matched_stop token or str to distinguish between eos or stop str finish_reason generation
Pull Request -
State: closed - Opened by g-drozdov 25 days ago
- 1 comment
#1683 - [Bug] ROCm6.1.2 sglang0.3.3 cuda graph coredump
Issue -
State: closed - Opened by linqingxu 25 days ago
- 7 comments
#1682 - Fixes for running reward model inference using sglang
Pull Request -
State: closed - Opened by corbt 25 days ago
- 3 comments
#1681 - Fix filter_batch function call
Pull Request -
State: closed - Opened by hnyls2002 25 days ago
#1680 - [Performance] Support `xgrammar` for faster constrained decoding
Pull Request -
State: closed - Opened by DarkSharpness 25 days ago
- 10 comments
#1679 - Add date to logging messages (#1623)
Pull Request -
State: closed - Opened by zeng-zc 25 days ago
#1678 - slides link to .pdf
Pull Request -
State: closed - Opened by ziliangpeng 25 days ago
- 2 comments
#1677 - Add a new event loop
Pull Request -
State: closed - Opened by merrymercy 25 days ago
#1676 - Add OLMo model
Pull Request -
State: closed - Opened by janimo 25 days ago
- 2 comments
#1674 - Fix memory leak during abort
Pull Request -
State: closed - Opened by merrymercy 26 days ago
#1673 - [Feature] Make vLLM optional in model code
Issue -
State: open - Opened by ByronHsu 26 days ago
Labels: wip
#1672 - Improve benchmark scripts
Pull Request -
State: closed - Opened by merrymercy 26 days ago
#1671 - [Minor] Add some utility functions
Pull Request -
State: closed - Opened by merrymercy 26 days ago
#1670 - [doc] improve engine doc and add to readme
Pull Request -
State: closed - Opened by ByronHsu 26 days ago
#1668 - [Feature] When will a version of S-Lora be available?
Issue -
State: open - Opened by kunkunzhang123 27 days ago
- 1 comment
#1667 - Simplify chunked prefill
Pull Request -
State: closed - Opened by merrymercy 27 days ago
#1666 - [Minor] Improve style
Pull Request -
State: closed - Opened by merrymercy 27 days ago
#1665 - Fix unit test order to balance the tasks in CI
Pull Request -
State: closed - Opened by merrymercy 27 days ago
#1664 - [Bug] difference of kv-cache-prefixing between vLLM and sglang
Issue -
State: closed - Opened by chenchunhui97 27 days ago
#1663 - Move filter_batch out of stream_output
Pull Request -
State: closed - Opened by merrymercy 27 days ago
- 2 comments
#1662 - Add a test case to test retract
Pull Request -
State: closed - Opened by merrymercy 27 days ago
#1661 - [Minor] Rename no_eos_trim to no_stop_trim
Pull Request -
State: closed - Opened by Ying1123 27 days ago
#1660 - docs: add zh_CN po files
Pull Request -
State: closed - Opened by llama-factory 27 days ago
- 4 comments
#1659 - Add output_ids into ScheduleBatch
Pull Request -
State: closed - Opened by merrymercy 27 days ago
#1658 - [1/N] Remove `CacheConfig` import in all model files
Pull Request -
State: closed - Opened by ByronHsu 27 days ago
#1657 - temp
Pull Request -
State: closed - Opened by yukavio 28 days ago
#1656 - [doc] Add engine section in backend.md
Pull Request -
State: closed - Opened by ByronHsu 28 days ago
- 1 comment
#1655 - [Feature] sanic Custom Server example support openai stream api ?
Issue -
State: closed - Opened by lys791227 28 days ago
- 3 comments
#1654 - Fix the batch_is_full check for jump-forward decoding
Pull Request -
State: closed - Opened by merrymercy 28 days ago
#1653 - Add get_tokenizer function for Engine class
Pull Request -
State: closed - Opened by pjyi2147 28 days ago
- 1 comment
#1652 - Simplify the event loop and expose `--num-continuous-decode-steps` as an argument
Pull Request -
State: closed - Opened by merrymercy 28 days ago
#1651 - Add an option to disable penalizer
Pull Request -
State: closed - Opened by merrymercy 28 days ago
#1650 - [Fix] fix eos trim inconsistency
Pull Request -
State: closed - Opened by Ying1123 28 days ago
#1649 - [Feature] Multi-instance deployment
Issue -
State: open - Opened by vkc1vk 28 days ago
- 3 comments
#1648 - Fix unit tests and type annotations
Pull Request -
State: closed - Opened by merrymercy 28 days ago
#1647 - docs: add zh_CN po files
Pull Request -
State: closed - Opened by llama-factory 29 days ago
#1646 - dead
Pull Request -
State: closed - Opened by llama-factory 29 days ago
#1645 - Fix ignore_eos in the OpenAI ChatCompletions API
Pull Request -
State: closed - Opened by merrymercy 29 days ago