-
Notifications
You must be signed in to change notification settings - Fork 3.9k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[do not merge] add AMD flashinfer for diffusion tests
amd
#16116
opened Dec 30, 2025 by
IPostYellow
Loading…
6 tasks
[Ascend]bugfix: Qwen3 encountered an error when enabling the LM-head
#16115
opened Dec 30, 2025 by
chenxu214
Loading…
6 tasks
[WIP][diffusion] new model: support Wan Animate with Data preprocessing
diffusion
SGLang Diffusion
npu
#16113
opened Dec 30, 2025 by
tom-jerr
Loading…
2 of 6 tasks
e2e Sonic-MoE support
dependencies
Pull requests that update a dependency file
quant
LLM Quantization
fix npu ci dataset load issue
npu
run-ci
#16111
opened Dec 30, 2025 by
iforgetmyname
Loading…
6 tasks
dp-attention: add follow_bootstrap_room + auto load-balance; drop decode_round_robin
documentation
Improvements or additions to documentation
#16110
opened Dec 30, 2025 by
mufeez-amjad
Loading…
5 of 6 tasks
Slightly improve deepgemm latency in normal dispatch on hopper
#16109
opened Dec 29, 2025 by
vincentzed
•
Draft
6 tasks
Refactor: Moving
extend_logprob_start_len calculation out of prepare_for_extend
run-ci
#16105
opened Dec 29, 2025 by
ch-wan
Loading…
6 tasks
perf(vision): avoid GPU→CPU sync on attention mask cache hits
Multi-modal
multi-modal language model
#16104
opened Dec 29, 2025 by
tom-doerr
Loading…
2 tasks
Fix KeyError when logprobs=false in completions endpoint
high priority
run-ci
#16095
opened Dec 29, 2025 by
haikux
Loading…
2 of 6 tasks
[model-gateway] Add embedding correctness test comparing against HuggingFace
model-gateway
run-ci
#16092
opened Dec 29, 2025 by
slin1237
Loading…
6 tasks
[Tool Call] Stream DeepSeek-V3.2 function call parameters in JSON format.
deepseek
#16091
opened Dec 29, 2025 by
Muqi1029
Loading…
2 of 6 tasks
[WIP] Improve multimodallm perfermance
documentation
Improvements or additions to documentation
Multi-modal
multi-modal language model
npu
#16090
opened Dec 29, 2025 by
yhyang201
Loading…
6 tasks
[model-gateway]: move PD configuration conflict checks to model gateway
model-gateway
#16088
opened Dec 29, 2025 by
Ratish1
Loading…
3 of 6 tasks
[3/N][Sparse With Hicache]: Init sparse coordinator
#16086
opened Dec 29, 2025 by
hzh0425
Loading…
6 tasks
Fix: Parallel sampling log requests TypeError
#16082
opened Dec 29, 2025 by
Leoyzen
Loading…
2 of 6 tasks
[diffusion] improve: tiny improve layerwise offload manager by consolidating weights per layer
diffusion
SGLang Diffusion
run-ci
#16081
opened Dec 29, 2025 by
mickqian
Loading…
6 tasks
[Performance] Change sparse MLA and dense MHA switching threshold DSv3.2
#16079
opened Dec 29, 2025 by
zhangxiaolei123456
Loading…
6 tasks
[http-server] Add kv cache info to model_info endpoint
#16077
opened Dec 29, 2025 by
peleg-yair
Loading…
2 of 6 tasks
[diffusion] pipeline: free DiT GPU memory after denoising
diffusion
SGLang Diffusion
#16074
opened Dec 29, 2025 by
6somehow
Loading…
3 of 5 tasks
[Schedule] dp scheduler enhancer support with chunked prefill
#16073
opened Dec 29, 2025 by
liupeng374
Loading…
1 of 6 tasks
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-11-29.