-
Notifications
You must be signed in to change notification settings - Fork 130
Pull requests: vllm-project/llm-compressor
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix: Improve
SmoothQuant
Support for Mixture of Experts (MoE) Models
#1455
opened May 20, 2025 by
rahul-tuli
Loading…
Disable kernels during calibration (and tracing)
ready
When a PR is ready for review
#1454
opened May 20, 2025 by
kylesayrs
Loading…
[GPTQ] Fix actorder resolution, add sentinel
ready
When a PR is ready for review
#1453
opened May 20, 2025 by
kylesayrs
Loading…
AWQ Apply Scales Bugfix when smooth layer output length doesn't match balance layer input length
ready
When a PR is ready for review
#1451
opened May 19, 2025 by
brian-dellabetta
Loading…
[Observer] Optimize mse observer
ready
When a PR is ready for review
#1450
opened May 19, 2025 by
shanjiaz
Loading…
Fix missing logs when calling oneshot
ready
When a PR is ready for review
#1446
opened May 19, 2025 by
kelkelcheng
Loading…
oneshot entrypoint update
ready
When a PR is ready for review
#1445
opened May 17, 2025 by
ved1beta
Loading…
AWQModifier fast resolve mappings, better logging
#1444
opened May 16, 2025 by
brian-dellabetta
•
Draft
1 task
AWQ Qwen and Phi mappings
ready
When a PR is ready for review
#1440
opened May 16, 2025 by
brian-dellabetta
Loading…
1 task
Initial implementation for the docs site and setup for LLM Compressor
#1436
opened May 15, 2025 by
markurtz
Loading…
Use model compression pathways
ready
When a PR is ready for review
#1419
opened May 8, 2025 by
kylesayrs
Loading…
Add warning for non-divisible group quantization
ready
When a PR is ready for review
#1401
opened Apr 29, 2025 by
kylesayrs
Loading…
[Tracing] Raise When a PR is ready for review
_is_compiling_flag
while tracing
ready
#1388
opened Apr 27, 2025 by
kylesayrs
Loading…
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.