-
Notifications
You must be signed in to change notification settings - Fork 400
Pull requests: vllm-project/llm-compressor
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Bugfix] Reduce device movement while checking layer divisibility
#2385
opened Feb 18, 2026 by
kylesayrs
Loading…
perf: make MSE observer compatible with torch.compile (39x speedup)
quality-failed
#2384
opened Feb 18, 2026 by
Bias92
Loading…
feat: add Qwen3.5 MoE calibration module
documentation
Improvements or additions to documentation
nvfp4
For any PR / issue related to NVFP4 support
quality-failed
qwen
For any PR / issue related to Qwen support
ready
When a PR is ready for review
#2383
opened Feb 18, 2026 by
Sehyo
Loading…
[Offloading] Support Disk Offloading
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2373
opened Feb 17, 2026 by
kylesayrs
Loading…
[GPTQ] Move modifier to top-level for consistent folder structure
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2368
opened Feb 16, 2026 by
dik654
Loading…
add qwen3 vl autoround example
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2357
opened Feb 12, 2026 by
xin3he
Loading…
Add model_free_ptq example for glm 4.6 block fp8
documentation
Improvements or additions to documentation
#2343
opened Feb 10, 2026 by
mgoin
Loading…
[MoE] MiniMax-M2/M2.1 calibration follow-up
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2335
opened Feb 6, 2026 by
LudovicoYIN
Loading…
[GPTQ][ddp] enabling DDP for GPTQ
dist
Work pertaining to distributed work
documentation
Improvements or additions to documentation
enhancement
New feature or request
gptq
For any PR / issue related to GPTQ support
quality-failed
ready
When a PR is ready for review
#2333
opened Feb 6, 2026 by
HDCharles
Loading…
Add GSM8K evaluation script and AWQ+FP8 results
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2330
opened Feb 4, 2026 by
rtj1
Loading…
[AWQ] Add option to consider smooth layer quantization in scale search
needs-rebase
#2323
opened Jan 31, 2026 by
Ramshankar07
Loading…
Benchmark torch.compile optimization for quantization
ready
When a PR is ready for review
#2320
opened Jan 31, 2026 by
colldata79
Loading…
Add AFMOE mappings for awq and smoothquant
needs-rebase
ready
When a PR is ready for review
#2316
opened Jan 30, 2026 by
bartowski1182
Loading…
move smoothquant to transforms
documentation
Improvements or additions to documentation
needs-rebase
ready
When a PR is ready for review
#2314
opened Jan 30, 2026 by
Etelis
Loading…
Refactor Matching Logic to Use compressed-tensors Utilities
needs-rebase
ready
When a PR is ready for review
#2284
opened Jan 24, 2026 by
Etelis
Loading…
[Observers] Allow for case when weight shape and block size are not evenly divisble
ready
When a PR is ready for review
#2283
opened Jan 23, 2026 by
brian-dellabetta
Loading…
2 tasks done
[Docs][Examples] Add MoE Guide and remove finetune examples
documentation
Improvements or additions to documentation
needs-rebase
ready
When a PR is ready for review
#2281
opened Jan 23, 2026 by
dsikka
Loading…
Previous Next
ProTip!
What’s not been updated in a month: updated:<2026-01-18.