Skip to content

Pull requests: Andyyyy64/whichllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix: filter model search by parameter count instead of substring for size tokens
#126 opened Jun 20, 2026 by devangpratap Contributor Loading…
1 of 3 tasks
fix: size FP16 and ternary (TQ1_0/TQ2_0) GGUF quant types correctly
#125 opened Jun 20, 2026 by SuperMarioYL Contributor Loading…
feat(vram): model sliding-window attention in KV cache estimation
#124 opened Jun 19, 2026 by SuperMarioYL Contributor Loading…
ProTip! Follow long discussions with comments:>50.