-
Notifications
You must be signed in to change notification settings - Fork 208
Pull requests: Luce-Org/lucebox-hub
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix(server): Qwen3.6-27B tool calling for claude-code Anthropic path
#276
opened May 25, 2026 by
dusterbloom
Contributor
Loading…
5 of 7 tasks
feat(drafter): ee3 as production default (depends on #274)
#275
opened May 24, 2026 by
dusterbloom
Contributor
Loading…
feat(pFlash): ee7 early-exit drafter saves up to 9.3× drafter wall at 128K
#274
opened May 24, 2026 by
dusterbloom
Contributor
Loading…
feat(cpp-server): thinking-budget v2 + multi-dialect reasoning aliases + spec
#269
opened May 23, 2026 by
easel
Contributor
Loading…
6 tasks done
feat(harness): typed adapters + format-aware session-inject proxy + multi-turn bandit driver
#266
opened May 23, 2026 by
dusterbloom
Contributor
Loading…
feat(pflash): adaptive keep_ratio bandit MVP
#264
opened May 23, 2026 by
dusterbloom
Contributor
Loading…
feat(server): add mixed-backend PFlash phase split
#263
opened May 23, 2026 by
weicj
Contributor
Loading…
Split MoE weights between CPU & CUDA, support qwen35moe models
#262
opened May 23, 2026 by
howard0su
Contributor
Loading…
1 of 3 tasks
mtp: prefix-cache WARM hit (perfect + partial via range-warm)
#221
opened May 18, 2026 by
dusterbloom
Contributor
Loading…
feat(gemma4): feature-complete backend with DFlash + MTP + sparse-FA decode (supersedes PR #175 skeleton)
#193
opened May 14, 2026 by
dusterbloom
Contributor
•
Draft
feat(gemma4): target-graph MTP integration (h_prev capture + asymmetric KV)
#183
opened May 13, 2026 by
dusterbloom
Contributor
Loading…
feat(gemma4): add mtp loader and step graph
#182
opened May 13, 2026 by
dusterbloom
Contributor
Loading…
feat(gemma4): add draft loader and quantization support
#180
opened May 13, 2026 by
dusterbloom
Contributor
Loading…
fix(gemma4): add long-context KV correctness
#177
opened May 13, 2026 by
dusterbloom
Contributor
Loading…
docs(dflash): document small-vram cuda vmm guidance
#174
opened May 13, 2026 by
dusterbloom
Contributor
Loading…
feat(dflash): linear native MTP integrated decode CLI (stacked on #153)
#154
opened May 11, 2026 by
javierpazo
Contributor
Loading…
feat(dflash): native Qwen3.6 MTP (NextN) runtime + contract test
#153
opened May 11, 2026 by
javierpazo
Contributor
Loading…
feat(dflash): accept FP16 safetensors drafter alongside BF16
#142
opened May 9, 2026 by
javierpazo
Contributor
Loading…
Previous Next
ProTip!
no:milestone will show everything without a milestone.