-
Notifications
You must be signed in to change notification settings - Fork 836
Pull requests: flashinfer-ai/flashinfer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Update Docker CI tags to 20260401-2c675fb
automated
docker
#2936
opened Apr 1, 2026 by
flashinfer-bot
Loading…
fix: vectorize get_shuffle_matrix_a_row_indices to eliminate CPU contention
run-ci
#2935
opened Apr 1, 2026 by
youkaichao
Loading…
feat: SM121 (GB10) tile filtering and autotuner robustness
run-ci
#2927
opened Mar 31, 2026 by
askliar
Loading…
2 tasks
doc: add CI triggering guide to CONTRIBUTING.md
#2924
opened Mar 31, 2026 by
yongwww
Loading…
5 tasks
ci: narrow skip scope in test_fmha_v2_prefill.py
op: attention
run-ci
#2922
opened Mar 31, 2026 by
bobboli
Loading…
feat: add delimiterless multi-item scoring (MaskMode::kMultiItemScoringV2)
#2921
opened Mar 31, 2026 by
chanh
Loading…
3 tasks
docs: document replay command in CLI reference
#2919
opened Mar 31, 2026 by
ooooo-create
Loading…
3 of 5 tasks
feat: Add cuBLASLt backend for
mm_bf16 and enable multi-tactic autotuning for FP8/MXFP8 runners
op: gemm
#2914
opened Mar 30, 2026 by
vadiklyutiy
Loading…
feat(moe): Add MxInt4 x FP8 grouped GEMM support for trtllm-gen fused MoE
#2912
opened Mar 29, 2026 by
StudyingShao
Loading…
5 tasks done
fix: segfault using packed topk_id/weight as trtllm_bf16_routed_moe input in DeepSeek routing
#2911
opened Mar 29, 2026 by
rosenrodt
Loading…
2 of 5 tasks
Yanqinz/dynamic shape unified api
op: gemm
run-ci
#2910
opened Mar 29, 2026 by
yanqinz2
Loading…
3 of 5 tasks
feat(gdn): state checkpointing in chunk_gated_delta_rule
#2908
opened Mar 28, 2026 by
feldsherov
Loading…
4 of 5 tasks
fix(moe): make hidden_states_scale optional in trtllm_fp4_block_scale_moe
#2906
opened Mar 28, 2026 by
kuttivicky
Loading…
1 of 5 tasks
feat(gdn): separate input and output pool indices
#2905
opened Mar 28, 2026 by
feldsherov
Loading…
4 of 5 tasks
perf: Optimize CuTe-DSL fp4 and fp8 quantization kernels
#2904
opened Mar 27, 2026 by
bkryu
Loading…
3 of 5 tasks
fix: avoid re-downloading BMM export headers when flashinfer-cubin is installed
op: moe
run-ci
#2903
opened Mar 27, 2026 by
yzh119
Loading…
1 of 2 tasks
feat: add MXFP8 GEMM support for SM120
op: gemm
run-ci
#2902
opened Mar 27, 2026 by
samuellees
Loading…
5 tasks done
bench: fix GPU power throttling in benchmark utilities
#2899
opened Mar 26, 2026 by
Edenzzzz
Loading…
4 of 5 tasks
fix: snap weight_scale_vec_size to handle block_scale_interleave padding for SM120
op: moe
run-ci
#2898
opened Mar 26, 2026 by
samuellees
Loading…
3 tasks done
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.