Skip to content

Pull requests: flashinfer-ai/flashinfer

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

bugfix: skip CUTLASS kernel generation when AOT cache exists
#2248 opened Dec 19, 2025 by yongwww Loading…
5 tasks
feat: Support numLocalTokens=0 for moe All-to-all
#2247 opened Dec 19, 2025 by trevor-m Loading…
5 tasks
Fp8 attention are now part of cuDNN 9.17.1
#2241 opened Dec 18, 2025 by Anerudhan Draft
5 tasks done
agent: add CLAUDE.md and claude skills
#2240 opened Dec 18, 2025 by yzh119 Loading…
5 tasks done
fix: Handle zeros in Mistral Large 3 MoE inference
#2238 opened Dec 18, 2025 by dbari Draft
8 of 9 tasks
cicd / testing: Add xfails tracker script
#2227 opened Dec 16, 2025 by kahyunnam Loading…
5 tasks done
misc: support checks unit test tracking
#2224 opened Dec 16, 2025 by jimmyzho Loading…
5 tasks
refactor: update fa3 codebase [part 2]
#2192 opened Dec 9, 2025 by yzh119 Loading…
4 of 5 tasks
Add CUDA graph buffers for persistent attention
#2185 opened Dec 7, 2025 by Edenzzzz Loading…
5 tasks
Fix/moe_sm110 (to be tested)
#2183 opened Dec 6, 2025 by aleozlx Draft
5 tasks
Enable Hopper FA3 FP8 attention in decode.py
#2148 opened Nov 28, 2025 by nvpohanh Loading…
5 tasks done
feat: add sink to flashinfer decode
#2087 opened Nov 13, 2025 by djmmoss Loading…
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar Loading…
5 tasks done
Blockwise GEMM with all reduce overlapping
#2007 opened Oct 30, 2025 by Amir-19 Draft
5 tasks
chore: agentic workflow for automatic version bump
#1947 opened Oct 19, 2025 by yzh119 Loading…
5 tasks
add blockwise gemm cute dsl
#1922 opened Oct 13, 2025 by Amir-19 Loading…
5 tasks
Sampling non contiguous
#1916 opened Oct 12, 2025 by zcin Loading…
5 tasks done
ProTip! What’s not been updated in a month: updated:<2025-11-20.