Pull requests: flashinfer-ai/flashinfer

Pull requests list

feat: add trtllm-gen per-tensor sparseMla kernels.
#2138 opened Nov 24, 2025 by PerkzZheng
5 tasks done
fix: some bugs of headDim 256 trtllm-gen fmha kernels.
#2137 opened Nov 24, 2025 by PerkzZheng
5 tasks done
feat: add seed offset args to sampler to allow cuda graph support
#2132 opened Nov 23, 2025 by ksukrit
5 tasks done
perf: using multi-cta optimization for top-k/top-p
#2119 opened Nov 20, 2025 by yzh119
4 of 5 tasks
Refactor trtllm_mnnvl_allreduce
#2118 opened Nov 20, 2025 by timlee0212
5 tasks done
refactor: update fa3 codebase and fix hopper unittest [part 1]
#2111 opened Nov 19, 2025 by yzh119
4 of 5 tasks
feat: support more head dim in RoPE kernel
#2109 opened Nov 19, 2025 by raayandhar
5 tasks done
make DeepGEMM swapAB available for linear gemm SM90
#2101 opened Nov 17, 2025 by xuanzic
5 tasks
refactor: pass hopper deepgemm include directory through python
#2090 opened Nov 14, 2025 by yzh119
4 of 5 tasks
feat: add sink to flashinfer decode
#2087 opened Nov 13, 2025 by djmmoss
feat: BF16 GEMM using CUTLASS backend for SM100
#2070 opened Nov 10, 2025 by raayandhar
5 tasks done
Blockwise GEMM with all reduce overlapping
#2007 opened Oct 30, 2025 by Amir-19 Draft
5 tasks
chore: agentic workflow for automatic version bump
#1947 opened Oct 19, 2025 by yzh119
5 tasks
add blockwise gemm cute dsl
#1922 opened Oct 13, 2025 by Amir-19
5 tasks