Commit 0d5c474

comments
Signed-off-by: Matthew Bonanni <mbonanni@redhat.com>
1 parent 1f0e776 commit 0d5c474

2 files changed: +1 -4 lines changed

benchmarks/attention_benchmarks/configs/cutlass_numsplits.yaml

Lines changed: 1 addition & 3 deletions
@@ -1,6 +1,4 @@
-# Study 1: Should we revert CUTLASS MLA num-splits heuristic?
-# Question: What is the optimal num_kv_splits for different batch sizes?
-# Related PRs: #24966, #25509
+# Study 1: What is the optimal CUTLASS_MLA num_kv_splits for different batch sizes?
 
 description: "CUTLASS MLA num-splits optimization study"
 

benchmarks/attention_benchmarks/configs/flashinfer_vs_cutlass.yaml

Lines changed: 0 additions & 1 deletion
@@ -1,5 +1,4 @@
 # Study 3: Is FlashInfer-MLA better than CUTLASS MLA after num-splits optimization?
-# Question: After optimizing CUTLASS MLA's num_kv_splits, is FlashInfer-MLA still competitive?
 
 description: "FlashInfer-MLA vs optimized CUTLASS MLA comparison"
 
