Skip to content

Commit cddd0da

Browse files
TheEpicDolphinamd-xiaoyu12
authored andcommitted
[V1] reduce block size for tree attention correctness test to fix 'ou… (vllm-project#22207)
Signed-off-by: Giancarlo Delfin <gdelfin@meta.com> Signed-off-by: Xiao Yu <xiao.yu@amd.com>
1 parent 2a8f51d commit cddd0da

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

tests/v1/spec_decode/test_tree_attention.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -155,7 +155,7 @@ def test_tree_attn_correctness() -> None:
155155

156156
dim_per_head = 128
157157
num_kv_heads = 2
158-
block_size = 128
158+
block_size = 32
159159
max_sequence_length = 8192
160160
randomize_blocks = True
161161
for batch_size in [1, 16, 32]:

0 commit comments

Comments
 (0)