Skip to content

Commit f314fe4

Browse files
author
ilmarkov
committed
Update min sizes
Signed-off-by: ilmarkov <imarkov@redhat.com>
1 parent 75d3df1 commit f314fe4

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

vllm/distributed/device_communicators/custom_all_reduce.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -72,10 +72,10 @@ class CustomAllreduce:
7272
MB = 1024 * 1024
7373
_QR_MIN_SIZE = {
7474
(torch.float16, 2): [1 * MB, 2 * MB, 2 * MB, 1 * MB],
75-
(torch.float16, 4): [1 * MB, 64 * MB, 4 * MB, 2 * MB],
75+
(torch.float16, 4): [1 * MB, 16 * MB, 4 * MB, 2 * MB],
7676
(torch.float16, 8): [16 * MB, 4 * MB, 4 * MB, 2 * MB],
7777
(torch.bfloat16, 2): [2 * MB, 8 * MB, 8 * MB, 8 * MB],
78-
(torch.bfloat16, 4): [8 * MB, 128 * MB, 128 * MB, 16 * MB],
78+
(torch.bfloat16, 4): [8 * MB, 64 * MB, 64 * MB, 16 * MB],
7979
(torch.bfloat16, 8): [16 * MB, 2048 * MB, 2048 * MB, 2048 * MB],
8080
}
8181

0 commit comments

Comments
 (0)