Commit 23be767

correct jitter pointed out by Yelong
1 parent 3ef2172 commit 23be767

1 file changed, +1 -1 lines changed

vllm/model_executor/models/mixtral.py

Lines changed: 1 addition & 1 deletion
@@ -166,7 +166,7 @@ def backward(
         )
 
 
-def sparsemixer(scores, top_k, jitter_eps=0.1):
+def sparsemixer(scores, top_k, jitter_eps=0.01):
     assert top_k == 2
 
     ################ first expert ################
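
The hunk changes only the default value of `jitter_eps` and does not show how the function body uses it. As a rough illustration, the sketch below assumes a relative-gap rule of the kind used in published SparseMixer-style routers: an expert is masked out when `(max - s) / max(|s|, max)` exceeds `2 * jitter_eps`. The helper name `jitter_mask`, the rule itself, and the example scores are assumptions for illustration, not code from this commit.

```python
def jitter_mask(scores, jitter_eps):
    """Return True for each expert the jitter threshold would mask out.

    Hypothetical helper. Assumed rule (not shown in the diff):
        (top - s) / max(|s|, top) > 2 * jitter_eps
    """
    top = max(scores)
    mask = []
    for s in scores:
        factor = max(abs(s), top)  # clamp |s| from below by the top score
        # Guard the all-zero case so the sketch never divides by zero.
        gap = 0.0 if factor == 0 else (top - s) / factor
        mask.append(gap > 2 * jitter_eps)
    return mask


scores = [2.0, 1.9, 1.0]  # made-up router scores for one token
old_mask = jitter_mask(scores, jitter_eps=0.1)   # default before this commit
new_mask = jitter_mask(scores, jitter_eps=0.01)  # default after this commit
print(old_mask)
print(new_mask)
```

Under that assumed rule, the second expert (relative gap 0.05) stays eligible with `jitter_eps=0.1` but is masked with `jitter_eps=0.01`, so the smaller default leaves a much narrower band of near-top experts in play.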
