Commit 6dfda81

clean code
Signed-off-by: Yu Gong <yu3.gong@gmail.com>
1 parent a1dd0d5 commit 6dfda81

1 file changed, +2 -2 lines changed

vllm/lora/layers/fused_moe.py

Lines changed: 2 additions & 2 deletions
@@ -136,7 +136,7 @@ def wrapper(*args, **kwargs):
     M = min(num_tokens, CHUNK_SIZE)

     shrink_config, expand_config = self._get_lora_moe_configs(
-        op_prefix="gate_up",
+        op_prefix="w13",
         lora_a_stacked=self.w1_lora_a_stacked,
         lora_b_stacked=self.w1_lora_b_stacked,
         num_slices=2,
@@ -214,7 +214,7 @@ def wrapper(*args, **kwargs):
     M = min(num_tokens, CHUNK_SIZE)

     shrink_config, expand_config = self._get_lora_moe_configs(
-        op_prefix="down",
+        op_prefix="w2",
         lora_a_stacked=self.w2_lora_a_stacked,
         lora_b_stacked=self.w2_lora_b_stacked,
         num_slices=1,
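
Note on the rename: both hunks change only the string passed as op_prefix, so any caller or stored setting that still uses the old names would need translating. The sketch below is a hypothetical helper, not part of vLLM or of this commit. It assumes "gate_up" corresponds to the two-slice w1 path and "down" to the single-slice w2 path, as the surrounding arguments in the diff suggest. The names LEGACY_TO_NEW_OP_PREFIX and normalize_op_prefix are made up for illustration.

```python
# Hypothetical helper, not part of vLLM: translates the op_prefix values
# removed in this commit into the ones that replace them.
LEGACY_TO_NEW_OP_PREFIX = {
    "gate_up": "w13",  # call site with w1_lora_*_stacked and num_slices=2
    "down": "w2",      # call site with w2_lora_*_stacked and num_slices=1
}


def normalize_op_prefix(op_prefix: str) -> str:
    """Return the new-style prefix, accepting either naming scheme."""
    if op_prefix in LEGACY_TO_NEW_OP_PREFIX:
        return LEGACY_TO_NEW_OP_PREFIX[op_prefix]
    if op_prefix in LEGACY_TO_NEW_OP_PREFIX.values():
        return op_prefix  # already using the new naming
    raise ValueError(f"unknown op_prefix: {op_prefix!r}")
```

Whether _get_lora_moe_configs uses the prefix for anything beyond selecting a config is not visible in this diff, so treat the mapping as a reading aid rather than a migration tool.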
