Skip to content

Commit db85cb5

Browse files
zRzRzRzRzRzRzRepwalsh
authored andcommitted
fuse fp32 for GLM-4.5 e_score_correction_bias (vllm-project#22143)
Signed-off-by: zRzRzRzRzRzRzR <2448370773@qq.com>
1 parent 0ce6ddc commit db85cb5

File tree

1 file changed

+2
-3
lines changed

1 file changed

+2
-3
lines changed

vllm/model_executor/models/glm4_moe.py

Lines changed: 2 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -125,9 +125,8 @@ def __init__(
125125
quant_config=None,
126126
prefix=f"{prefix}.gate")
127127

128-
# noaux_tc is not set in transformers new config now
129-
self.gate.e_score_correction_bias = (nn.Parameter(
130-
torch.empty(config.n_routed_experts)))
128+
self.gate.e_score_correction_bias = nn.Parameter(
129+
torch.empty(config.n_routed_experts, dtype=torch.float32))
131130

132131
# Load balancing settings.
133132
vllm_config = get_current_vllm_config()

0 commit comments

Comments
 (0)