You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Refactor MoE parameter initialization for flexibility
Pass the Hugging Face configuration object directly to the MoE communication method constructor. This allows the method to handle different attribute names for MoE parameters, such as `num_experts` and `n_routed_experts`.
This change improves robustness and makes the implementation more compatible with various MoE model configurations.
Signed-off-by: Yizhou Liu <liu_yizhou@outlook.com>
0 commit comments