Conversation

@delock
Collaborator

@delock delock commented May 19, 2025

This PR fixes #7275, enabling Qwen3 meta-device loading for AutoTP.

@delock delock requested review from hwchen2017 and loadams as code owners May 19, 2025 02:25
@delock delock force-pushed the gma/enable_qwen3_meta branch from 169e1a6 to 268c8e1 Compare May 19, 2025 02:32
Signed-off-by: Ma, Guokai <guokai.ma@intel.com>
@loadams loadams added this pull request to the merge queue May 19, 2025
Merged via the queue into master with commit 80bc7b7 May 19, 2025
11 of 13 checks passed
@loadams loadams deleted the gma/enable_qwen3_meta branch May 19, 2025 18:09
deepcharm pushed a commit to deepcharm/DeepSpeed that referenced this pull request Jun 16, 2025
This PR fixes deepspeedai#7275 to enable Qwen3 meta loading for AutoTP

Signed-off-by: Ma, Guokai <guokai.ma@intel.com>
Signed-off-by: Max Kovalenko <mkovalenko@habana.ai>
mauryaavinash95 pushed a commit to DataStates/DeepSpeed that referenced this pull request Oct 4, 2025
This PR fixes deepspeedai#7275 to enable Qwen3 meta loading for AutoTP

Signed-off-by: Ma, Guokai <guokai.ma@intel.com>
Development

Successfully merging this pull request may close these issues.

[BUG] Qwen3: model loading failed when using meta device

3 participants