[megatron] feat: support qwen2 megatron backend #261
Conversation
```diff
@@ -282,6 +282,7 @@ def mistral_megatron_weight_loader(actor_weights: Dict, vllm_model: nn.Module) -
     'LlamaForCausalLM': llama_megatron_weight_loader,  # use te backend for open-source megatron
     'LLaMAForCausalLM': llama_megatron_weight_loader,
     'MistralForCausalLM': mistral_megatron_weight_loader,
+    'Qwen2ForCausalLM': llama_megatron_weight_loader,
```
Did you test them all? Maybe only including v0.6.3 is sufficient.
Yes, I have tested the loaders on v0.4.2, v0.5.3, and v0.6.3, but the score was only tested on v0.6.3. Maybe I should remove it for v0.4.2 and v0.5.3?
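For context, here is a minimal sketch of how this registry-style dispatch works — illustrative only, with a stub loader and a hypothetical `load_megatron_weights` entry point rather than verl's actual internals. The dict maps the Hugging Face architecture name to a loader function, which is why `Qwen2ForCausalLM` can simply reuse the llama loader:

```python
from typing import Callable, Dict

import torch.nn as nn


def llama_megatron_weight_loader(actor_weights: Dict, vllm_model: nn.Module) -> nn.Module:
    """Stub: copy Megatron-format weights into the vLLM model in place."""
    # The real loader walks actor_weights and assigns each tensor to the
    # matching parameter of vllm_model.
    return vllm_model


# Architecture-name -> loader registry, as extended in the diff above.
# Qwen2 shares llama's parameter layout, so it reuses the same loader.
_LOADER_REGISTRY: Dict[str, Callable] = {
    'LlamaForCausalLM': llama_megatron_weight_loader,
    'Qwen2ForCausalLM': llama_megatron_weight_loader,
}


def load_megatron_weights(arch: str, actor_weights: Dict, vllm_model: nn.Module) -> nn.Module:
    """Hypothetical dispatch helper: look up the loader for an architecture."""
    if arch not in _LOADER_REGISTRY:
        raise ValueError(f'Megatron weight loading is not supported for {arch}')
    return _LOADER_REGISTRY[arch](actor_weights, vllm_model)
```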
```diff
@@ -0,0 +1,42 @@
+set -x
```
Could you later also add a section to https://github.com/volcengine/verl/blob/main/docs/experiment/ppo.rst, add a new table for MATH, and include the logs, command, and test score in the next PR?
Could you add Qwen2.5 0.5B to the CI?
Hi, Dude~ Thank you for your work. Is there any plan to support a saver for Qwen2?
For models that enable tie_word_embedding, we cannot get "lm_head".
Yes, I have noticed that. I am working on it.
It can now handle tie_word_embedding=True. The result will be updated in the next PR.
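A plausible way to handle that case — a minimal sketch, assuming Hugging Face-style parameter names; `get_lm_head_weight` is a hypothetical helper, not verl's actual code. When `tie_word_embeddings=True` the checkpoint stores no separate `lm_head.weight`, so the loader must fall back to the shared token-embedding matrix:

```python
from typing import Dict

import torch


def get_lm_head_weight(state_dict: Dict[str, torch.Tensor],
                       tie_word_embeddings: bool) -> torch.Tensor:
    """Hypothetical helper: resolve the output-projection weight.

    With tie_word_embeddings=True, the output projection shares storage with
    the input embedding, so no standalone 'lm_head.weight' exists.
    """
    if not tie_word_embeddings and 'lm_head.weight' in state_dict:
        return state_dict['lm_head.weight']
    # Tied case: reuse the token-embedding matrix.
    return state_dict['model.embed_tokens.weight']
```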
```python
    return layer_map


def merge_megatron_ckpt_llama(wrapped_models, config, is_value_model=False, dtype='bf16'):
```
The function name here seems to be a typo? (`llama` -> `qwen2`)
Yes, that is a typo. This PR does not support the saver yet, but it may support it in the future.
Support Qwen2 Megatron backend

The code is primarily adapted from the llama folder, with modifications to use QKV bias and to remove the rope_scaling of RoPE in `verl/models/qwen2/megatron/layers/parallel_attention.py`.
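To illustrate the two modifications — a minimal sketch, using plain `nn.Linear` stand-ins for Megatron's parallel projections; the class and parameter names are illustrative, not verl's actual layer. The substantive difference from llama is `bias=True` on the Q/K/V projections (and no `rope_scaling` applied to the rotary embedding):

```python
import torch.nn as nn


class Qwen2AttentionSketch(nn.Module):
    """Illustrative stand-in for verl's parallel attention layer.

    Relative to llama: the Q/K/V projections carry a bias, and the rotary
    embedding is built without llama's rope_scaling option. The forward pass
    is omitted; only the projection wiring is shown.
    """

    def __init__(self, hidden_size: int, num_heads: int, num_kv_heads: int):
        super().__init__()
        head_dim = hidden_size // num_heads
        qkv_out = (num_heads + 2 * num_kv_heads) * head_dim
        # Qwen2-specific: bias=True on the fused QKV projection.
        self.qkv_proj = nn.Linear(hidden_size, qkv_out, bias=True)
        # Output projection remains bias-free, as in llama.
        self.o_proj = nn.Linear(num_heads * head_dim, hidden_size, bias=False)
```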