I would also like to ask whether others have encountered memory leaks. When fine-tuning the RL-based models, GPU memory usage grows over time; for example, two hours after training starts, the memory allocated on the GPU has noticeably increased.
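For anyone trying to confirm this, here is a minimal logging sketch (assuming PyTorch; where you call it inside the training loop is up to you):

```python
import torch

def log_gpu_memory(step: int) -> None:
    # allocated = live tensors; reserved = the allocator's cache.
    # An allocated figure that climbs steadily across steps suggests
    # tensors (e.g., rollout buffers kept on GPU) are never freed.
    allocated = torch.cuda.memory_allocated() / 1024**3
    reserved = torch.cuda.memory_reserved() / 1024**3
    print(f"step {step}: allocated={allocated:.2f} GiB, reserved={reserved:.2f} GiB")
```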
System Info
How much GPU memory is required to train a 7B reasoning model together with a 7B PRM? In my tests, an 80 GB card runs out of memory. Is multi-GPU training supported?
Who can help?
@ziyuwan
Information
Tasks
Reproduction
python -u train_math.py \
    --dataset_path "./math_500.jsonl" \
    --model_name_or_path "./deepseek-math-7b-instruct" \
    --prm_model_name_or_path "./math-shepherd-mistral-7b-prm" \
    --algorithm_name "APPO" \
    --num_mini_batch 4 \
    --ppo_epoch 1
The base model is deepseek-math-7b-instruct, and the PRM is math-shepherd-mistral-7b-prm.
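A back-of-envelope estimate of why 80 GB overflows (my assumptions: bf16 weights, Adam with fp32 master weights and moments, PRM frozen for inference; the actual trainer may differ):

```python
PARAMS = 7e9  # 7B parameters

# Trainable policy: bf16 weights + bf16 grads + fp32 Adam master/m/v
policy_bytes = PARAMS * 2 + PARAMS * 2 + PARAMS * 4 * 3
# Frozen PRM held in bf16 for scoring only
prm_bytes = PARAMS * 2

print(f"policy ≈ {policy_bytes / 1024**3:.0f} GiB")      # ~104 GiB
print(f"frozen PRM ≈ {prm_bytes / 1024**3:.0f} GiB")     # ~13 GiB
```

Even before activations and the KV cache used during rollouts, this already exceeds 80 GB, which would be consistent with the OOM.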
Expected behavior
Run RL training of the 7B base model on a single 80 GB GPU or across multiple GPUs.
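I don't know whether train_math.py exposes a multi-GPU option, but as a workaround sketch (assuming a standard transformers setup, with the model paths from above), the frozen PRM can at least be placed on a second card so the policy has a full 80 GB to itself:

```python
import torch
from transformers import AutoModelForCausalLM

# Trainable policy on GPU 0
policy = AutoModelForCausalLM.from_pretrained(
    "./deepseek-math-7b-instruct", torch_dtype=torch.bfloat16
).to("cuda:0")

# Frozen PRM on GPU 1, used for scoring only
prm = AutoModelForCausalLM.from_pretrained(
    "./math-shepherd-mistral-7b-prm", torch_dtype=torch.bfloat16
).to("cuda:1")
prm.eval()
for p in prm.parameters():
    p.requires_grad_(False)
```

Reward scoring would then run under torch.no_grad() on cuda:1, with inputs moved to that device before the forward pass.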