Commit 56bebc5

Apply suggestions from code review
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
1 parent 8495f2a commit 56bebc5

1 file changed: +1 -2 lines changed

docs/training/trl.md

Lines changed: 1 addition & 2 deletions
@@ -21,10 +21,9 @@ To enable vLLM in TRL, set the `use_vllm` flag in the trainer configuration to `
 
 Some trainers also support **vLLM sleep mode**, which offloads parameters and caches to GPU RAM during training, helping reduce memory usage. Learn more in the [memory optimization docs](https://huggingface.co/docs/trl/main/en/reducing_memory_usage#vllm-sleep-mode).
 
-
-
 !!! info
     For more information on the `use_vllm` flag you can provide to the configs of these online methods, see:
+
     - [`trl.GRPOConfig.use_vllm`](https://huggingface.co/docs/trl/main/en/grpo_trainer#trl.GRPOConfig.use_vllm)
     - [`trl.OnlineDPOConfig.use_vllm`](https://huggingface.co/docs/trl/main/en/online_dpo_trainer#trl.OnlineDPOConfig.use_vllm)
     - [`trl.RLOOConfig.use_vllm`](https://huggingface.co/docs/trl/main/en/rloo_trainer#trl.RLOOConfig.use_vllm)
