docs: Expand training customization examples #4427

behroozazarkhalili · 2025-11-02T20:29:27Z

Summary

This PR addresses issue #4379 by expanding the Training Customization documentation section with 5 new comprehensive examples, rather than removing it.

Resolves #4379

Changes Made

New Examples Added (5):

Custom Callbacks - Shows how to add custom callbacks for logging, monitoring, or early stopping
Custom Evaluation Metrics - Demonstrates defining custom metrics to track during training
Mixed Precision Training - Explains bf16/fp16 usage for speed and memory optimization
Gradient Accumulation - Shows how to simulate larger batch sizes with limited GPU memory
Custom Data Collator - Demonstrates custom data preprocessing and padding strategies

Documentation Improvements:

Updated introduction for better clarity and consistency
All examples follow the same pattern as existing examples
All code examples verified against the codebase
Proper imports and configuration options validated

Statistics

Original examples: 5
New examples: 5
Total examples: 10 (doubled!)
Lines added: ~150

Verification

✅ All imports verified against codebase
✅ All config options verified in DPOConfig
✅ DataCollatorForPreference import path corrected
✅ Consistent code style with existing examples
✅ Examples apply to most/all trainers as stated

Test Plan

Verified all imports exist in the codebase
Validated config parameters against DPOConfig
Ensured consistent formatting with existing examples
Checked that examples follow DPOTrainer pattern as stated in intro

Resolves huggingface#4379 - Add custom callbacks example for logging and monitoring - Add custom evaluation metrics example - Add mixed precision training example (bf16/fp16) - Add gradient accumulation example - Add custom data collator example - Update introduction for better clarity

HuggingFaceDocBuilderDev · 2025-11-02T20:32:09Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

docs/source/customization.md

@qgallouedec

- Clarify that bf16 is the default in mixed precision section - Move gradient accumulation section to reducing memory guide - Expand gradient accumulation examples to include DPO, SFT, and Reward trainers Addresses review comments from @qgallouedec on PR #4427

behroozazarkhalili · 2025-11-03T18:06:30Z

I've addressed both review comments:

Mixed precision section: Added clarification that bf16=True is the default in TRL. Updated the example to show when/how to override defaults for older GPUs or to disable mixed precision.
Gradient accumulation section: Moved from customization guide to the reducing memory usage guide (reducing_memory_usage.md), as it's primarily a memory optimization technique. Expanded the examples to include DPO, SFT, and Reward trainers.

Ready for re-review!

sergiopaniego

Thanks!!
I'd aim on reducing the code snippets content, only including the relevant part. Otherwise it's complicated to understand what's been added. A good example of this is the subsection Use the accelerator cache optimizer, where it directly points to the added param.
Btw the optimize_device_cache is no longer part of the codebase so that subsection can actually be removed :)

qgallouedec reviewed Nov 3, 2025

View reviewed changes

docs/source/customization.md Show resolved Hide resolved

qgallouedec reviewed Nov 3, 2025

View reviewed changes

docs/source/customization.md Show resolved Hide resolved

Merge branch 'main' into docs/expand-training-customization

1236772

Merge branch 'main' into docs/expand-training-customization

324f2c2

behroozazarkhalili enabled auto-merge (squash) November 4, 2025 15:51

sergiopaniego reviewed Nov 4, 2025

View reviewed changes

sergiopaniego mentioned this pull request Nov 4, 2025

docs: Expand speeding up training guide with acceleration methods #4428

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docs: Expand training customization examples #4427

docs: Expand training customization examples #4427

Uh oh!

behroozazarkhalili commented Nov 2, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Nov 2, 2025

Uh oh!

Uh oh!

Uh oh!

behroozazarkhalili commented Nov 3, 2025

Uh oh!

sergiopaniego left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

docs: Expand training customization examples #4427

Are you sure you want to change the base?

docs: Expand training customization examples #4427

Uh oh!

Conversation

behroozazarkhalili commented Nov 2, 2025

Summary

Changes Made

New Examples Added (5):

Documentation Improvements:

Statistics

Verification

Test Plan

Uh oh!

HuggingFaceDocBuilderDev commented Nov 2, 2025

Uh oh!

Uh oh!

Uh oh!

behroozazarkhalili commented Nov 3, 2025

Uh oh!

sergiopaniego left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants