@behroozazarkhalili
Collaborator

Resolves #4399

Summary

This PR removes three outdated notebooks as identified in issue #4399:

  • best_of_n.ipynb
  • gpt2-sentiment.ipynb
  • gpt2-sentiment-control.ipynb

Changes

Removed Notebooks

  • best_of_n.ipynb: Outdated Best-of-N sampling strategy example using PPO
  • gpt2-sentiment.ipynb: Outdated GPT-2 IMDB sentiment tuning notebook
  • gpt2-sentiment-control.ipynb: Outdated GPT-2 sentiment control notebook

Documentation Updates

  • example_overview.md: Removed "Legacy / Older Notebooks" section containing references to the three notebooks
  • sentiment_tuning.md: Removed notebook references while keeping the ppo.py script reference, which remains current
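A quick way to confirm that no documentation page still points at the removed notebooks is to scan the Markdown sources for their filenames. The following is a minimal sketch, not part of this PR: the function name `find_dangling_references` and the `docs/source` path are assumptions for illustration, and only the three notebook filenames come from the description above.

```python
from pathlib import Path

# Notebook filenames removed by this PR (taken from the list above).
REMOVED_NOTEBOOKS = (
    "best_of_n.ipynb",
    "gpt2-sentiment.ipynb",
    "gpt2-sentiment-control.ipynb",
)


def find_dangling_references(docs_root, names=REMOVED_NOTEBOOKS):
    """Return (file, line number, name) for each Markdown line mentioning a removed notebook."""
    hits = []
    # Walk every Markdown file under docs_root in a stable order.
    for md_file in sorted(Path(docs_root).rglob("*.md")):
        text = md_file.read_text(encoding="utf-8")
        for lineno, line in enumerate(text.splitlines(), start=1):
            for name in names:
                if name in line:
                    hits.append((str(md_file), lineno, name))
    return hits


if __name__ == "__main__":
    # "docs/source" is an assumed location for the Markdown docs; adjust as needed.
    for path, lineno, name in find_dangling_references("docs/source"):
        print(f"{path}:{lineno}: still references {name}")
```

An empty result would suggest the references were fully removed. The check is a plain substring match, so it would also flag mentions in prose such as changelogs.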

Rationale

Issue #4399 flagged these notebooks as likely outdated and unmaintained. The functionality they demonstrate is better covered by:

  • Current example scripts in examples/scripts/
  • Modern notebooks using recent TRL features
  • Updated documentation in the main guides

The PPO script (examples/scripts/ppo.py) continues to provide current examples for sentiment tuning workflows.

- Removed best_of_n.ipynb (outdated PPO Best-of-N example)
- Removed gpt2-sentiment.ipynb (outdated GPT-2 sentiment tuning)
- Removed gpt2-sentiment-control.ipynb (outdated GPT-2 sentiment control)
- Updated example_overview.md to remove legacy notebooks section
- Updated sentiment_tuning.md to remove notebook references
- Kept ppo.py script reference in sentiment_tuning.md as it remains current
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

behroozazarkhalili and others added 4 commits November 3, 2025 10:16
commit 4677cf2
Author: Harras Mansoor <98635627+Harras3@users.noreply.github.com>
Date:   Wed Nov 5 04:06:13 2025 +0500

    Removed Sentiment Tuning Examples (#4424)

commit 7a9592b
Author: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
Date:   Tue Nov 4 14:32:04 2025 -0700

    🐍 Drop Python 3.9 (#4183)

commit 7f15a7f
Author: Harras Mansoor <98635627+Harras3@users.noreply.github.com>
Date:   Wed Nov 5 02:06:31 2025 +0500

    Removed outdated warning about batch contamination (#4423)

commit 8b0a3ce
Author: Albert Villanova del Moral <8515462+albertvillanova@users.noreply.github.com>
Date:   Tue Nov 4 21:37:39 2025 +0100

    Update tokenizer apply_chat_template with return_dict=True default (#4448)

commit d9f9e2b
Author: Pramodith Ballapuram <16939722+pramodith@users.noreply.github.com>
Date:   Tue Nov 4 19:56:58 2025 +0000

    Support casting to fp32 when word embeddings are tied to lm_head (#4446)

commit 4e138ab
Author: Sergio Paniego Blanco <sergiopaniegoblanco@gmail.com>
Date:   Tue Nov 4 15:15:23 2025 +0100

    Upload notebook with T4 selected (#4449)

commit 43253b2
Author: Pramodith Ballapuram <16939722+pramodith@users.noreply.github.com>
Date:   Mon Nov 3 21:07:31 2025 +0000

    Add On-Policy Distillation from thinking labs to paper index. (#4410)

    Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>

commit 6f41b18
Author: Behrooz Azarkhalili <80390531+behroozazarkhalili@users.noreply.github.com>
Date:   Mon Nov 3 10:57:51 2025 -0800

    fix: Remove chat template setting from non-SFT trainer scripts (#4437)

    Co-authored-by: Quentin Gallouédec <gallouedec.quentin@gmail.com>
    Co-authored-by: Quentin Gallouédec <45557362+qgallouedec@users.noreply.github.com>
@qgallouedec qgallouedec merged commit 91e7cdc into main Nov 4, 2025
3 checks passed
@qgallouedec qgallouedec deleted the docs/remove-outdated-notebooks branch November 4, 2025 23:52

Development

Successfully merging this pull request may close these issues.

Update or remove some of the notebooks

4 participants