Commit f7ac974

Update OpenEnv guide with new notebook (#4555)
1 parent c0de042 commit f7ac974

1 file changed: +4 -1 lines changed

docs/source/openenv.md

Lines changed: 4 additions & 1 deletion
@@ -5,7 +5,7 @@
 In this guide, we’ll focus on **how to integrate OpenEnv with TRL**, but feel free to explore the links above to dive deeper into OpenEnv itself.
 
 > [!NOTE]
-> You can explore ready-to-use example scripts in the [`examples/scripts/openenv/`](https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/) directory.
+> You can explore ready-to-use example [scripts](example_overview#scripts) and [notebooks](example_overview#notebooks) in the Examples Overview.
 
 > [!NOTE]
 > Explore the [OpenEnv docs](https://meta-pytorch.org/OpenEnv/) for more details.
@@ -349,6 +349,9 @@ Below is the reward curve from training:
 
 Let's level this up a bit by training a model to interact with a more complex environment. We'll use the word-guessing game [Wordle](https://www.nytimes.com/games/wordle/index.html) from the [`TextArena`](https://meta-pytorch.org/OpenEnv/environments/textarena/) environment.
 
+> [!NOTE]
+> You can explore the notebook version of this example [here](https://github.com/huggingface/trl/blob/main/examples/notebooks/openenv_wordle_grpo.ipynb).
+
 ### The TextArena Environment
 
 [TextArena](https://huggingface.co/papers/2504.11442) is an open-source collection of competitive text-based games designed to evaluate reasoning skills in LLMs using textual games like Wordle, Snake, Tic-Tac-Toe, and more. Research has shown that such games improve model performance on reasoning tasks.
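
For readers unfamiliar with Wordle, the feedback rule that the game is built around can be sketched as below. This is a minimal illustration only, not the TextArena or OpenEnv implementation: the function name `wordle_feedback` and the `G`/`Y`/`X` letter codes (correct position, wrong position, absent) are assumptions made for this example.

```python
from collections import Counter


def wordle_feedback(guess: str, target: str) -> str:
    """Return per-letter feedback for a guess (hypothetical helper).

    G = right letter, right position
    Y = letter is in the target, but at a different position
    X = letter is not in the target (or all its copies are already used)
    """
    guess, target = guess.lower(), target.lower()
    if len(guess) != len(target):
        raise ValueError("guess and target must have the same length")

    feedback = ["X"] * len(guess)

    # Count target letters that were NOT matched exactly; only these
    # can still produce a "Y" for a misplaced letter.
    remaining = Counter(t for g, t in zip(guess, target) if g != t)

    # First pass: mark exact matches.
    for i, (g, t) in enumerate(zip(guess, target)):
        if g == t:
            feedback[i] = "G"

    # Second pass: mark misplaced letters, consuming each available
    # target letter at most once so duplicates are handled correctly.
    for i, g in enumerate(guess):
        if feedback[i] != "G" and remaining[g] > 0:
            feedback[i] = "Y"
            remaining[g] -= 1

    return "".join(feedback)


if __name__ == "__main__":
    print(wordle_feedback("slate", "crane"))
    print(wordle_feedback("allee", "apple"))
```

The two-pass structure matters for repeated letters: exact matches are claimed first, so a second copy of a letter in the guess is only marked as misplaced if the target still has an unmatched copy left.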
