
Update documentation for reward scaling wrappers #1285

Open · wants to merge 1 commit into main

Conversation

@keraJLi commented Jan 2, 2025

Description

Changes the documentation of the reward scaling wrappers, mainly removing incorrect or unsubstantiated information.
The affected wrappers are in wrappers/stateful_reward.py and wrappers/vector/stateful_reward.py.

Fixes #1272
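
For reference, the wrapper these files implement is NormalizeReward. A minimal usage sketch, assuming the current Gymnasium API (the environment id is an arbitrary choice for illustration; gamma and epsilon shown are the wrapper's defaults):

```python
import gymnasium as gym
from gymnasium.wrappers import NormalizeReward

# Wrap an environment so that each reward is divided by a running
# estimate of the standard deviation of the discounted return.
env = gym.make("MountainCarContinuous-v0")  # arbitrary example env
env = NormalizeReward(env, gamma=0.99, epsilon=1e-8)

obs, info = env.reset(seed=0)
for _ in range(100):
    action = env.action_space.sample()
    # reward here is the raw reward scaled by the running std of the
    # discounted return estimate; no mean is subtracted.
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        obs, info = env.reset()
```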

Type of change


  • Documentation only change (no code changed)

Checklist:

  • I have run the pre-commit checks with pre-commit run --all-files (see CONTRIBUTING.md instructions to set it up)
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • New and existing unit tests pass locally with my changes

@pseudo-rnd-thoughts (Member) left a comment

Thanks for the PR @keraJLi. To clarify, what do you mean by "their exponential moving average"?
To me, it isn't clear what the expected mean is, or what exactly the rewards are normalised by.
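
For context, here is a simplified sketch of the update the wrapper performs, based on a reading of the Gymnasium implementation (the class and variable names are illustrative, not the actual ones, and the real wrapper also resets the running return at episode boundaries):

```python
import math

class RunningReturnScaler:
    """Illustrative re-creation of NormalizeReward's core update.

    Keeps an exponentially discounted running return and divides each
    raw reward by the running standard deviation of that return. No
    mean is subtracted, so the scaled rewards are not zero-centered.
    """

    def __init__(self, gamma=0.99, epsilon=1e-8):
        self.gamma = gamma
        self.epsilon = epsilon
        self.running_return = 0.0
        # Running statistics of the discounted return (Welford's method).
        self.count = 0
        self.mean = 0.0
        self.m2 = 0.0

    def scale(self, reward):
        # Discounted running return: G_t = gamma * G_{t-1} + r_t
        self.running_return = self.gamma * self.running_return + reward
        # Update the running variance of the discounted return.
        self.count += 1
        delta = self.running_return - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (self.running_return - self.mean)
        var = self.m2 / self.count if self.count > 1 else 1.0
        # Divide by the std of the return estimate (epsilon avoids /0).
        return reward / math.sqrt(var + self.epsilon)
```

If this reading is right, the "exponential moving average" refers to the discounted return estimate G_t, and rewards are normalised by its running standard deviation rather than shifted toward any expected mean.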

Development

Successfully merging this pull request may close these issues:

[Proposal/Question] Incorrect documentation of NormalizeReward wrapper