Skip to content

Commit

Permalink
[RLlib] Add log-std clipping to 'MLPHead's. (ray-project#47827)
Browse files Browse the repository at this point in the history
Signed-off-by: ujjawal-khare <ujjawal.khare@dream11.com>
  • Loading branch information
simonsays1980 authored and ujjawal-khare committed Oct 15, 2024
1 parent eeb18e5 commit 5acec92
Showing 1 changed file with 8 additions and 0 deletions.
8 changes: 8 additions & 0 deletions rllib/models/catalog.py
Original file line number Diff line number Diff line change
Expand Up @@ -73,7 +73,15 @@
"post_fcnet_bias_initializer": None,
"post_fcnet_bias_initializer_config": None,
"free_log_std": False,
# Whether to clip the log standard deviation when using a Gaussian (or any
# other continuous control distribution). This can stabilize training and avoid
# very small or large log standard deviations leading to numerical instabilities
# which can turn network outputs to `nan`. The default is to clamp the log std
# in between -20 and 20.
"log_std_clip_param": 20.0,
# Whether to skip the final linear layer used to resize the hidden layer
# outputs to size `num_outputs`. If True, then the last hidden layer
# should already match num_outputs.
"no_final_linear": False,
"vf_share_layers": True,
"use_lstm": False,
Expand Down

0 comments on commit 5acec92

Please sign in to comment.