added support for multiple dimension continuous action spaces #177

devin-m-NRL · 2021-10-01T19:12:38Z

Four themes to changes

prediction_policy_network output is 2*action space, one mean and standard deviation for each joint. Log_prob is summed after being calculated for each joint
dynamics_encoded_state_network function now takes into account an action array
Functions that now need to work for arrays: Np.random.choice, item, and dictionary
changes for tensorboard to save video renders

devin-m-NRL · 2021-10-01T19:21:43Z

Results: Sawyer shelf environment I added had reward of -43 which is not great but performs okay. It trained with one gpu for 110,000 training steps and 55,000 self play games over 10 days.

added support for multiple dimension continuous action spaces

fa9e6c6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

added support for multiple dimension continuous action spaces #177

added support for multiple dimension continuous action spaces #177

devin-m-NRL commented Oct 1, 2021

devin-m-NRL commented Oct 1, 2021 •

edited

Loading

added support for multiple dimension continuous action spaces #177

Are you sure you want to change the base?

added support for multiple dimension continuous action spaces #177

Conversation

devin-m-NRL commented Oct 1, 2021

devin-m-NRL commented Oct 1, 2021 • edited Loading

devin-m-NRL commented Oct 1, 2021 •

edited

Loading