## Description
Adds TorchRL training configuration for the Anymal-D velocity environment as a template for training IsaacLab environments with the new torchrl training workflow.
You can try training Anymal with TorchRL (after merging all TorchRL PRs) by running the following from `/workspace/isaaclab/source/standalone/workflows/torchrl`:

```bash
python train.py --task Isaac-Velocity-Flat-Anymal-D-v0 --num_envs 4096
```
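For a sense of what such a training configuration carries, here is an illustrative sketch. The class name, field names, and values are my assumptions for illustration (a real IsaacLab config would likely use the `configclass` decorator rather than a plain dataclass), not the exact contents of this PR:

```python
from dataclasses import dataclass


@dataclass
class AnymalDFlatPPORunnerCfg:
    """Hypothetical TorchRL PPO runner configuration for Anymal-D flat-terrain velocity tracking."""

    # rollout collection
    num_envs: int = 4096            # parallel simulated environments
    num_steps_per_env: int = 24     # rollout horizon per environment before each update
    max_iterations: int = 1500      # total training iterations

    # PPO hyperparameters (illustrative values in the range used by RSL-RL-style trainers)
    learning_rate: float = 1.0e-3
    gamma: float = 0.99             # discount factor
    lam: float = 0.95               # GAE lambda
    clip_param: float = 0.2         # PPO clipping range
    entropy_coef: float = 0.005     # entropy bonus weight
    desired_kl: float = 0.01        # target KL for adaptive learning-rate scheduling
```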
Related PRs:
#1178, #1179
This is the last PR in the group of 3 that adds the TorchRL training pipeline.
Unfortunately, the Anymal-D environment converges more slowly than it does with RSL-RL. This is probably due to policy-architecture and PPO-implementation differences, which require different hyperparameter settings. I have provided the best hyperparameters that have worked for me so far. While it is possible to speed up convergence by increasing the `desired_kl` target and `entropy_coef` to closely match RSL-RL, TorchRL policies seem to crash late in training due to spurious KL/action-noise spikes after the reward has long converged (see the sketch below for how `desired_kl` typically enters the update).
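For reference, `desired_kl` is typically consumed by an RSL-RL-style adaptive learning-rate rule; a minimal sketch of that scheme follows, assuming the TorchRL pipeline mirrors it (the function name and bounds are mine):

```python
def adapt_learning_rate(
    lr: float,
    kl: float,
    desired_kl: float = 0.01,
    lr_min: float = 1.0e-5,
    lr_max: float = 1.0e-2,
) -> float:
    """Adjust the learning rate based on the measured policy KL divergence.

    Shrinks the learning rate when the update overshoots the KL target,
    and grows it when the update is overly conservative.
    """
    if kl > desired_kl * 2.0:
        # update moved the policy too far: slow down
        lr = max(lr_min, lr / 1.5)
    elif 0.0 < kl < desired_kl / 2.0:
        # update barely moved the policy: speed up
        lr = min(lr_max, lr * 1.5)
    return lr
```

Under this scheme, raising `desired_kl` keeps the learning rate higher for longer, which speeds convergence but also permits the larger update steps that can trigger the late-training spikes mentioned above.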
## Training curves and video

Video_3197_7d9452698382ed1512bd.mp4
## Type of change
## Checklist

- I have run the `pre-commit` checks with `./isaaclab.sh --format`
- I have updated the changelog and the corresponding version in the extension's `config/extension.toml` file
- I have added my name to the `CONTRIBUTORS.md` or my name already exists there