
Deep Tracking Control with Lite3

Main contribution

To summarize, this project combines a traditional MPC-based, terrain-aware foothold planner with deep reinforcement learning (DRL). The goal is to achieve robust control on extremely risky terrain such as stepping stones.

You can find the modifications in legged_robot_dtc.py and legged_robot_config.py.

Foothold Planner

In this project, we adopt a method similar to TAMOLS and the Mini-Cheetah foothold heuristic.

An estimated foothold is first computed by the formula:

$$r_i^{cmd} = p_{shoulder,i} + p_{symmetry} + p_{centrifugal}$$

where

$$p_{shoulder,i} = p_k + R_z(\Psi_k)\, l_i$$

$$p_{symmetry} = \frac{t_{stance}}{2} v + k (v - v^{cmd})$$

The centrifugal term is omitted. Here $p_k$ is the body position at timestep $k$, $l_i$ is the shoulder position of the $i$-th leg in the local frame, $R_z(\Psi_k)$ is the yaw rotation matrix mapping the local frame to the global frame, $t_{stance}$ is the stance duration, and $k = 0.03$ is the velocity feedback gain.
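A minimal NumPy sketch of this heuristic is shown below; the function name and the default `t_stance` value are illustrative, not taken from the project code.

```python
import numpy as np

def estimated_foothold(p_k, psi_k, l_i, v, v_cmd,
                       t_stance=0.3, k_fb=0.03):
    """Foothold estimate: r_i = p_shoulder,i + p_symmetry (centrifugal term omitted).

    p_k   : (3,) body position at timestep k
    psi_k : yaw angle at timestep k
    l_i   : (3,) shoulder offset of leg i in the local frame
    v     : (3,) current base velocity (global frame)
    v_cmd : (3,) commanded base velocity (global frame)
    """
    c, s = np.cos(psi_k), np.sin(psi_k)
    R_z = np.array([[c, -s, 0.0],
                    [s,  c, 0.0],
                    [0.0, 0.0, 1.0]])            # yaw rotation to global frame
    p_shoulder = p_k + R_z @ l_i                 # shoulder projected to world
    p_symmetry = 0.5 * t_stance * v + k_fb * (v - v_cmd)
    return p_shoulder + p_symmetry
```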

However, rather than solving an optimization problem, we select footholds purely by a quantitative score aggregating several criteria (distance to the current foot position, terrain variance/gradient, support area, etc.).
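A hypothetical sketch of such score-based selection follows; the criteria weights and the specific feature set are illustrative, and the project's actual scoring may differ.

```python
import numpy as np

def select_foothold(candidates, r_est, height_var, height_grad,
                    w_dist=1.0, w_var=2.0, w_grad=1.0):
    """Return the candidate foothold with the lowest aggregate cost.

    candidates  : (N, 3) candidate foothold positions
    r_est       : (3,)   estimated foothold from the heuristic above
    height_var  : (N,)   local terrain height variance per candidate
    height_grad : (N,)   local terrain gradient magnitude per candidate
    """
    dist = np.linalg.norm(candidates - r_est, axis=1)   # stay near the estimate
    cost = w_dist * dist + w_var * height_var + w_grad * height_grad
    return candidates[np.argmin(cost)]
```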

DRL

We use the Isaac Gym framework with the PPO algorithm, with the following features added:

  • Removed the teacher-student framework

  • Added a GRU and CE-net as the terrain encoder; the latent dimension was increased from 64 to 512 (a minimal encoder sketch follows this list)

  • TODO: symmetric data augmentation
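As a rough illustration of the GRU branch, a minimal PyTorch encoder producing a 512-dimensional latent might look like the following; the class name and the height-scan dimension are assumptions, not the project's exact architecture.

```python
import torch
import torch.nn as nn

class TerrainEncoder(nn.Module):
    """GRU that aggregates a height-scan history into a 512-d latent."""

    def __init__(self, scan_dim=187, latent_dim=512):  # scan_dim is assumed
        super().__init__()
        self.gru = nn.GRU(input_size=scan_dim, hidden_size=latent_dim,
                          batch_first=True)

    def forward(self, scans, hidden=None):
        # scans: (batch, time, scan_dim) sequence of height-map samples
        out, hidden = self.gru(scans, hidden)
        return out[:, -1], hidden  # latent for the policy, recurrent state
```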

To integrate the planned footholds into DRL, the foot positions relative to the optimized footholds are fed as observations to both the actor and the critic network. In addition, a sparse reward term is added, which triggers at touch-down.
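A hedged PyTorch sketch of such a sparse touch-down reward, assuming boolean per-foot contact states and an illustrative Gaussian kernel width:

```python
import torch

def foothold_tracking_reward(foot_pos, planned_foothold,
                             contact, prev_contact, sigma=0.05):
    """Pay a bonus at touch-down that decays with distance to the planned foothold.

    foot_pos, planned_foothold : (num_envs, num_feet, 3) world positions
    contact, prev_contact      : (num_envs, num_feet) boolean contact states
    """
    touch_down = contact & ~prev_contact                  # rising edge only
    err = torch.norm(foot_pos[..., :2] - planned_foothold[..., :2], dim=-1)
    reward = torch.exp(-err.square() / sigma**2)          # 1 when on target
    return (touch_down * reward).sum(dim=-1)              # sparse, per env
```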

Estimated training time is 10 hours.

Setup

```bash
pip install -e rsl_rl
pip install -e .
```

References

  1. DTC: Deep Tracking Control (Jenelten et al., Science Robotics, 2024)