
Equivariant Diffusion Policy

Project Website | Paper | Video
Dian Wang1, Stephen Hart2, David Surovik2, Tarik Kelestemur2, Haojie Huang1, Haibo Zhao1, Mark Yeatman2, Jiuguang Wang2, Robin Walters1, Robert Platt12
1Northeastern University, 2Boston Dynamics AI Institute
Conference on Robot Learning 2024 (Oral)

Installation

  1. Install the following apt packages for mujoco:

    sudo apt install -y libosmesa6-dev libgl1-mesa-glx libglfw3 patchelf
  2. Install gfortran (dependency for escnn):

    sudo apt install -y gfortran
  3. Install Mambaforge (strongly recommended) or Anaconda

  4. Clone this repo

    git clone https://github.com/pointW/equidiff.git
    cd equidiff
  5. Install the environment. Use Mambaforge (strongly recommended):

    mamba env create -f conda_environment.yaml
    conda activate equidiff

    or use Anaconda (not recommended):

    conda env create -f conda_environment.yaml
    conda activate equidiff
  6. Install mimicgen:

    cd ..
    git clone https://github.com/NVlabs/mimicgen_environments.git
    cd mimicgen_environments
    # This project was developed with MimicGen v0.1.0. Later versions may work but are untested
    git checkout 081f7dbbe5fff17b28c67ce8ec87c371f32526a9
    pip install -e .
    cd ../equidiff
  7. Make sure the mujoco version is 2.3.2 (required by mimicgen):

    pip list | grep mujoco
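
Alternatively, a quick check from Python (a sketch assuming the mujoco Python bindings expose __version__, which the 2.x packages do):

    # Verify the pinned mujoco version from inside the equidiff environment.
    import mujoco
    assert mujoco.__version__ == "2.3.2", mujoco.__version__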

Dataset

Download Dataset

# Download all datasets
python equi_diffpo/scripts/download_datasets.py --tasks stack_d1 stack_three_d1 square_d2 threading_d2 coffee_d2 three_piece_assembly_d2 hammer_cleanup_d1 mug_cleanup_d1 kitchen_d1 nut_assembly_d0 pick_place_d0 coffee_preparation_d1
# Alternatively, download one (or several) datasets of interest, e.g.,
python equi_diffpo/scripts/download_datasets.py --tasks stack_d1

Generating Voxel and Point Cloud Observations

# Template
python equi_diffpo/scripts/dataset_states_to_obs.py --input data/robomimic/datasets/[dataset]/[dataset].hdf5 --output data/robomimic/datasets/[dataset]/[dataset]_voxel.hdf5 --num_workers=[n_worker]
# Replace [dataset] and [n_worker] with your choices.
# E.g., use 24 workers to generate point cloud and voxel observations for stack_d1
python equi_diffpo/scripts/dataset_states_to_obs.py --input data/robomimic/datasets/stack_d1/stack_d1.hdf5 --output data/robomimic/datasets/stack_d1/stack_d1_voxel.hdf5 --num_workers=24
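
To sanity-check a generated file, you can inspect it with h5py. The data/demo_*/obs layout below is the standard robomimic convention; the exact observation key names written by the script are not assumed here, just printed:

    import h5py

    # Open the generated dataset from the example above and list its contents.
    with h5py.File("data/robomimic/datasets/stack_d1/stack_d1_voxel.hdf5", "r") as f:
        demos = list(f["data"].keys())  # one group per demonstration
        print(f"{len(demos)} demos")
        # Observation keys of the first demo should now include the new
        # voxel / point cloud entries added by dataset_states_to_obs.py.
        print(list(f["data"][demos[0]]["obs"].keys()))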

Convert Action Space in Dataset

The downloaded datasets have a relative action space. To train with an absolute action space, each dataset needs to be converted accordingly (a conceptual sketch of the conversion follows the commands below):

# Template
python equi_diffpo/scripts/robomimic_dataset_conversion.py -i data/robomimic/datasets/[dataset]/[dataset].hdf5 -o data/robomimic/datasets/[dataset]/[dataset]_abs.hdf5 -n [n_worker]
# Replace [dataset] and [n_worker] with your choices.
# E.g., convert stack_d1 (non-voxel) with 12 workers
python equi_diffpo/scripts/robomimic_dataset_conversion.py -i data/robomimic/datasets/stack_d1/stack_d1.hdf5 -o data/robomimic/datasets/stack_d1/stack_d1_abs.hdf5 -n 12
# E.g., convert stack_d1_voxel (voxel) with 12 workers
python equi_diffpo/scripts/robomimic_dataset_conversion.py -i data/robomimic/datasets/stack_d1/stack_d1_voxel.hdf5 -o data/robomimic/datasets/stack_d1/stack_d1_voxel_abs.hdf5 -n 12
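
For intuition, the conversion composes each delta action with the end-effector pose at that step to produce an absolute pose target. The sketch below is illustrative only, not the repo's script: it assumes a 7-D action [Δposition, axis-angle Δrotation, gripper] (the common robomimic OSC convention) and that scipy is available:

    import numpy as np
    from scipy.spatial.transform import Rotation as R

    def rel_to_abs(ee_pos, ee_rot_mat, action):
        """Compose one delta action with the current pose into an absolute target."""
        dpos, drot, grip = action[:3], action[3:6], action[6:]
        abs_pos = ee_pos + dpos  # translate in the world frame
        abs_rot = R.from_rotvec(drot).as_matrix() @ ee_rot_mat  # world-frame delta rotation
        return np.concatenate([abs_pos, R.from_matrix(abs_rot).as_rotvec(), grip])

    # Example: a 1 cm move in +x, no rotation, gripper open
    print(rel_to_abs(np.zeros(3), np.eye(3), np.array([0.01, 0, 0, 0, 0, 0, -1.0])))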

Training with image observation

To train Equivariant Diffusion Policy (with absolute pose control) on the Stack D1 task:

# Make sure you have the non-voxel converted dataset with absolute action space from the previous step 
python train.py --config-name=train_equi_diffusion_unet_abs task_name=stack_d1 n_demo=100

To train with relative pose control instead:

python train.py --config-name=train_equi_diffusion_unet_rel task_name=stack_d1 n_demo=100

To train on other tasks, replace stack_d1 with stack_three_d1, square_d2, threading_d2, coffee_d2, three_piece_assembly_d2, hammer_cleanup_d1, mug_cleanup_d1, kitchen_d1, nut_assembly_d0, pick_place_d0, or coffee_preparation_d1. Note that the corresponding dataset must already be downloaded. If training with absolute pose control, the action-space conversion above is also required.

To run environments on the CPU (to save GPU memory), use osmesa instead of egl by setting MUJOCO_GL=osmesa PYOPENGL_PLATFORM=osmesa, e.g.,

MUJOCO_GL=osmesa PYOPENGL_PLATFORM=osmesa python train.py --config-name=train_equi_diffusion_unet_abs task_name=stack_d1
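
If you launch training from a Python script instead, the same backend selection can be made via os.environ, provided it happens before mujoco or OpenGL is first imported (an assumption about import order, not something the repo documents):

    import os

    # Select software rendering; must run before any mujoco / OpenGL import.
    os.environ["MUJOCO_GL"] = "osmesa"
    os.environ["PYOPENGL_PLATFORM"] = "osmesa"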

Equivariant Diffusion Policy requires around 22 GB of GPU memory at the default batch size of 128. To reduce GPU usage, consider training with a smaller batch size and/or a smaller hidden dimension:

# to train with batch size of 64 and hidden dimension of 64
MUJOCO_GL=osmesa PYOPENGL_PLATFORM=osmesa python train.py --config-name=train_equi_diffusion_unet_abs task_name=stack_d1 policy.enc_n_hidden=64 dataloader.batch_size=64

Training with voxel observation

To train Equivariant Diffusion Policy (with absolute pose control) on the Stack D1 task:

# Make sure you have the voxel converted dataset with absolute action space from the previous step 
python train.py --config-name=train_equi_diffusion_unet_voxel_abs task_name=stack_d1 n_demo=100

License

This repository is released under the MIT license. See LICENSE for additional details.

Acknowledgement
