MUTEX: Learning Unified Policies from Multimodal Task Specifications

Rutav Shah, Roberto Martín-Martín¹, Yuke Zhu¹
7th Annual Conference on Robot Learning
[Paper] [Project Website] [Dataset] [Pretrained Weights] [Real Robot Controller]
¹ Equal Advising

Setup

Installation

git clone --recursive https://github.com/UT-Austin-RPL/MUTEX.git
cd MUTEX && git submodule update --init --recursive
conda create -n mutex python=3.8
conda activate mutex
pip install -r requirements.txt
pip install -e LIBERO/.
pip install -e .

Datasets

Please set the argument folder= to the dataset directory in the configs.

Pretrained Weights

To use pretrained weights, follow the evaluation instructions mentioned below.

Usage

Training

MUTEX is trained in two stages: a) Masked Modeling and b) Cross-Modal Matching.

To run Masked Modeling,

CUDA_VISIBLE_DEVICES=0 python3 mutex/main_masked_modeling.py \
        benchmark_name=LIBERO_100 \
        policy.task_spec_modalities=gl_inst_img_vid_ai_ag \
        policy.add_mim=True policy.add_mgm=True policy.add_mrm=True \
        policy.add_mfm=True policy.add_maim=True policy.add_magm=True \
        folder=dataset-path \
        hydra.run.dir=experiments/mutex

To run Cross-Modal Matching,

CUDA_VISIBLE_DEVICES=0 python3 mutex/main_cmm.py \
        benchmark_name=LIBERO_100 \
        folder=dataset-path \
        experiment_dir=experiments/mutex

Evaluation

MUTEX is a unified policy capable of executing tasks specified by any modality: video demonstration vid, image goal img, text goals gl, text instructions inst, speech goal ag, and speech instructions ai. To run the model after cross-modal matching at epoch 20 (used in the paper), set model_name=cmm_LIBERO_100_multitask_model_ep020.pth.
An example with text goal modality is given below,

MUJOCO_EGL_DEVICE_ID=0 CUDA_VISIBLE_DEVICES=0 python mutex/eval.py \
        benchmark_name=LIBERO_100 \
        folder=dataset-path \
        eval_spec_modalities=gl \
        experiment_dir=mutex_pretrained \
        model_name=mutex_weights.pth

Citation

@inproceedings{
    shah2023mutex,
    title={{MUTEX}: Learning Unified Policies from Multimodal Task Specifications},
    author={Rutav Shah and Roberto Mart{\'\i}n-Mart{\'\i}n and Yuke Zhu},
    booktitle={7th Annual Conference on Robot Learning},
    year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
LIBERO @ 169435b		LIBERO @ 169435b
configs		configs
imgs		imgs
mutex		mutex
scripts		scripts
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
acknowledgements.md		acknowledgements.md
cross_modal_matching.sh		cross_modal_matching.sh
install.sh		install.sh
libero.yml		libero.yml
masked_modeling.sh		masked_modeling.sh
requirements.txt		requirements.txt
run_eval.sh		run_eval.sh
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MUTEX: Learning Unified Policies from Multimodal Task Specifications

Setup

Installation

Datasets

Pretrained Weights

Usage

Training

Evaluation

Citation

Acknowledgements: Mentioned here

About

Releases

Packages

Contributors 2

Languages

License

UT-Austin-RPL/MUTEX

Folders and files

Latest commit

History

Repository files navigation

MUTEX: Learning Unified Policies from Multimodal Task Specifications

Setup

Installation

Datasets

Pretrained Weights

Usage

Training

Evaluation

Citation

Acknowledgements: Mentioned here

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages