Reinforcement learning code for RoboPianist.
Note: Make sure you are using the same conda environment you created for RoboPianist (see here).
- Install JAX
- Run
pip install -r requirements.txt
We provide an example bash script to train a policy to play Twinkle Twinkle Little Star with the task parameters used in the paper.
bash run.sh
To look at all the possible command-line flags, run:
python train.py --help
If you use this code, please cite our paper:
@article{zakka2023robopianist,
author = {Zakka, Kevin and Smith, Laura and Gileadi, Nimrod and Howell, Taylor and Peng, Xue Bin and Singh, Sumeet and Tassa, Yuval and Florence, Pete and Zeng, Andy and Abbeel, Pieter},
title = {{RoboPianist: A Benchmark for High-Dimensional Robot Control}},
journal = {arXiv preprint arXiv:2304.04150},
year = {2023},
}