VIPER_RL-torch

Pytorch implementation of Video Prediction Models as Rewards for Reinforcement Learning. VIPER leverages the next-frame log likelihoods of a pre-trained video prediction model as rewards for downstream reinforcement learning tasks. The method is flexible to the particular choice of video prediction model and reinforcement learning algorithm. The general method outline is shown below:

Install:

Create a conda environment with Python 3.10:

conda create -n viper python=3.10
conda activate viper

Install dependencies:

pip install -r requirements.txt

Downloading Data

Download the DeepMind Control Suite expert dataset with the following command:

python -m viper_rl_data.download dataset dmc

and the Atari dataset with:

python -m viper_rl_data.download dataset atari

This will produce datasets in <VIPER_INSTALL_PATH>/viper_rl_data/datasets/ which are used for training the video prediction model. The location of the datasets can be retrieved via the viper_rl_data.VIPER_DATASET_PATH variable.

Video Model Training

Use the following command to first train a VQ-GAN:

python scripts/train_vqgan.py -o viper_rl_data/checkpoints/dmc_vqgan -c viper_rl/configs/vqgan/dmc.yaml

To train the VideoGPT, update ae_ckpt in viper_rl/configs/dmc.yaml to point to the VQGAN checkpoint, and then run:

python scripts/train_videogpt.py -o viper_rl_data/checkpoints/dmc_videogpt_l16_s1 -c viper_rl/configs/videogpt/dmc.yaml

Policy training

python scripts/train_dreamer.py --configs=dmc_vision videogpt_prior_rb --task=dmc_walker_walk --reward_model=dmc_clen16_fskip1 --logdir=./logdir

Custom checkpoint directories can be specified with the $VIPER_CHECKPOINT_DIR environment variable. The default checkpoint path is set to viper_rl_data/checkpoints/.

Acknowledgments

This code is heavily inspired by the following works:

Alejandro's viper_rl jax implementation: https://github.com/Alescontrela/viper_rl
Dreamer-v3 torch implementation: https://github.com/NM512/dreamerv3-torch
danijar's Dreamer-v3 jax implementation: https://github.com/danijar/dreamerv3
danijar's Dreamer-v2 tensorflow implementation: https://github.com/danijar/dreamerv2
jsikyoon's Dreamer-v2 pytorch implementation: https://github.com/jsikyoon/dreamer-torch
RajGhugare19's Dreamer-v2 pytorch implementation: https://github.com/RajGhugare19/dreamerv2
denisyarats's DrQ-v2 original implementation: https://github.com/facebookresearch/drqv2

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
ding		ding
scripts		scripts
viper_rl		viper_rl
viper_rl_data		viper_rl_data
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
clip_demo.py		clip_demo.py
env_test.py		env_test.py
mc_obs.txt		mc_obs.txt
mdj_tasks.txt		mdj_tasks.txt
minerl_out.txt		minerl_out.txt
minerl_run_error.txt		minerl_run_error.txt
minerl_test.py		minerl_test.py
plot.py		plot.py
requirements.txt		requirements.txt
rlbench_error.txt		rlbench_error.txt
setup.py		setup.py
test.py		test.py
test_rlbench.py		test_rlbench.py
video_dreamer_out.txt		video_dreamer_out.txt
videogpt_error.txt		videogpt_error.txt
vqgan_test.txt		vqgan_test.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VIPER_RL-torch

Install:

Downloading Data

Video Model Training

Policy training

Acknowledgments

About

Releases

Packages

Languages

License

nyuolab/VIPER-torch

Folders and files

Latest commit

History

Repository files navigation

VIPER_RL-torch

Install:

Downloading Data

Video Model Training

Policy training

Acknowledgments

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages