This is the TensorFlow implementation for the paper Bidrectional Model-based Policy Optimization.
pip install -r requirements.txt
python main.py --config=config.hopperNT
To change hyper-parameters, please modify the corresponding config file in config/
.
This code is mainly modified based on the mbpo codebase.