Example CEM implementation with ReLAx

This repository contains an implementation of cross entropy method (CEM) with ReLAx.

CEM actor was trained on HalfCheetah-v2 Mujoco Gym environment for 50k env-steps.

The graph of average return vs training step is shown below (batch_size=5000):

The graph below shows actual rewards vs rewards fitted with environment model:

Resulting Policy:

cem_run.mp4

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.ipynb_checkpoints		.ipynb_checkpoints
content/video		content/video
tensorboard_logs/cem_HalfCheetah-v2		tensorboard_logs/cem_HalfCheetah-v2
trained_models		trained_models
README.md		README.md
cem_model_rews.png		cem_model_rews.png
cem_training.png		cem_training.png
cem_tutorial.ipynb		cem_tutorial.ipynb

Provide feedback