Vanilla Policy Gradient (VPG) reinforcement learning algorithm, implemented in PyTorch
Tested on the CartPole-v0 environment from OpenAI Gym
Writeup at vitez.me/vanilla-policy-gradient
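
As a rough illustration of one building block that policy gradient methods commonly rely on, the sketch below computes discounted reward-to-go returns for a single episode. It is not taken from this repository: the function name `rewards_to_go`, the `gamma` parameter, and the choice of reward-to-go weighting are assumptions made for illustration only, and the actual implementation may weight trajectories differently.

```python
def rewards_to_go(rewards, gamma=0.99):
    """Return the discounted sum of future rewards for each timestep.

    Hypothetical helper for illustration; not part of the original code.
    """
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


# CartPole gives a reward of 1 per step; use a three-step episode as an example.
episode_returns = rewards_to_go([1.0, 1.0, 1.0], gamma=0.5)
```

In a policy gradient update, values like these would typically multiply the log-probabilities of the actions taken, so that actions followed by higher returns are made more likely.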