Skip to content

Latest commit

 

History

History
11 lines (6 loc) · 267 Bytes

README.md

File metadata and controls

11 lines (6 loc) · 267 Bytes

Vanilla Policy Gradient

VPG reinforcement learning algorithm in PyTorch

Tested on CartPole-v0 environment from OpenAI Gym

Writeup at vitez.me/vanilla-policy-gradient

graph

trained