DDPG: Deep Deterministic Policy Gradient implementation

This is a TensorFlow implementation of the paper 'Continuous control with deep reinforcement learning'. The 'main' file is provided both as a Jupyter notebook and as a Python script.

Paper: Continuous control with deep reinforcement learning

Env: Pendulum-v0 (with normalization and OU noise)
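For readers unfamiliar with OU (Ornstein-Uhlenbeck) noise, the following is a minimal sketch of how such exploration noise is commonly generated and added to a deterministic action. It is not taken from this repository's code: the class `OUNoise`, the helper `noisy_action`, and the step size `dt` are hypothetical names and values chosen for illustration. The defaults `theta=0.15` and `sigma=0.2` follow the values reported in the paper, and the clipping range of [-2, 2] matches the Pendulum action bounds.

```python
import math
import random


class OUNoise:
    """Ornstein-Uhlenbeck process, discretized with an Euler-Maruyama step.

    Hypothetical helper for illustration; not this repository's implementation.
    """

    def __init__(self, size, mu=0.0, theta=0.15, sigma=0.2, dt=1.0, seed=None):
        self.size = size      # number of action dimensions
        self.mu = mu          # long-run mean the noise reverts to
        self.theta = theta    # mean-reversion rate
        self.sigma = sigma    # scale of the random perturbation
        self.dt = dt          # step size (an assumption; not given in the README)
        self.rng = random.Random(seed)
        self.reset()

    def reset(self):
        """Restart the process at its mean, e.g. at the start of each episode."""
        self.state = [self.mu] * self.size

    def sample(self):
        """Advance the process one step and return a copy of the new state."""
        scale = self.sigma * math.sqrt(self.dt)
        self.state = [
            x + self.theta * (self.mu - x) * self.dt + scale * self.rng.gauss(0.0, 1.0)
            for x in self.state
        ]
        return list(self.state)


def noisy_action(action, noise, low=-2.0, high=2.0):
    """Add OU noise to a scalar action and clip it to the action bounds."""
    perturbed = action + noise.sample()[0]
    return max(low, min(high, perturbed))


if __name__ == "__main__":
    noise = OUNoise(size=1, seed=0)
    # 0.5 stands in for the actor network's output for the current state.
    print(noisy_action(0.5, noise))
```

Because each sample depends on the previous one, consecutive noise values are correlated, which is the property that makes this process a common choice for exploration in control tasks with momentum. Calling `reset()` at the start of every episode keeps the noise from carrying over between episodes.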

[Plot of scores (image not available)]

Note: The title of the plot above should read '250 scores' instead of '100 scores'.

References: openai/baselines