DDPG:Deep Deterministic Policy Gradient_implementation

This is a Tensorflow implementation of the 'Continuous control with Deep Reinforcement learning' paper (Both of the jupyter notebook and python script is used for the 'main' file)

Paper: Continuous control with deep reinforcement learning

Env:Pendulum-v0 (with normalization and OU Noise)

Note:In the title of the above plot,there should be '250 scores' instead of '100 scores'

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

DDPG:Deep Deterministic Policy Gradient_implementation

Env:Pendulum-v0 (with normalization and OU Noise)

References: openai/baselines

Files

README.md

Latest commit

History

README.md

File metadata and controls

DDPG:Deep Deterministic Policy Gradient_implementation

Env:Pendulum-v0 (with normalization and OU Noise)

References: openai/baselines