My simple implementation of actor-critic RL in Tensorflow.
See the blog at https://medium.com/@asteinbach/actor-critic-using-deep-rl-continuous-mountain-car-in-tensorflow-4c1fb2110f7c for description of this code.
For the theory of this actor-critic algo see my blog at https://medium.com/@asteinbach/rl-introduction-simple-actor-critic-for-continuous-actions-4e22afb712
This code originally inspired by Denny Britz' implementation here: https://github.com/dennybritz/reinforcement-learning/blob/master/PolicyGradient/Continuous%20MountainCar%20Actor%20Critic%20Solution.ipynb