Deep reinforcement algorithms

Implementations of classic Deep RL algorithms, mostly based on CS 294: Deep Reinforcement Learning at Berekley

Current state:

Policy Gradient:

Basically my solution to heavily refactored CS294 second homework. Tensorflow ლ(ಠ益ಠლ)

DQN

Still based on CS294 but implemented in  PyTorch ✧ﾟ･: *ヽ(◕ヮ◕ヽ)

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
DQN		DQN
policy-gradient		policy-gradient
res		res
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt

Provide feedback