This is a simple implementation of the Deep Q-learning algorithm on the Atari Pong environment.

⚠ This implementation does not reach the best reported performance.
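
For orientation, the one-step targets used by Deep Q-learning and by the Double Q-learning variant can be sketched in plain Python. This is a minimal illustration and not the code in this repository: the function names are hypothetical, Q-values are passed as plain lists, and the default `gamma=0.99` is an assumed discount factor.

```python
def dqn_target(reward, next_q_target, done, gamma=0.99):
    """One-step DQN target: r + gamma * max_a Q_target(s', a), or r if terminal."""
    if done:
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, next_q_online, next_q_target, done, gamma=0.99):
    """Double DQN target: the online network picks the action,
    the target network supplies its value."""
    if done:
        return reward
    # Index of the greedy action under the online network's Q-values.
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]
```

In a full agent these targets would be computed per transition in a replay batch and regressed against the online network's Q-value for the action actually taken.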

## Requirements
- python >= 3.4
- tensorboardX
- gym >= 0.10
- pytorch >= 0.4

## Papers
- Playing Atari with Deep Reinforcement Learning [arxiv] [code]
- Deep Reinforcement Learning with Double Q-learning [arxiv] [code]
- Dueling Network Architectures for Deep Reinforcement Learning [arxiv] [code]
- A Distributional Perspective on Reinforcement Learning [arxiv] [code]
- Rainbow: Combining Improvements in Deep Reinforcement Learning [arxiv] [code]
- Distributional Reinforcement Learning with Quantile Regression [arxiv] [code]
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation [arxiv] [code]