Comparison of Deep Q-Network and Proximal Policy Optimization on Simple Pong Environment

sinanutkuulu/Reinforcement-Learning-in-Pong-DQN

Pong-Game-DQN

Analysis of Deep Q-Network algorithm on Simple Pong Environment

State Representation

The agent's state is defined by the following six variables:

  • Position of the left paddle on the y-axis
  • Position of the right paddle on the y-axis
  • Position of the ball on the y-axis
  • Position of the ball on the x-axis
  • Velocity of the ball on the x-direction
  • Velocity of the ball on the y-direction
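
The six state variables above can be packed into a single observation vector. The following is a minimal sketch; the function and argument names are illustrative assumptions, not necessarily the identifiers used in this repository:

```python
import numpy as np

def get_state(left_paddle_y, right_paddle_y, ball_x, ball_y, ball_vx, ball_vy):
    """Pack the six state variables into one observation vector.

    Ordering follows the list above: left paddle y, right paddle y,
    ball y, ball x, ball x-velocity, ball y-velocity.
    (Hypothetical helper; names are assumptions.)
    """
    return np.array(
        [left_paddle_y, right_paddle_y, ball_y, ball_x, ball_vx, ball_vy],
        dtype=np.float32,
    )
```

A fixed ordering and dtype keep the observation compatible with the input layer of the Q-network.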

Reward Function Definition

A set of predefined constants defines the rewards and penalties for the different in-game events as follows:

  1. Game end condition: If the game concludes, the function checks the outcome:
  • If the agent scores, a reward of +10 is granted.
  • If the opponent scores, a penalty of -10 is applied.
  2. In-game rewards/penalties: For ongoing games:
  • The function first checks for a collision between the ball and the agent's paddle. If the ball hits the center of the paddle, a reward of +0.1 is given; otherwise, a penalty of -0.1 is applied.
  • If there is no collision, the function evaluates whether the agent is moving toward or away from the ball, using the change in vertical distance between the ball and the paddle. Moving toward the ball earns a reward of +0.5; moving away incurs a penalty of -0.5.
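
The reward rules above can be sketched as a single function. This is a hedged illustration of the described logic, not the repository's actual code; the flag and distance parameters are assumed names:

```python
def compute_reward(game_over, agent_scored, hit_paddle, hit_center,
                   prev_dist, curr_dist):
    """Reward sketch following the rules described above.

    Hypothetical signature: `prev_dist`/`curr_dist` are the vertical
    distances between ball and paddle before and after the agent's move.
    """
    # Game end: +10 if the agent scored, -10 if the opponent did.
    if game_over:
        return 10.0 if agent_scored else -10.0
    # Paddle collision: +0.1 for hitting the center, -0.1 otherwise.
    if hit_paddle:
        return 0.1 if hit_center else -0.1
    # No collision: +0.5 for moving toward the ball, -0.5 for moving away.
    return 0.5 if curr_dist < prev_dist else -0.5
```

The small shaping rewards (±0.1, ±0.5) give the agent a dense learning signal between the sparse ±10 scoring events.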

Results

(Figure: results screenshot, 2023-08-31)

Test of agent (DQN) against nominal player

(Figure: test screenshot, 2023-08-31)
