RL Project

Author: 黄正翔 (Huang, Zhengxiang)

This is the final project for Reinforcement Learning (RL).

Demo video clips:

Ant

HalfCheetah

Hopper

Humanoid

VideoPinball

Boxing

0. Environment

I used Gymnasium (the maintained fork of OpenAI Gym, now developed by the Farama Foundation) as the environment library, since the original gym is no longer officially maintained.

pip install gymnasium
pip install gymnasium[atari]
pip install gymnasium[accept-rom-license]
pip install gymnasium[mujoco]

You also need OpenCV:

pip install opencv-python

The tutorial I followed is at https://gymnasium.farama.org/environments/classic_control/pendulum/

I also referred to https://gymnasium.farama.org/tutorials/training_agents/blackjack_tutorial/ to build the agent.
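The agent-environment loop covered in those tutorials can be sketched as below. This is a minimal illustration, not the code in agent.py: `run_episode` and `CountdownEnv` are hypothetical names, and the stub environment only mimics the Gymnasium convention that `reset()` returns `(observation, info)` and `step()` returns `(observation, reward, terminated, truncated, info)`.

```python
class CountdownEnv:
    """Hypothetical stand-in that follows the Gymnasium reset/step convention."""

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self, seed=None):
        self.t = 0
        return self.t, {}  # (observation, info)

    def step(self, action):
        self.t += 1
        reward = 1.0 if action == 1 else 0.0  # action 1 is rewarded, anything else is not
        terminated = self.t >= self.horizon   # episode ends after `horizon` steps
        truncated = False                     # this stub has no time-limit truncation
        return self.t, reward, terminated, truncated, {}


def run_episode(env, policy, seed=None):
    """Run one episode and return the undiscounted sum of rewards."""
    obs, info = env.reset(seed=seed)
    total_reward = 0.0
    done = False
    while not done:
        action = policy(obs)
        obs, reward, terminated, truncated, info = env.step(action)
        total_reward += reward
        done = terminated or truncated
    return total_reward


# A policy that always picks action 1 collects one reward per step.
episode_return = run_episode(CountdownEnv(horizon=5), policy=lambda obs: 1)
```

With Gymnasium installed, an environment created by `gymnasium.make(...)` can be passed to `run_episode` in place of the stub.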

1. Train the agent

For games with a discrete action space, we recommend using DQN. Train the agent with DQN.

python run.py --env_name VideoPinball-ramNoFrameskip-v4
python run.py --env_name BreakoutNoFrameskip-v4
python run.py --env_name PongNoFrameskip-v4
python run.py --env_name BoxingNoFrameskip-v4

python run.py --env_name Hopper-v2
python run.py --env_name Humanoid-v2
python run.py --env_name HalfCheetah-v2
python run.py --env_name Ant-v2
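For the discrete-action games, the two core pieces of DQN (epsilon-greedy action selection and the one-step bootstrapped target) can be sketched in plain Python as follows. This is an illustrative sketch only: the function names, the `gamma` default, and the use of plain lists instead of network outputs are assumptions, and the project's agent.py and model.py may be organized differently.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a uniformly random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def td_target(reward, next_q_values, terminated, gamma=0.99):
    """One-step DQN target: r + gamma * max over a' of Q_target(s', a')."""
    if terminated:
        return reward  # no bootstrapping past a terminal state
    return reward + gamma * max(next_q_values)


# With epsilon=0.0 the choice is always greedy.
greedy_action = epsilon_greedy([0.1, 0.7, 0.2], epsilon=0.0)
target = td_target(reward=1.0, next_q_values=[0.5, 2.0], terminated=False, gamma=0.9)
```

In a full DQN, `next_q_values` would come from a separate target network and the squared difference between `target` and the online network's Q-value for the taken action would be minimized over minibatches drawn from a replay buffer.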

2. Visualize the results

Watch the trained agent play one episode, rendered on screen:

python vis.py --render_mode human --test_times 1

Calculate the average score over 10 test episodes:

python vis.py --test_times 10
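Averaging over several test episodes amounts to something like the sketch below. `summarize_returns` is a hypothetical helper, not a function from vis.py, and the numbers are placeholders rather than results from this project.

```python
from statistics import mean, pstdev


def summarize_returns(episode_returns):
    """Return (mean, population standard deviation) of per-episode scores."""
    if not episode_returns:
        raise ValueError("need at least one episode return")
    return mean(episode_returns), pstdev(episode_returns)


# Placeholder scores for illustration only.
avg, spread = summarize_returns([10.0, 12.0, 14.0])
```

Reporting the spread next to the mean helps judge whether a difference between two checkpoints is larger than the episode-to-episode noise.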

3. "Human Expert"

BoxingNoFrameskip-v4: 5
BreakoutNoFrameskip-v4: 30
PongNoFrameskip-v4: -4.5
VideoPinball-ramNoFrameskip-v4: 5210

python human.py --env_name VideoPinball-ramNoFrameskip-v4
python human.py --env_name BreakoutNoFrameskip-v4
python human.py --env_name PongNoFrameskip-v4
python human.py --env_name BoxingNoFrameskip-v4
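Assuming the numbers listed in this section are scores obtained by human play, an agent's average score can be compared against them as in this small sketch. The helper names are hypothetical, and no agent results are implied.

```python
# Human scores as listed in this README.
HUMAN_SCORES = {
    "BoxingNoFrameskip-v4": 5,
    "BreakoutNoFrameskip-v4": 30,
    "PongNoFrameskip-v4": -4.5,
    "VideoPinball-ramNoFrameskip-v4": 5210,
}


def score_gap(env_name, agent_avg_score):
    """Agent average minus the listed human score (positive means the agent is ahead)."""
    return agent_avg_score - HUMAN_SCORES[env_name]


def beats_human(env_name, agent_avg_score):
    """True if the agent's average score exceeds the listed human score."""
    return score_gap(env_name, agent_avg_score) > 0
```

For example, `beats_human("PongNoFrameskip-v4", 0.0)` is true because a drawn game (0) is better than the listed -4.5.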

4. Other Utils

python vis.py --render_mode rgb_array --test_times 1 --env_name VideoPinball-ramNoFrameskip-v4
python vis.py --render_mode rgb_array --test_times 1 --env_name BoxingNoFrameskip-v4
python vis.py --env_name BreakoutNoFrameskip-v4 --render_mode human --test_times 1
python vis.py --env_name Humanoid-v2 --render_mode human --test_times 1
python run.py --env_name VideoPinball-ramNoFrameskip-v4 --load
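The flags used in the commands above (`--env_name`, `--render_mode`, `--test_times`, `--load`) could be parsed along the following lines. This is a guess at the interface based only on the commands shown here: the defaults, the allowed choices, and the meaning of `--load` (assumed to be a boolean switch for resuming from a saved checkpoint) are assumptions, and the real run.py and vis.py may differ.

```python
import argparse


def build_parser():
    """Hypothetical parser accepting the flags that appear in this README."""
    parser = argparse.ArgumentParser(description="Hypothetical CLI sketch")
    # Default environment is an assumption, chosen from the names listed above.
    parser.add_argument("--env_name", default="PongNoFrameskip-v4")
    # Only the two render modes used in this README are accepted here.
    parser.add_argument("--render_mode", choices=["human", "rgb_array"], default=None)
    parser.add_argument("--test_times", type=int, default=1)
    # Assumed to be a boolean switch (no value follows it in the command above).
    parser.add_argument("--load", action="store_true")
    return parser


args = build_parser().parse_args(
    ["--env_name", "BoxingNoFrameskip-v4", "--render_mode", "rgb_array", "--test_times", "1"]
)
```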
