Development of multiple reinforcement-learning agents that learn the Acrobot-v1 environment of the OpenAI Gym library.
The first group of agents uses neural networks to estimate the action-value function Q of this continuous-state, discrete-action environment. These are semi-gradient Q-learning and SARSA algorithms, each in a classic and a DQN (deep Q-network) variant, as sketched below.
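For illustration, here is a minimal sketch of the semi-gradient Q-learning update with a small PyTorch network. The architecture, optimiser, and hyperparameters are illustrative assumptions, not the repository's exact code; only the state/action dimensions match Acrobot-v1.

```python
import torch
import torch.nn as nn

class QNet(nn.Module):
    """Maps a continuous state to one Q-value per discrete action."""
    def __init__(self, state_dim=6, n_actions=3, hidden=64):  # Acrobot-v1 sizes
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, n_actions),
        )

    def forward(self, s):
        return self.net(s)

q_net = QNet()
optimizer = torch.optim.Adam(q_net.parameters(), lr=1e-3)  # assumed hyperparameters
gamma = 0.99

def q_learning_step(s, a, r, s_next, done):
    """One semi-gradient Q-learning update for a single transition.

    s, s_next: float tensors of shape (6,); a: int action index.
    """
    q_sa = q_net(s)[a]
    with torch.no_grad():  # target treated as a constant: the "semi-gradient" part
        target = r + (0.0 if done else gamma * q_net(s_next).max().item())
    loss = (q_sa - target) ** 2
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

The SARSA variant differs only in the target: it uses the Q-value of the action actually taken in `s_next` instead of the max over actions.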
The second group of agents uses neural networks to estimate the policy directly. These are Monte-Carlo policy gradient (REINFORCE), Monte-Carlo Advantage Actor-Critic (A2C), and TD(0)-A2C agents; a REINFORCE sketch follows below.
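As a sketch of the Monte-Carlo policy-gradient idea, the update below maximises log-probability weighted by the discounted return of each visited state. Again, the network and names (`PolicyNet`, `reinforce_update`) are illustrative assumptions rather than the repository's code.

```python
import torch
import torch.nn as nn

class PolicyNet(nn.Module):
    """Maps a state to a categorical distribution over discrete actions."""
    def __init__(self, state_dim=6, n_actions=3, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, n_actions),
        )

    def forward(self, s):
        return torch.distributions.Categorical(logits=self.net(s))

policy = PolicyNet()
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)
gamma = 0.99

def reinforce_update(states, actions, rewards):
    """Monte-Carlo policy-gradient update over one complete episode."""
    returns, g = [], 0.0
    for r in reversed(rewards):        # discounted returns G_t, computed backwards
        g = r + gamma * g
        returns.append(g)
    returns.reverse()
    loss = 0.0
    for s, a, g_t in zip(states, actions, returns):
        # minimise -log pi(a|s) * G_t, i.e. ascend the policy gradient
        loss = loss - policy(s).log_prob(torch.as_tensor(a)) * g_t
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

The A2C agents replace the raw return `G_t` with an advantage estimate from a learned critic, which reduces the variance of the gradient.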
Results are summarised in the PDF; figures and videos visualising the agents' performance are in the respective subfolders.
Additionally, the developed Q-learning DQN agent was also capable of solving the harder MountainCar-v0 environment of the OpenAI Gym library.
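Since both tasks expose the same discrete-action Gym interface, switching environments only means changing the environment ID. A minimal interaction loop is sketched below with a random policy as a stand-in for the trained agent; the classic Gym API is assumed (newer gym/gymnasium versions return `(obs, info)` from `reset()` and a 5-tuple from `step()`).

```python
import gym

for env_id in ("Acrobot-v1", "MountainCar-v0"):
    env = gym.make(env_id)
    state = env.reset()
    done, total_reward = False, 0.0
    while not done:                           # both tasks are time-limited
        action = env.action_space.sample()    # stand-in for the agent's policy
        state, reward, done, info = env.step(action)
        total_reward += reward
    print(env_id, total_reward)
    env.close()
```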