teamwork_marl

Teamwork-based map exploration through multi-agent reinforcement learning (MARL).

This project implements the QMIX learning algorithm, training multiple agents with heterogeneous capabilities to efficiently and quickly explore a randomly generated map.

Example:

Two agents explore a random map. The red agent can move through green tiles while the purple agent can move through blue tiles. The left side of the gif below shows the "ground truth" global state, while the right side shows the red agent's observation combined with the purple agent's observation history (i.e. the shared local observations).

Project Breakdown

Map Engine

The Map Engine randomly generates explorable areas and handles agent actions. It receives agent action tensors as input and outputs the global ground truth state and each agent's shared local observations.

QMIX implementation

The QMIX algorithm is implemented with PyTorch / TorchRL.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
media		media
.gitignore		.gitignore
README.md		README.md
environment_engine.py		environment_engine.py
learner.py		learner.py
simple_map.csv		simple_map.csv
tile.py		tile.py
visualizer.py		visualizer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

teamwork_marl

Project Breakdown

Map Engine

QMIX implementation

About

Releases

Packages

Contributors 2

Languages

jesseinouye/teamwork_marl

Folders and files

Latest commit

History

Repository files navigation

teamwork_marl

Project Breakdown

Map Engine

QMIX implementation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages