Multi-Agent-Learning-Environments

Hello, I pushed some python environments for Multi Agent Reinforcement Learning. Some are single agent version that can be used for algorithm testing. I provide documents for each environment, you can check the corresponding pdf files in each directory. These are just toy problems, though some of them are still hard to solve. Some environments are like:

Multi Agent Soccer Game

Multi Agent Rescue

Multi Agent Cleaner

Multi Agent Move Box

Multi Agent Catching Pig

Multi Drones Monitoring

Multi Agent Maze Running

Multi Agent Find Treasure

Firefighters

Go Together

Warehouse

Opposite

Dependency

OpenCV, swig

Multi-Agent Environment Standard

Assumption:

Each agent works synchronously.

Member Functions

reset()

reward_list, done = step(action_list)

obs_list = get_obs()

reward_list records the single step reward for each agent, it should be a list like [reward1, reward2,......]. The length should be the same as the number of agents. Each element in the list should be a integer.

done True/False, mark when an episode finishes.

action_list records the single step action instruction for each agent, it should be a list like [action1, action2,...]. The length should be the same as the number of agents. Each element in the list should be a non-negative integer.

obs_list records the single step observation for each agent, it should be a list like [obs1, obs2,...]. The length should be the same as the number of agents. Each element in the list can be any form of data, but should be in same dimension, usually a list of variables or an image.

Typical Monte Carlo Procedures

reset environment by calling reset() get initial observation get_obs() for i in range(max_MC_iter): get action_list from controller apply action by step() record returned reward list record new observation by get_obs()

Citation

Cite the environment of the following paper as:

@inproceedings{jiang2021multi,
 title={Multi-agent reinforcement learning with directed exploration and selective memory reuse},
 author={Jiang, Shuo and Amato, Christopher},
 booktitle={Proceedings of the 36th Annual ACM Symposium on Applied Computing},
 pages={777--784},
 year={2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
README		README
env_CatchPigs		env_CatchPigs
env_Cleaner		env_Cleaner
env_Drones		env_Drones
env_FindGoals		env_FindGoals
env_FindTreasure		env_FindTreasure
env_FireFighter		env_FireFighter
env_GoTogether		env_GoTogether
env_MoveBox		env_MoveBox
env_Opposite		env_Opposite
env_Rescue		env_Rescue
env_SingleCatchPigs		env_SingleCatchPigs
env_Soccer		env_Soccer
env_Warehouse		env_Warehouse
Design Standard.pdf		Design Standard.pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Agent-Learning-Environments

Multi Agent Soccer Game

Multi Agent Rescue

Multi Agent Cleaner

Multi Agent Move Box

Multi Agent Catching Pig

Multi Drones Monitoring

Multi Agent Maze Running

Multi Agent Find Treasure

Firefighters

Go Together

Warehouse

Opposite

Dependency

Multi-Agent Environment Standard

About

Releases

Packages

Languages

Bigpig4396/Multi-Agent-Reinforcement-Learning-Environment

Folders and files

Latest commit

History

Repository files navigation

Multi-Agent-Learning-Environments

Multi Agent Soccer Game

Multi Agent Rescue

Multi Agent Cleaner

Multi Agent Move Box

Multi Agent Catching Pig

Multi Drones Monitoring

Multi Agent Maze Running

Multi Agent Find Treasure

Firefighters

Go Together

Warehouse

Opposite

Dependency

Multi-Agent Environment Standard

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages