This repository contains the code for the trust region competitive policy optimisation (TRCoPO) algorithm. The paper on competitive policy gradient can be found here, and the code for the Competitive Policy Gradient (CoPG) algorithm can be found here.
Experiment videos are available here.
- The code is tested on Python 3.5.2.
- Only the Markov Soccer experiment requires the OpenSpiel library; all other experiments can be run directly.
- Requires torch.utils.tensorboard for logging.
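A quick way to verify the dependencies is a short import check (a minimal sketch; the module names are the standard ones for PyTorch and OpenSpiel, nothing from this repository):

```python
# Minimal dependency check; OpenSpiel is only needed for Markov Soccer.
import sys
print(sys.version)  # the code is tested on Python 3.5.2

import torch
from torch.utils.tensorboard import SummaryWriter  # used for logging results

try:
    import pyspiel  # Python module provided by OpenSpiel
except ImportError:
    print("OpenSpiel not installed: required only for the Markov Soccer experiment")
```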
.
├── notebooks
│ ├── RockPaperScissors.ipynb
│ ├── MatchingPennies.ipynb
├── game # Each game has a separate folder with this structure
│ ├── game.py
│ ├── copg_game.py
│ ├── gda_game.py
│ ├── network.py
├── copg_optim
│ ├── copg.py
│ ├── critic_functions.py
│ ├── utils.py
├── car_racing_simulator
└── ...
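The exact architectures live in each game's network.py. As a rough illustration only (the class name and layer sizes below are hypothetical, not taken from the repository), a policy network for a discrete-action game could look like this:

```python
import torch
import torch.nn as nn

class Policy(nn.Module):
    """Hypothetical stand-in for a network defined in a game's network.py."""
    def __init__(self, state_dim, num_actions):
        super(Policy, self).__init__()
        self.layers = nn.Sequential(
            nn.Linear(state_dim, 64),
            nn.Tanh(),
            nn.Linear(64, num_actions),
        )

    def forward(self, state):
        # Return a categorical distribution over actions so the training code
        # can sample actions and compute log-probabilities.
        logits = self.layers(state)
        return torch.distributions.Categorical(logits=logits)
```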
- [Jupyter notebooks] are the best place to start; they contain demonstrations and results.
- The folder [copg_optim] contains the optimization code.
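For orientation, the gda_game.py script in each game folder corresponds to the simultaneous gradient descent-ascent (GDA) baseline, while copg_game.py trains with the competitive optimizer from [copg_optim]. The snippet below is a self-contained sketch of plain GDA on the Matching Pennies payoff matrix; it illustrates the baseline idea only and is not this repository's implementation:

```python
import torch

# Matching Pennies payoff to player 1 (the game is zero-sum).
A = torch.tensor([[1., -1.],
                  [-1., 1.]])

x = torch.zeros(2, requires_grad=True)  # player 1 policy logits
y = torch.zeros(2, requires_grad=True)  # player 2 policy logits
lr = 0.1

for _ in range(200):
    p1 = torch.softmax(x, dim=0)
    p2 = torch.softmax(y, dim=0)
    payoff = p1 @ A @ p2                      # expected payoff to player 1
    gx, gy = torch.autograd.grad(payoff, [x, y])
    with torch.no_grad():
        x += lr * gx                          # player 1 ascends its payoff
        y -= lr * gy                          # player 2 descends it (zero-sum)

print(torch.softmax(x, dim=0), torch.softmax(y, dim=0))
```

On games like this, plain GDA is known to oscillate around the mixed equilibrium rather than converge to it, which is the kind of behaviour the competitive updates in this repository are designed to address.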
Open a Jupyter notebook and run it to see the results, or run an experiment from the command line:
git clone "address"
cd trcopo
cd RockPaperScissors
python3 trcopo_rps.py
cd ..
cd tensorboard
tensorboard --logdir .
You can view the results in TensorBoard.
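The training scripts produce those TensorBoard event files via torch.utils.tensorboard. As a minimal sketch of that logging pattern (the log directory and tag name below are made up; the real ones are set inside each training script):

```python
from torch.utils.tensorboard import SummaryWriter

writer = SummaryWriter('tensorboard/demo_run')            # hypothetical log directory
for iteration in range(10):
    writer.add_scalar('reward/player1', 0.0, iteration)   # hypothetical tag and value
writer.close()
```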