bandit-simulations

Simulation Code for Bandit Algorithms https://docs.google.com/presentation/d/1D2xYWAkfR0exT9pfThozlPL8S_zqgrKYEAVWA77Q_xA/edit?usp=sharing

MHA Project

TS-Contextual Bandit

TS-Contextual Bandit Algorithm in used for many bandit designs in Mental Health America (MHA) project.

Detailed code of TS-Contextual Bandit can be accessed through the following link here.

TS-Traditional Bandit

TS-Traditional Bandit Algorithm follows Beta-Bernoulli with Thompson Sampling method.

Detailed code of Traditional Bandit can be accessed through the following link here.

TS-PostDiff Bandit

TS-PostDiff Bandit Algorithm is similar to TS-Traditional but involves a threshold c to adjust the policy of the bandit algorithm. Similar to epsilon-greedy, it mixes Uniform Random and TS-Traditional policy in the algorithm.

Detailed code of Traditional Bandit can be accessed through the following link here.

Setup

Install packages

This code supports Python 3.9+.
pip install -r requirements.txt
Someitmes installation can occur via the wrong python version if pip is already associated with a python version:
- if you run pip install and continue to run into 'package not found' issues try python3.9 -m pip install <PacakgeName>.
- pip itself is written in python so you can choose the version of python that runs pip and for which packages are installed with the above command

How To Run?

Note: If you are running this code under Jupyter Notebook/Google Colab environments, you should include --notebook_mode=True to all following commands.

Running Simulations

To run simulations for different policy settings, run the following command from the root directory to this repository:

python main.py simulate --config_path=<path_to_your_configs_file> --output_path=<path_to_your_outputs> --checkpoint_path=<path_to_your_checkpoints>

This command will write simulation results and evaluation results under two directory to <path_to_your_outputs>.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
datasets		datasets
examples		examples
metrics		metrics
policies		policies
sample_configs		sample_configs
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bandit-simulations

MHA Project

TS-Contextual Bandit

TS-Traditional Bandit

TS-PostDiff Bandit

Setup

Install packages

How To Run?

Running Simulations

About

Releases

Packages

Contributors 3

Languages

License

Intelligent-Adaptive-Interventions-Lab/bandit-simulations

Folders and files

Latest commit

History

Repository files navigation

bandit-simulations

MHA Project

TS-Contextual Bandit

TS-Traditional Bandit

TS-PostDiff Bandit

Setup

Install packages

How To Run?

Running Simulations

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages