Regarding using Regret as a Loss Function #2

Open
SachinVashisth opened this issue Dec 30, 2023 · 0 comments

Hi

Thanks for the wonderful article on Multi-Armed Bandits (MAB).
In the article, you describe the loss function as the total regret incurred by not selecting the optimal action up to time step T.
[Image: lossfunction_regret]

In the code, regret is calculated and stored so that it can be plotted once the MAB iterations are complete.
But how is it used to select better arms while the MAB is running?

Is the regret calculated only for plotting, or does it play some role in the MAB algorithm itself?
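
To make the question concrete, here is a minimal sketch of the pattern I mean. It is not the repository's code: the function name `run_epsilon_greedy`, its parameters, and the choice of an epsilon-greedy Bernoulli bandit are all assumptions made for illustration. In this sketch the regret is computed from the true arm probabilities, which the solver itself never sees, and it is only appended to a list for later plotting. Arm selection depends solely on the running reward estimates.

```python
import random


def run_epsilon_greedy(true_probs, n_steps=1000, epsilon=0.1, seed=0):
    """Hypothetical epsilon-greedy Bernoulli bandit that also logs regret."""
    rng = random.Random(seed)
    n_arms = len(true_probs)
    counts = [0] * n_arms        # number of pulls per arm
    estimates = [0.0] * n_arms   # running mean reward per arm
    best_prob = max(true_probs)  # known only to the evaluator, not the solver
    cumulative_regret = 0.0
    regret_history = []

    for _ in range(n_steps):
        # Selection uses only the estimates; regret is never consulted here.
        if rng.random() < epsilon:
            arm = rng.randrange(n_arms)
        else:
            arm = max(range(n_arms), key=lambda a: estimates[a])

        # Sample a Bernoulli reward from the chosen arm.
        reward = 1.0 if rng.random() < true_probs[arm] else 0.0

        # Incremental update of the running mean for the chosen arm.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]

        # Bookkeeping for the plot: expected regret of this pull.
        cumulative_regret += best_prob - true_probs[arm]
        regret_history.append(cumulative_regret)

    return estimates, regret_history


if __name__ == "__main__":
    estimates, regret_history = run_epsilon_greedy([0.2, 0.5, 0.8])
    print("final cumulative regret:", regret_history[-1])
```

If this matches the setup in the repository, then regret looks like an evaluation metric recorded on the side rather than a signal fed back into the arm-selection step, and that is the part I would like to confirm.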
