-
Notifications
You must be signed in to change notification settings - Fork 835
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Potential problem in EpsilonGreedy.py? #336
Comments
Try
|
Wow, sending the request as well did the trick indeed! In case of AB-test, I only sent the response and the reward, and that was enough just to have the feedback counters work with Prometheus, that's why I did not consider sending the request here either. Anyway, it works now, thank you! |
I just noticed that you were right with the 2nd hint as well, seems that only binary reward works. Isn't it supposed to accept any real number between 0 and 1? |
@sasvaritoni I'm in the process of overhauling the example router components, you can find a pull request with an updated e-greedy implementation at #335. On the second point, the reward can be any real number and for the e-greedy example it is assumed to be a real number in [0,1]. The main assumption/limitation here is that because the send-feedback endpoint supports batch requests, we assume that the reward is interpreted as the proportion of successful (in a binary sense) samples in the batch (i.e. if the batch is only one sample, the reward can be only 0 or 1, if it is 2 samples, it can be 0, 0.5 or 1, etc. - the logic is in the n_success_failures method in the e-greedy implementation). If, for example, in your application you only ever send one sample per batch and your rewards are arbitrary floats, I would suggest modifying the e-greedy router component to your needs (i.e. get rid of the n_success_failures method and use the raw reward). |
Thank you for the help and explanation, this was really useful! |
* Add tag pattern to ignore RC builds in generated release notes * Add changelog handling for core versions vs. RC builds, nightlies, etc. * Remove duplicate release notes generation command * Use GITHUB_ENV instead of exporting vars * Use simple toggle not interpolated auto-changelog args
I created a MAB by using the Kubeflow template seldon-mab-v1alpha2.
It seems that the router is stuck at best branch 0, no matter what feedback I send in. The counters are not changing any time, constant 0's are printed:
This is the feedback sample I am sending in:
Any hints? Anyone tested the MAB router before?
The text was updated successfully, but these errors were encountered: