-
Notifications
You must be signed in to change notification settings - Fork 17
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Simulate logic adds extra case in which UCB is not used #21
Comments
if n=0 and ltn = 0, the sqrt(ltn/n) calculation will produce a NaN. Can you think of any other way to handle this? |
It seems like instead of producing a NaN, we are assigning 0/0 a value of 0. Assigning 0 to undefined expression 0/0 does help in disabling UCB for all children nodes (even those with One failing edge case is pure exploration (completely random policy) where exploration coefficient
The easy solution to this is to disable UCB for all children nodes if ltn=0. This way we avoid the
If this edge case is important enough to address, I can send over a PR. |
I am not sure of the logic behind adding
ltn<= 0.0
in the simulate() function. It implies that for two first two tree queries (ltn
=-Inf
and0
), it uses the node value as criterion. I understand that we use node value whenltn = -Inf
, but don't understand why it is the case forltn = 0
Ref this line in simulate()
The text was updated successfully, but these errors were encountered: