Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

tunables for client allocation update frequency #17869

Open
tgross opened this issue Jul 10, 2023 · 0 comments
Open

tunables for client allocation update frequency #17869

tgross opened this issue Jul 10, 2023 · 0 comments

Comments

@tgross
Copy link
Member

tgross commented Jul 10, 2023

In #17354 we added prioritization to client allocation update frequency. Updates are sent on a 1000ms interval unless there's a high-priority update (like a failed allocation) which shrinks the interval to 200ms for that batch.

We didn't want to expose this behavior as a required tunable because users should mostly not have to care about it. The user-facing behavior didn't meaningfully change for most users because the speed of failure detection was unchanged. But for users who don't care about liveness so much (mostly folks running huge batch workloads), it could be valuable to have a configuration knob that lets them increase the interval even further. For example, if you're not serving live requests from your workloads, you might not care about the difference between 200ms and 10s in terms of failure recovery.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant