Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Work Queue Latency increased 10 times #428

Open
eshikhov opened this issue Dec 3, 2024 · 5 comments
Open

Work Queue Latency increased 10 times #428

eshikhov opened this issue Dec 3, 2024 · 5 comments

Comments

@eshikhov
Copy link

eshikhov commented Dec 3, 2024

When applying Linstore CRDs (group internal.linstor.linbit.com) Work Queue Latency increased 10 times

I rolled these CRDs into different clusters with the same effect everywhere

Only after the CRDs are removed, the delay of the work queue returns to normal time. On the graph you can see the moment of installing and removing CRDs.

Снимок экрана 2024-11-27 в 18 33 21

Environment
OS: Talos v1.8.3
Kubernetes version: v1.30.7
Linstore CRDs: v1_27_1

@ghernadi
Copy link
Contributor

ghernadi commented Dec 3, 2024

I am not fully sure if I understand the issue here. Since I am not a k8s expert, please correct me if I'm wrong, but this is how I understand what you are saying:

The pure existence of LINSTOR's CRDs make k8s itself slower (higher worker queue latency).

If my understanding is correct, I am not really sure what LINSTOR can do about this. Can you try to create some arbitrary CRDs and check if k8s also gets slower from those? If so, I'd suggest you open an issue (or question if this is a known behavior) in k8s GitHub. Unless someone can point us to something that LINSTOR is doing wrong (apart from "using CRDs"), I do not see how we can help here.

@eshikhov
Copy link
Author

eshikhov commented Dec 3, 2024

Yes, you are correct, installing LINSTOR's CRDs makes k8s itself slower (higher worker queue latency).
Installing other CRDs, even more than LINSTOR's, has no negative impact.

@vladimirfx
Copy link

The pure existence of LINSTOR's CRDs make k8s itself slower (higher worker queue latency).

Yes, you understand correctly. Only Linstore CRDs slow down openapi_v3_contoller in order of magnitude. Other CRDs (a few hundred) do not affect that controller. Unfortunately, I can't find relevant K8s issues.
What is interesting is that when CRDs are modified by removing old versions - the response time for openapi_v3_contoller is normalized.

@eshikhov
Copy link
Author

eshikhov commented Dec 3, 2024

Tried it in two different clusters
One distribution on talos Kubernetes version v1.30.7
the other on k3s Kubernetes version v1.23.16

On different distributions and versions of Kubernetes Work Queue Latency increased

@ghernadi
Copy link
Contributor

ghernadi commented Dec 4, 2024

Sorry for the silence.. we are investigating and will try to come up with a solution, but since our todo-lists are quite full, it might take a while.

Thanks anyways for reporting!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants