Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enable NodeLocal DNSCache #7885

Closed
sagor999 opened this issue Jan 27, 2022 · 4 comments
Closed

Enable NodeLocal DNSCache #7885

sagor999 opened this issue Jan 27, 2022 · 4 comments
Labels
team: workspace Issue belongs to the Workspace team

Comments

@sagor999
Copy link
Contributor

Is your feature request related to a problem? Please describe

https://kubernetes.io/docs/tasks/administer-cluster/nodelocaldns/
Why we need this?
It will greatly improve our dns performance as well as dns resiliency.
Majority of dns requests from each node will stay on the node itself by hitting local cache, greatly reducing pressure on coredns, as well udp traffic to coredns (that sometimes might get dropped, lost).
This also reduces pressure on conntrack table, since conntrack would not need to track those requests now.

Describe the behaviour you'd like

Describe alternatives you've considered

Additional context

@sagor999 sagor999 added the team: workspace Issue belongs to the Workspace team label Jan 27, 2022
@aledbf
Copy link
Member

aledbf commented Jan 27, 2022

It will greatly improve our dns performance as well as dns resiliency.

We need metrics about the current state

This also reduces pressure on conntrack table, since conntrack would not need to track those requests now.

Before this route, we need to know the size of the conntrack table.

Why?

@sagor999
Copy link
Contributor Author

Our current conntrack table usage at peak from last two weeks is about 60%, though it is coming from meta cluster:
topk(20, node_nf_conntrack_entries/node_nf_conntrack_entries_limit)

I agree though this is a big change and probably we can hold off on this one for now.

@kylos101 kylos101 moved this to Scheduled in 🌌 Workspace Team Jan 28, 2022
@kylos101 kylos101 removed the status in 🌌 Workspace Team Feb 9, 2022
@kylos101
Copy link
Contributor

kylos101 commented Feb 9, 2022

Removing this from scheduled work for now, moved to the "Cluster and Node Improvements" epic. I consider that to be a short lived backlog. @aledbf @sagor999 we things end up rotting there I'll plan to close them. But this way, they exit our inbox.

@sagor999
Copy link
Contributor Author

sagor999 commented Feb 9, 2022

Yeah, I think this one can be closed for the time being. If we will start seeing high load on coredns pods, or excessive latency for dns queries, then we can return back to exploring using nodelocal dns cache for that.
So I will go ahead and close this issue, we can always re-open it in the future.

@sagor999 sagor999 closed this as completed Feb 9, 2022
@sagor999 sagor999 moved this to Done in 🌌 Workspace Team Feb 9, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
team: workspace Issue belongs to the Workspace team
Projects
No open projects
Archived in project
Development

No branches or pull requests

3 participants