DNS queries with unknown datacenters can cause excessive load on consul servers and force agents to run out of file descriptors

If a consul agent receives DNS queries of the form `someservice.service.falsedc.domain.consul` these queries will cause excessive load on the consul servers along with log lines of the form `[WARN] consul.rpc: RPC request for DC 'falsedc', no path found`.  At a glance it seems like the server should fail early when it cannot find the datacenter, but instead recurses until the request's TTL is reached and dropped. 

Furthermore the agent that received the query will show log lines of the form `[ERR] dns: rpc error: rpc error: No path to datacenter`.  Furthermore if the agent receives these queries at a moderate rate it will eventually run out of file descriptors. I suspect that perhaps a new socket is opened for each pending query.  This is not necessarily bad as responses should be fast, but the first part of this issue causes consul to open more and more sockets until it can't open any more.  The errors from this scenario also cause the consul agent to write gigabytes of logs within minutes.  

The issue can be replicated on a Linux system which has the consul agent set as its nameserver (e.g. via binding to port 53 or via dnsmasq) by adding `domain.consul` to the search domains in `/etc/resolv.conf` (e.g. `search domain.consul`) and running queries of the format `someservice.service.domain.consul`, which get expanded by the resolver to `someservice.service.domain.consul.domain.consul`.  However I'm fairly certain that this is just a special case, and that the issue should be reproducible with any nonexisting datacenter and any consul domain. 


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

DNS queries with unknown datacenters can cause excessive load on consul servers and force agents to run out of file descriptors #807

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

DNS queries with unknown datacenters can cause excessive load on consul servers and force agents to run out of file descriptors #807

Description

Activity

armon commented on Mar 24, 2015

armon commented on May 5, 2015

armon commented on May 5, 2015

frankfarmer commented on May 29, 2015

primal-github commented on May 29, 2015

igoratencompass commented on Jul 6, 2018

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

Issue actions