This repository has been archived by the owner on Jun 20, 2024. It is now read-only.

[dns] caching for local domain #225

Closed
rade opened this issue Nov 23, 2014 · 23 comments

Comments

@rade
Member

rade commented Nov 23, 2014

At present, weavedns first attempts to resolve names against the records from local containers. If that fails, it asks other peers and returns the first answer. This has some significant shortcomings:

  1. every query for a name that belongs to a remote container requires a broadcast and a network round trip, which introduces latency, increases network and CPU load, and doesn't scale.
  2. the same occurs for a name that doesn't resolve at all.
  3. if a name is recorded more than once, we are entirely at the mercy of underlying network conditions as to which answer we return, or whether we return an answer at all.

The obvious solution is to introduce caching.

In its most basic form a dns peer would:

  • remember all answers returned from a broadcast query, including their ttls
  • if no answers are returned (based on some timeout), remember that too, again with a ttl
  • expire entries based on their ttl
  • answer queries from the cache if possible

The shortcomings of this basic approach are principally around the danger of returning stale entries vs achieving good performance and scalability. Still, I reckon it's good enough for starters, and even with a short ttl it's an improvement over the present situation.
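
A minimal sketch of what such a cache could look like, in Go (all the names and types below are illustrative, not weavedns's actual API):

```go
package cache

import (
	"sync"
	"time"
)

// entry holds either a set of resolved addresses or a negative ("no answer") result.
type entry struct {
	addrs    []string  // empty for a negative entry
	expires  time.Time // the entry is no longer valid after this point
	negative bool
}

// Cache is a minimal TTL cache for answers gathered from broadcast queries.
type Cache struct {
	mu      sync.Mutex
	entries map[string]entry
}

func New() *Cache {
	return &Cache{entries: make(map[string]entry)}
}

// Put remembers the answers for a name; an empty addrs records a negative answer.
func (c *Cache) Put(name string, addrs []string, ttl time.Duration) {
	c.mu.Lock()
	defer c.mu.Unlock()
	c.entries[name] = entry{
		addrs:    addrs,
		expires:  time.Now().Add(ttl),
		negative: len(addrs) == 0,
	}
}

// Get answers a query from the cache if possible; expired entries are dropped lazily.
// found reports whether the cache had anything (positive or negative) to say.
func (c *Cache) Get(name string) (addrs []string, found bool) {
	c.mu.Lock()
	defer c.mu.Unlock()
	e, ok := c.entries[name]
	if !ok || time.Now().After(e.expires) {
		delete(c.entries, name)
		return nil, false
	}
	return e.addrs, true // nil addrs here means a cached negative answer
}
```

A resolver would consult `Get` before broadcasting, and call `Put` with whatever the broadcast returned (or with no addresses once the timeout fires), so both positive and negative answers are served from the cache until their TTL expires.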

@rade
Member Author

rade commented Nov 23, 2014

> the danger of returning stale entries vs achieving good performance and scalability

One possibility here is to invert the information flow, moving from 'pull' to 'push', e.g. by gossiping records between peers. The trouble with doing that naively is that we'd end up with every record everywhere, which doesn't scale.

@inercia
Contributor

inercia commented Dec 9, 2014

Nodes could share their DNS records by piggybacking their local DNS knowledge in gossip messages, or by explicitly pushing information to some random peers (memberlist uses this mechanism). This would increase the hit rate when looking up DNS names in the local records and, in case of a miss, the nodes could fall back to the regular resolution mechanism...

@rade
Member Author

rade commented Dec 9, 2014

As I said above, the trouble with naive 'push' is that we end up with every record everywhere.

@inercia
Contributor

inercia commented Dec 10, 2014

But this is not necessarily a bad thing, right? Unless you have a huge number of records, having all the records on all the nodes is not bad IMO. Nodes should just have a hard limit on the memory used for this table, though...

@rade
Member Author

rade commented Dec 10, 2014

It's not just about memory use but also the amount of information you need to shunt around the network. Storing every record everywhere is bad. Transmitting every record everywhere is bad too.

@inercia
Contributor

inercia commented Dec 10, 2014

Indeed, the push protocol would have to be carefully designed, but this kind of gossip protocol, in the same spirit as SWIM, wouldn't add too much overhead for small networks. I think you could extrapolate from the results of this simulator to estimate how much traffic it would add...

@rade
Member Author

rade commented Dec 10, 2014

The protocol overhead isn't the problem. The size of the data and the frequency of updates are; both can be expected to increase linearly with scale-out, which will overwhelm network and processing capacity at some point.

In most networks there will be a natural segmentation of the namespace - you won't have all nodes querying DNS for all names. There is therefore no good reason for nodes to be told about, or hold onto, DNS entries that their local containers have never asked for. The only exception is reducing latency for the very first query for a name, but I'd rather give that up than design something inherently unscalable.

That's not to say that a naive push couldn't be good enough for starters, just as the (somewhat less naive, but still imperfect) pull is.

Btw, another difference between 'push' and 'pull' is that the former requires a custom protocol whereas 'pull' can be based on standard DNS. Using standard protocols is incredibly handy for debugging and tooling in general. It's not a major design pivot though; features and performance are more important.

@inercia
Contributor

inercia commented Dec 10, 2014

Then I would start with piggybacked propagation. When a node X sends a UDP packet to a node Y, it could add some random entries from the local DNS cache if there is space remaining in the packet (up to the maximum capacity, the path MTU). This method would not replace the current "pull" mechanism, but it would increase the hit ratio, a first step towards a more elaborate solution...
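
A rough sketch of that piggybacking step (the `Record` type, encoding and function names here are made up for illustration, not weave's actual gossip format):

```go
package gossip

import (
	"fmt"
	"math/rand"
)

// Record is a hypothetical name->address pair with its remaining TTL in seconds.
type Record struct {
	Name string
	Addr string
	TTL  uint32
}

// encode stands in for whatever wire format the gossip messages actually use.
func encode(r Record) []byte {
	return []byte(fmt.Sprintf("%s %s %d\n", r.Name, r.Addr, r.TTL))
}

// Piggyback appends randomly chosen cache entries to an outgoing packet,
// stopping as soon as the next entry would push the packet past the path MTU.
func Piggyback(packet []byte, cache []Record, mtu int) []byte {
	for _, i := range rand.Perm(len(cache)) { // random selection, as suggested above
		encoded := encode(cache[i])
		if len(packet)+len(encoded) > mtu {
			break
		}
		packet = append(packet, encoded...)
	}
	return packet
}
```

On the receiving side, whatever entries arrived would simply be merged into the local cache, so the existing "pull" path is only needed on a miss.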

@inercia inercia self-assigned this Feb 16, 2015
inercia added a number of commits to inercia/weave referencing this issue between Feb 17 and Mar 9, 2015, with messages including "…self-contained", "improved API, more friendly for the `DNSServer` class", "…ht TTL to answers, as well as the authoritative flag", "increase logs verbosity in the unit tests", and "fix in getReply(): only return something when we really have a reply..."
@inercia
Contributor

inercia commented Mar 9, 2015

So if a mapping in X exists for that name, X["A"] -> Z, and Y registers that name, you are saying that Y should broadcast the new mapping X["A"] -> Y (equivalent to "please remove X["A"]" but with added information). But we would be using a push model then, with the all peers broadcasting all mappings problem, right? Wouldn't it be better to stick to relatively short TTLs and let peers ask for X["A"] when they are asked by a client?

@rade
Member Author

rade commented Mar 9, 2015

> Wouldn't it be better to stick to relatively short TTLs and let peers ask for X["A"] when they are asked by a client?

The whole point of doing something more complicated is to be able to use a longer TTL. The shorter the ttl the more DNS traffic there is on the network.

> We would be using a push model then, with the all peers broadcasting all mappings problem

Peers can just broadcast "I am interested in name A" when resolving A and the cache misses. In fact this is what happens now; the mDNS broadcast contains just that information. But we need more than that:

  • recipient peers should remember that association, so that they can subsequently inform interested peers when a name is added/removed locally
  • the "I am interested in name A" information needs to be transmitted more reliably; gossip communication between peers can take care of that.
  • new peers need a way of obtaining the "I am interested in name A" map from all existing peers; gossip communication can take care of that.
  • cache entries need to be removed/invalidated when peers disappear.

All of this is very similar to what we are doing in #390 for communicating IP reservations between peers.
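
A sketch of the bookkeeping behind the first bullet, plus the clean-up needed when a peer goes away (the peer type and every name below are hypothetical, not weave's actual gossip API):

```go
package nameinterest

import "sync"

// PeerName stands in for however weave identifies a peer.
type PeerName string

// InterestMap remembers which peers have asked about which names, so that a
// peer can notify exactly those peers when one of its local containers adds
// or removes a name.
type InterestMap struct {
	mu     sync.Mutex
	byName map[string]map[PeerName]struct{}
}

func New() *InterestMap {
	return &InterestMap{byName: make(map[string]map[PeerName]struct{})}
}

// Register records that peer has resolved (and therefore cares about) name.
func (m *InterestMap) Register(name string, peer PeerName) {
	m.mu.Lock()
	defer m.mu.Unlock()
	peers, ok := m.byName[name]
	if !ok {
		peers = make(map[PeerName]struct{})
		m.byName[name] = peers
	}
	peers[peer] = struct{}{}
}

// InterestedPeers returns the peers to inform when name is added or removed locally.
func (m *InterestMap) InterestedPeers(name string) []PeerName {
	m.mu.Lock()
	defer m.mu.Unlock()
	peers := make([]PeerName, 0, len(m.byName[name]))
	for p := range m.byName[name] {
		peers = append(peers, p)
	}
	return peers
}

// Forget drops all interest recorded for a peer that has disappeared.
func (m *InterestMap) Forget(peer PeerName) {
	m.mu.Lock()
	defer m.mu.Unlock()
	for _, peers := range m.byName {
		delete(peers, peer)
	}
}
```

The remaining bullets - spreading this map reliably via gossip, handing it to new peers, and invalidating cache entries when a peer disappears - would sit on top of the same structure.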

@inercia
Contributor

inercia commented Mar 9, 2015

> The whole point of doing something more complicated is to be able to use a longer TTL. The shorter the ttl the more DNS traffic there is on the network.

Obviously we want to increase the TTL, but I think we both agree that mDNS is a clearly limited solution and we must use short TTLs until we have a better solution...

> All of this is very similar to what we are doing in #390 for communicating IP reservations between peers.

That would be a nice change but it would involve a lot of work, and I'm a bit concerned about this extra communication step between WeaveDNS and the router, but it can be done...

inercia added further commits to inercia/weave that referenced this issue on Mar 10, 2015.
@squaremo squaremo changed the title [dns] caching [dns] cache local RRs Mar 16, 2015
@inercia
Contributor

inercia commented Mar 16, 2015

@squaremo "caching" to "cache local RRs"? Why only the local RRs? Don't we want to cache replies obtained recursively?

@squaremo
Contributor

"caching" to "cache local RRs"? why only the local RRs?

Because that's what the issue actually describes and the subsequent comments discuss.
(By local I mean "in .weave.local" of course)

@inercia
Contributor

inercia commented Mar 18, 2015

Maybe we could create another issue for moving WeaveDNS to gossip communication; then we could start discussing how to do it... (and @rade would probably be the right person for filing the issue).

@rade
Member Author

rade commented Mar 18, 2015

We could, but that is quite a long way down the road. And gossiping is an implementation detail - issues should describe features/problems.

@rade rade unassigned inercia Apr 8, 2015
@rade rade changed the title [dns] cache local RRs [dns] caching for local domain Apr 20, 2015
@rade rade added this to the 0.10.0 milestone Apr 20, 2015
@rade rade removed icebox labels Apr 20, 2015
@rade
Member Author

rade commented Apr 20, 2015

@alvaro I think this is resolved by #429 provided there is some clarification on the following part of this issue's description: "In its most basic form a dns peer would: remember all answers returned from a broadcast query, including their ttls". We are not doing that, are we? i.e. if we get responses from multiple servers, the cache entries get overwritten, so we only keep the response from one server.

I am ok with that, but please point out (and if necessary create) the issues where that is addressed. #226 is a possible candidate, but looks quite broad.

@inercia
Contributor

inercia commented Apr 20, 2015

@rade The current implementation does not cache all the responses it gets from the network, only the first one. When WeaveDNS is asked about a local name it will:

  1. send an mDNS query and wait for up to 500ms for an answer
  2. get the first answer
  3. store that answer in the cache and
  4. return that answer to the client

Subsequent answers to the mDNS query are simply ignored. The answer is always cached for 30 seconds (we are currently using a hardcoded TTL for local answers). So we are caching the response from the fastest peer...
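
As a reading aid, that flow amounts to something like this (the timings are the ones quoted above; the types, function names and error handling are illustrative only):

```go
package resolver

import (
	"errors"
	"time"
)

// Cache is whatever answer store the resolver uses; only the operations
// needed for this flow are shown.
type Cache interface {
	Get(name string) (addr string, ok bool)
	Put(name, addr string, ttl time.Duration)
}

// localTTL is the hardcoded TTL for local answers mentioned above.
const localTTL = 30 * time.Second

// ResolveLocal sketches the behaviour described above: send an mDNS query,
// take the first answer to arrive within 500ms, cache it for 30 seconds and
// return it; later answers to the same query are simply ignored.
func ResolveLocal(name string, c Cache, query func(string) <-chan string) (string, error) {
	if addr, ok := c.Get(name); ok {
		return addr, nil
	}
	answers := query(name) // 1. send the mDNS query
	select {
	case addr := <-answers: // 2. take the first answer only
		c.Put(name, addr, localTTL) // 3. cache it with the hardcoded 30s TTL
		return addr, nil // 4. return it to the client
	case <-time.After(500 * time.Millisecond):
		return "", errors.New("no answer within 500ms")
	}
}
```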

Multiple responses from peers will be processed and remembered when #338 is addressed (I'm working on that), and, as a side effect, we will then have an initial solution for #226.

@rade
Member Author

rade commented Apr 20, 2015

ok. #338 is about a single peer returning multiple responses, i.e. when there are multiple containers with the same name on that peer.

@rade
Member Author

rade commented Apr 21, 2015

Closed by #429.
