(DoS vector?) Peers are removed from the routing table on Disconnect #45

JustinDrake · 2017-01-23T12:52:56Z

Currently when a peer disconnects, the peer is removed from the routing table. To me this seems needlessly aggressive. Shouldn't the Kademlia least-recently seen eviction policy deal with clearing inactive nodes? Certainly a node shouldn't be evicted from the routing table for a temporary disconnection.

I don't see this remove-on-disconnect policy in the Kademlia whitepaper, and to my eye it is a DoS attack vector. An attacker can flood a victim at the network level to force the temporary closure of all its connections. This would flush the node's routing state, and the attacker could then fill the victim's routing table with bad nodes.

Even in a non-hostile scenario, if a node is temporary shut off from the internet (e.g. for just a few minutes), then it needlessly has to repopulate its routing table from scratch.

Am I missing something obvious?

cpacia · 2017-01-23T14:47:50Z

The Kademlia paper assumes UDP so there isn't any concept of a disconnection. It only removes them when you try to send them a rpc message and it times out.

JustinDrake · 2017-01-23T15:05:38Z

It only removes them when you try to send them a message and it times out.

Yes, I think that's right. To quote the white paper:

If the appropriate k-bucket is full, however, then the recipient pings the k-bucket’s least-recently seen node to decide what to do. If the least-recently seen node fails to respond, it is evicted from the k-bucket and the new sender inserted at the tail

Eviction should happen if the following three conditions are met:

The relevant k-bucket is full
The least-recently seen node is pinged
The least-recently seen node fails to pong

Eviction on disconnect is probably too aggressive.

whyrusleeping · 2017-02-10T19:12:42Z

@JustinDrake hrm... youre right. We should fix this.

whyrusleeping · 2017-02-10T19:14:19Z

We should probably set a reasonably short timeout on the eviction ping though, don't want to sit around for ages waiting on that.

JustinDrake · 2017-02-13T12:58:43Z

Awesome. I'd love to have this bug fixed for the public release of OpenBazaar 2.0 as it affects Duo. I'd be happy to help, e.g. by doing a code review.

As mentioned in #31, we probably want to use the Kademlia PING message.

We should probably set a reasonably short timeout on the eviction ping

Yes. To speed things up further we can use heuristics such as:

If the node was last seen less than (say) 5 minutes ago, assume that the node is online and don't evict.
If the node was last seen more than (say) 1 day ago, assume that the node is offline and evict without a PING.

whyrusleeping · 2017-03-04T07:35:14Z

@JustinDrake Sounds good. I'll try and get this done ASAP for you guys.

JustinDrake · 2017-07-17T14:32:57Z

@whyrusleeping Have you made any progress on this? It would be great to have this for the initial release of OpenBazaar 2.0.

FrankSzendzielarz · 2017-12-01T11:27:07Z

Just a quick note on the above thread, as I have recently been studying Kademlia for the Ethereum peer discovery protocol and have it fresh in my mind. The M&M paper contradicts itself somewhat in more than one place. The above observation that the oldest node in the k-bucket should be pinged is correct, but the paper supersedes itself towards the end of the document by advising against doing that, as it could cause a ping storm. They recommend maintaining a cache of possible replacement nodes, but as cpacia correctly points out, to only remove the oldest kbucket node when a meaningful rpc call fails at some later stage, replacing the oldest node with the most recent from the replacement cache. Just an FYI.

whyrusleeping · 2017-12-01T12:27:09Z

@FrankSzendzielarz interesting observation, thank you!

anacrolix · 2019-02-14T00:36:11Z

Let's please keep this issue on topic, if there are observations or comments unrelated to the disconnect behaviour please continue those in an appropriate issue. I am currently looking into the disconnect behaviour.

Stebalien · 2021-07-21T04:22:40Z

Fixed.

whyrusleeping added the kind/bug A bug in existing code (including security flaws) label Feb 10, 2017

JustinDrake mentioned this issue Feb 13, 2017

[Meta] Protocol-level changes to consider before launch OpenBazaar/openbazaar-go#363

Closed

25 tasks

whyrusleeping self-assigned this Mar 4, 2017

JustinDrake mentioned this issue Apr 14, 2017

Comments on altcoins.md OpenBazaar/openbazaar-go#508

Closed

Stebalien mentioned this issue Aug 31, 2017

Your attention has been requested on... whyrusleeping/todo#42

Open

whyrusleeping added the status/ready Ready to be worked label Oct 17, 2017

whyrusleeping mentioned this issue Feb 13, 2018

DHT Query Performance #88

Closed

laser mentioned this issue May 15, 2018

Peers are removed from the routing table on disconnect (go-libp2p-kad-dht) filecoin-project/venus#439

Closed

bigs added the exp/expert Having worked on the specific codebase is important label Sep 11, 2018

anacrolix assigned anacrolix and unassigned whyrusleeping Jan 17, 2019

anacrolix mentioned this issue Feb 14, 2019

Analysis: k-bucket underutilisation and excessive thrashing #194

Closed

anacrolix removed their assignment Dec 10, 2019

Stebalien closed this as completed Jul 21, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(DoS vector?) Peers are removed from the routing table on Disconnect #45

(DoS vector?) Peers are removed from the routing table on Disconnect #45

JustinDrake commented Jan 23, 2017

cpacia commented Jan 23, 2017 •

edited

Loading

JustinDrake commented Jan 23, 2017

whyrusleeping commented Feb 10, 2017

whyrusleeping commented Feb 10, 2017

JustinDrake commented Feb 13, 2017

whyrusleeping commented Mar 4, 2017

JustinDrake commented Jul 17, 2017

FrankSzendzielarz commented Dec 1, 2017

whyrusleeping commented Dec 1, 2017

anacrolix commented Feb 14, 2019

Stebalien commented Jul 21, 2021

(DoS vector?) Peers are removed from the routing table on Disconnect #45

(DoS vector?) Peers are removed from the routing table on Disconnect #45

Comments

JustinDrake commented Jan 23, 2017

cpacia commented Jan 23, 2017 • edited Loading

JustinDrake commented Jan 23, 2017

whyrusleeping commented Feb 10, 2017

whyrusleeping commented Feb 10, 2017

JustinDrake commented Feb 13, 2017

whyrusleeping commented Mar 4, 2017

JustinDrake commented Jul 17, 2017

FrankSzendzielarz commented Dec 1, 2017

whyrusleeping commented Dec 1, 2017

anacrolix commented Feb 14, 2019

Stebalien commented Jul 21, 2021

cpacia commented Jan 23, 2017 •

edited

Loading