Improve node search over k-buckets (getClosestLocalNode) #212

joshuakarp · 2021-07-21T08:12:53Z

Specification

To optimise reading out the closes nodes to a target node we need to apply some improvements.

The getClosestNode function needs to take a nodeId and limit as parameters. The nodeId is the node we're calculating the distance relative to. The limit is how many nodes we wish to return. The limit defaults to the nodeBucketLimit as per the Kademlia spec.

We need to avoid reading out all of the buckets and iterating over empty buckets. This can be achieved by using a readStream over the nodeGraphBucketsDb level. This level contains sub levels for each bucket. Each sub level contains the nodeId:nodeInfo key:value pairs. Using the nodeGraphBucketsDb level we can iterate over each stored node in bucket order all at once. Note when setting the gt or lt on the stream we need to start from the desired bucket. In this case the starting point is the bucket 'above' the desired starting bucket. the key we want to start from takes the form of a Buffer with <prefix><higherBucketId><prefix>. Iterating less than this gives us the target bucket plus all lower buckets. Above this is all of the higher buckets.

When we run out of lower buckets we need to iterate over the higher buckets from where we started. If we run into limit while iterating over the nodes we need to get the whole of the last bucket we read. since nodes are out of order within a bucket we need whole buckets to ensure we obtain the closest nodes.

The resulting list is sorted by distance using nodesUtils.bucketSortByDistance and the list is truncated down to the provided limit.

As implemented

We iterate over the nodes directly across the buckets. the nodes are read out in the following order.

all nodes within the target bucket N
nodes in order of bucket 0 to bucket N-1
nodes in order of bucket N+1 to 255.

When we reach our specified limit we read the whole of the last bucket we've read and add that to the list. we then sort all of the nodes and truncate the list back down to the limit and return that.

Additional context

discussion thread https://gitlab.com/MatrixAI/Engineering/Polykey/js-polykey/-/merge_requests/195#note_612883452 (text above copied from a previous comment of mine)
Improve node search over k-buckets (getClosestLocalNode) #212 (comment)

Tasks

1. Update getClosestNodes implementation to use a readStream to iterate over each node sequentially across all of the buckets.
2. if we run out of 'closer' buckets we iterate over the higher buckets.
3. buckets are not ordered via distance, so if we read any node from a bucket into the list we need to get the whole bucket.
4. the returned list needs to be sorted by distance and truncated down to the limit.

The text was updated successfully, but these errors were encountered:

CMCDragonkai · 2022-03-11T05:32:46Z

With the NodeGraph using sublevels to represent buckets, you can now use createReadStream to stream to the "left" from the designated bucket and NodeId to be looked up. You just need to use reverse: true and lt with a synthesised seek key based on the bucket key and NodeId. See nodes/utils for the utilities to create that seek key. Use limit: k to limit the number of nodes to find.

If you cannot fill out the k buffer, then just start streaming to the right.

Then you can return it.

CMCDragonkai · 2022-03-12T15:45:53Z

Reposting from #326 (comment)

I've done some further prototyping on the NodeIds that are in each bucket. And this has an important impact on how we do the getClosestNodes function, and any visualisation of the NodeGraph.

The first thing to realise is that NodeId is just a sequence of bytes, they can be 1 byte or 32 bytes or whatever. They are set to 32 bytes, but it's easier to explain everything by thinking of them with only 1 byte.

And because they are just a sequence of bytes, there is a numeric equivalence of NodeId. 0 - 255 can be represented as 1 byte, and 0 - 65535 can be represented with 2 bytes.

The Buffer.compare function compares pair-wise bytes using the numeric representation. So [255, 0] is less than [255, 1].

Buffer.compare(Buffer.from([255, 0]), Buffer.from([255, 1])) === -1

Note that this means our NodeId are in big-endian format.

The buckets sublevel stores keys in lexicographic order, this means we are in ascending order of bucket indexes by default, and then in each bucket, the node IDs are also in lexicographic order. This is achieved by lexi.pack of the bucket index, and storing raw NodeId buffers as the final keys.

I believe lexicographic order of the raw NodeId buffers are the same as the numeric order of the bigints that the NodeIds would represent (remember that 32 bytes can represent a VERY large number, so that's why we use bigint).

A 1-byte NodeId like 00000001 would be just 1 in numeric order and would look like [1]. I have to check though, by adding an additional test to js-db that proves that lexicographic order of binary keys is the same order as produced by Buffer.compare which just compares the numeric values in each byte.

Supposing lexicographic order of NodeId is equal to numeric order of NodeId, the next question is whether this order can be used to produce NodeIds that are ordered by Kadmelia XOR distance. So I tried it out because it looked like it did in the wiki image:

However upon testing with more node IDs, this did not end up being the case.

The end result is that for any given NodeId, the closest NodeId is a directly adjacent NodeId with an XOR distance of 1, this corresponds to a bit-wise output of XOR of just 1, whether that is 001 or 000000...00001.

    101
xor 100
    001

This can be applied to any NodeId using XOR, because the inverse of XOR is XOR.

So:

5^1 == 4
76^1 == 77

Afterwards, all other NodeId are grouped into buckets on either side of the numeric line.

Then each bucket has NodeId, and the numeric/lexicographic order is not the same as if it were sorted by distance.

Here's an example to prove this:

    // Create 100 NodeIds
    // They all have a size of 1 byte
    // Numerically they represent 0 - 99
    const nodeIds: Array<NodeId> = Array.from(
      {length: 100},
      (_, i) => IdInternal.create<NodeId>(utils.bigInt2Bytes(BigInt(i), 1))
    );

    const results: Array<any> = [];

    for (let i = 0; i < nodeIds.length; i++) {
      let bucketIndex;
      let distance;
      try {
        // 77 is the chosen NodeId
        bucketIndex = nodesUtils.bucketIndex(nodeIds[77], nodeIds[i]);
        distance = nodesUtils.nodeDistance(nodeIds[77], nodeIds[i]);
      } catch (e) {
        // Ignore the chosen NodeId
      }
      results.push(
        [
          i,
          nodeIds[i],
          bucketIndex,
          distance
        ]
      );
    }

    console.log(results);

The output is:

    [
      [ 0, IdInternal(1) [Uint8Array] [ 0 ], 6, 77n ],
      [ 1, IdInternal(1) [Uint8Array] [ 1 ], 6, 76n ],
      [ 2, IdInternal(1) [Uint8Array] [ 2 ], 6, 79n ],
      [ 3, IdInternal(1) [Uint8Array] [ 3 ], 6, 78n ],
      [ 4, IdInternal(1) [Uint8Array] [ 4 ], 6, 73n ],
      [ 5, IdInternal(1) [Uint8Array] [ 5 ], 6, 72n ],
      [ 6, IdInternal(1) [Uint8Array] [ 6 ], 6, 75n ],
      [ 7, IdInternal(1) [Uint8Array] [ 7 ], 6, 74n ],
      [ 8, IdInternal(1) [Uint8Array] [ 8 ], 6, 69n ],
      [ 9, IdInternal(1) [Uint8Array] [ 9 ], 6, 68n ],
      [ 10, IdInternal(1) [Uint8Array] [ 10 ], 6, 71n ],
      [ 11, IdInternal(1) [Uint8Array] [ 11 ], 6, 70n ],
      [ 12, IdInternal(1) [Uint8Array] [ 12 ], 6, 65n ],
      [ 13, IdInternal(1) [Uint8Array] [ 13 ], 6, 64n ],
      [ 14, IdInternal(1) [Uint8Array] [ 14 ], 6, 67n ],
      [ 15, IdInternal(1) [Uint8Array] [ 15 ], 6, 66n ],
      [ 16, IdInternal(1) [Uint8Array] [ 16 ], 6, 93n ],
      [ 17, IdInternal(1) [Uint8Array] [ 17 ], 6, 92n ],
      [ 18, IdInternal(1) [Uint8Array] [ 18 ], 6, 95n ],
      [ 19, IdInternal(1) [Uint8Array] [ 19 ], 6, 94n ],
      [ 20, IdInternal(1) [Uint8Array] [ 20 ], 6, 89n ],
      [ 21, IdInternal(1) [Uint8Array] [ 21 ], 6, 88n ],
      [ 22, IdInternal(1) [Uint8Array] [ 22 ], 6, 91n ],
      [ 23, IdInternal(1) [Uint8Array] [ 23 ], 6, 90n ],
      [ 24, IdInternal(1) [Uint8Array] [ 24 ], 6, 85n ],
      [ 25, IdInternal(1) [Uint8Array] [ 25 ], 6, 84n ],
      [ 26, IdInternal(1) [Uint8Array] [ 26 ], 6, 87n ],
      [ 27, IdInternal(1) [Uint8Array] [ 27 ], 6, 86n ],
      [ 28, IdInternal(1) [Uint8Array] [ 28 ], 6, 81n ],
      [ 29, IdInternal(1) [Uint8Array] [ 29 ], 6, 80n ],
      [ 30, IdInternal(1) [Uint8Array] [ 30 ], 6, 83n ],
      [ 31, IdInternal(1) [Uint8Array] [ 31 ], 6, 82n ],
      [ 32, IdInternal(1) [Uint8Array] [ 32 ], 6, 109n ],
      [ 33, IdInternal(1) [Uint8Array] [ 33 ], 6, 108n ],
      [ 34, IdInternal(1) [Uint8Array] [ 34 ], 6, 111n ],
      [ 35, IdInternal(1) [Uint8Array] [ 35 ], 6, 110n ],
      [ 36, IdInternal(1) [Uint8Array] [ 36 ], 6, 105n ],
      [ 37, IdInternal(1) [Uint8Array] [ 37 ], 6, 104n ],
      [ 38, IdInternal(1) [Uint8Array] [ 38 ], 6, 107n ],
      [ 39, IdInternal(1) [Uint8Array] [ 39 ], 6, 106n ],
      [ 40, IdInternal(1) [Uint8Array] [ 40 ], 6, 101n ],
      [ 41, IdInternal(1) [Uint8Array] [ 41 ], 6, 100n ],
      [ 42, IdInternal(1) [Uint8Array] [ 42 ], 6, 103n ],
      [ 43, IdInternal(1) [Uint8Array] [ 43 ], 6, 102n ],
      [ 44, IdInternal(1) [Uint8Array] [ 44 ], 6, 97n ],
      [ 45, IdInternal(1) [Uint8Array] [ 45 ], 6, 96n ],
      [ 46, IdInternal(1) [Uint8Array] [ 46 ], 6, 99n ],
      [ 47, IdInternal(1) [Uint8Array] [ 47 ], 6, 98n ],
      [ 48, IdInternal(1) [Uint8Array] [ 48 ], 6, 125n ],
      [ 49, IdInternal(1) [Uint8Array] [ 49 ], 6, 124n ],
      [ 50, IdInternal(1) [Uint8Array] [ 50 ], 6, 127n ],
      [ 51, IdInternal(1) [Uint8Array] [ 51 ], 6, 126n ],
      [ 52, IdInternal(1) [Uint8Array] [ 52 ], 6, 121n ],
      [ 53, IdInternal(1) [Uint8Array] [ 53 ], 6, 120n ],
      [ 54, IdInternal(1) [Uint8Array] [ 54 ], 6, 123n ],
      [ 55, IdInternal(1) [Uint8Array] [ 55 ], 6, 122n ],
      [ 56, IdInternal(1) [Uint8Array] [ 56 ], 6, 117n ],
      [ 57, IdInternal(1) [Uint8Array] [ 57 ], 6, 116n ],
      [ 58, IdInternal(1) [Uint8Array] [ 58 ], 6, 119n ],
      [ 59, IdInternal(1) [Uint8Array] [ 59 ], 6, 118n ],
      [ 60, IdInternal(1) [Uint8Array] [ 60 ], 6, 113n ],
      [ 61, IdInternal(1) [Uint8Array] [ 61 ], 6, 112n ],
      [ 62, IdInternal(1) [Uint8Array] [ 62 ], 6, 115n ],
      [ 63, IdInternal(1) [Uint8Array] [ 63 ], 6, 114n ],
      [ 64, IdInternal(1) [Uint8Array] [ 64 ], 3, 13n ],
      [ 65, IdInternal(1) [Uint8Array] [ 65 ], 3, 12n ],
      [ 66, IdInternal(1) [Uint8Array] [ 66 ], 3, 15n ],
      [ 67, IdInternal(1) [Uint8Array] [ 67 ], 3, 14n ],
      [ 68, IdInternal(1) [Uint8Array] [ 68 ], 3, 9n ],
      [ 69, IdInternal(1) [Uint8Array] [ 69 ], 3, 8n ],
      [ 70, IdInternal(1) [Uint8Array] [ 70 ], 3, 11n ],
      [ 71, IdInternal(1) [Uint8Array] [ 71 ], 3, 10n ],
      [ 72, IdInternal(1) [Uint8Array] [ 72 ], 2, 5n ],
      [ 73, IdInternal(1) [Uint8Array] [ 73 ], 2, 4n ],
      [ 74, IdInternal(1) [Uint8Array] [ 74 ], 2, 7n ],
      [ 75, IdInternal(1) [Uint8Array] [ 75 ], 2, 6n ],
      [ 76, IdInternal(1) [Uint8Array] [ 76 ], 0, 1n ],
      [ 77, IdInternal(1) [Uint8Array] [ 77 ], undefined, undefined ],
      [ 78, IdInternal(1) [Uint8Array] [ 78 ], 1, 3n ],
      [ 79, IdInternal(1) [Uint8Array] [ 79 ], 1, 2n ],
      [ 80, IdInternal(1) [Uint8Array] [ 80 ], 4, 29n ],
      [ 81, IdInternal(1) [Uint8Array] [ 81 ], 4, 28n ],
      [ 82, IdInternal(1) [Uint8Array] [ 82 ], 4, 31n ],
      [ 83, IdInternal(1) [Uint8Array] [ 83 ], 4, 30n ],
      [ 84, IdInternal(1) [Uint8Array] [ 84 ], 4, 25n ],
      [ 85, IdInternal(1) [Uint8Array] [ 85 ], 4, 24n ],
      [ 86, IdInternal(1) [Uint8Array] [ 86 ], 4, 27n ],
      [ 87, IdInternal(1) [Uint8Array] [ 87 ], 4, 26n ],
      [ 88, IdInternal(1) [Uint8Array] [ 88 ], 4, 21n ],
      [ 89, IdInternal(1) [Uint8Array] [ 89 ], 4, 20n ],
      [ 90, IdInternal(1) [Uint8Array] [ 90 ], 4, 23n ],
      [ 91, IdInternal(1) [Uint8Array] [ 91 ], 4, 22n ],
      [ 92, IdInternal(1) [Uint8Array] [ 92 ], 4, 17n ],
      [ 93, IdInternal(1) [Uint8Array] [ 93 ], 4, 16n ],
      [ 94, IdInternal(1) [Uint8Array] [ 94 ], 4, 19n ],
      [ 95, IdInternal(1) [Uint8Array] [ 95 ], 4, 18n ],
      [ 96, IdInternal(1) [Uint8Array] [ 96 ], 5, 45n ],
      [ 97, IdInternal(1) [Uint8Array] [ 97 ], 5, 44n ],
      [ 98, IdInternal(1) [Uint8Array] [ 98 ], 5, 47n ],
      [ 99, IdInternal(1) [Uint8Array] [ 99 ], 5, 46n ]
    ]

As you can see, bucket 2 & 3 are left-of NodeId 77, while bucket 4 & 5 is right-of NodeId 77 and finally bucket 6 is on the far-left.

Additionally, the numeric/lexicographic order for each NodeId does not match an ordered distance.

However it is true that all possible NodeIds in particular bucket would be contiguous numerically.

This means when getting the closest node IDs, we still need to sort each bucket's item by the distance function. This is bounded by the bucket size since buckets are limited to 20 nodes by default.

I'm writing 2 methods that do similar things:

getBuckets - this streams out buckets in order of bucket index and then by lastUpdated
getNodes - this streams out nodes in order of bucket index

I originally thought that I could have getNodes return in order of distance. But the DB would only provide me either order by lastUpdated similar to getBuckets, or lexicographic ordering of NodeId, which you can see isn't particularly useful.

The 2 sort-orders within buckets that are useful are ultimately:

In order of distance
In order of lastUpdated

So this means I'll need to bring back the sortByDistance utility and potentially make it available to the getBuckets and getNodes methods.

@tegefaulkes - you'll need to beware of this to solve this issue.

CMCDragonkai · 2022-03-12T15:52:52Z

Insertion sort would be nice to use here, but I fear it won't be efficient because arrays will end up being resized. So once you map all the values to distance, you then just sort them all.

CMCDragonkai · 2022-03-12T16:14:54Z

This test was added, so it proves that lexicographic order is equal to Buffer.compare order as well as numeric order of big-endian buffer bytes. MatrixAI/js-db@7f7d75e

tegefaulkes · 2022-03-15T00:50:41Z

A NodeId's distance is fully reversible calculation given we know the node it's relative to. This means if we wanted distance ordering we can use the distance as the key in the db instead of the NodeId. That's just something to consider.

CMCDragonkai · 2022-03-15T01:56:17Z

A NodeId's distance is fully reversible calculation given we know the node it's relative to. This means if we wanted distance ordering we can use the distance as the key in the db instead of the NodeId. That's just something to consider.

We cannot do that as I explained in #158. The distances are different because sometimes we want to know the closest nodes to a node that isn't the own node. So there's no way to just index by 1 distance metric.

You have to use the way I'm doing it in getBuckets to create a new method called getClosestNodes.

CMCDragonkai · 2022-03-15T01:57:37Z

I suggest a signature like:

getClosestNodes(nodeId: NodeId, limit: number): Array<[NodeId, NodeData]>

This is similar to getNodes except that that because it is a limit, it can be a finite return type compared to the async genreator.

CMCDragonkai · 2022-03-15T01:58:28Z

Use this comment: #158 (comment)

CMCDragonkai · 2022-03-15T01:58:47Z

With nodesUtils.bucketSortByDistance.

CMCDragonkai · 2022-03-15T02:34:50Z

In your tests for this, make sure to add in a test that the returned closest nodes should be ordered by distance ascending.

Example of doing this:

    const closestNodeDistances = closestNodes.map(([nodeId]) => nodesUtils.nodeDistance(sourceNodeId, nodeId));
    expect(
      closestNodeDistances.slice(1).every((distance, i) => {
        return closestNodeDistances[i] <= distance;
      })
    ).toBe(true);

I'm actually not sure if 2 nodes could have the same distance away from a given node ID, if that's not possible then you can change that to <.

tegefaulkes · 2022-03-15T04:38:35Z

I have an implementation that works but it may not be the best solution right now. Since a bucket will not be sorted by distance AND we want to sort the output by the 20 closest nodes. when we get some nodes from a bucket we need to get the whole bucket.

My first implementation started by iterating over each node with a limit set. but doing that it was possible to end up stopping half way into a bucket. Since streaming like this it is not easy to tell where the boundary of the bucket is.

What I have now is I'm getting each bucket to the left of the starting bucket inclusive and the right of the starting bucket exclusive until I reach the limit. sort this list and truncate it down to the limit. However we can end up trying to read a lot of empty buckets this way.

It's possible I can do the stream method until I reach the limit and then read the whole bucket at that last Id. I can then take the union of the stream list, sort and truncate the result. This will give us the best of both worlds.

CMCDragonkai · 2022-03-15T04:51:15Z

I have an implementation that works but it may not be the best solution right now. Since a bucket will not be sorted by distance AND we want to sort the output by the 20 closest nodes. when we get some nodes from a bucket we need to get the whole bucket.

Yes you have to acquire the full bucket and sort it by distance. This is fine, because you know that the bucket limit is a fixed relatively small number. This is why we are using sortBucketByDistance utility function. This is being used by getBuckets when sorted by distance.

My first implementation started by iterating over each node with a limit set. but doing that it was possible to end up stopping half way into a bucket. Since streaming like this it is not easy to tell where the boundary of the bucket is.

You can tell what the boundary of the bucket is, see how we do this in NodeGraph.getBuckets, this is done by parsing the key and checking if the bucketIndex or bucketKey is different. You know that all nodes in a bucket are contiguous during streaming.

What I have now is I'm getting each bucket to the left of the starting bucket inclusive and the right of the starting bucket exclusive until I reach the limit. sort this list and truncate it down to the limit. However we can end up trying to read a lot of empty buckets this way.

You should not be reading any empty buckets. Do it the same way that getBuckets does it, and it will be quite efficient.

tegefaulkes · 2022-03-15T06:43:06Z

Created public async getClosestNodes(nodeId: NodeId, limit: number = 20): Promise<[NodeId, NodeData][]>
It uses the readStream to get the nodeIds. To make sure we getting whole buckets, it reads the whole bucket of the last NodeId from the stream and unions what bucket list with the stream list. this is all sorted and truncated down to the limit size.

Created some tests testing for conditions where

all returned nodes are lower or equal buckets the target
all returned nodes are above or equal buckets the target
mix of lower and higher buckets
same as above 3 tests but total nodes are less than limit
no nodes exist

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes · 2022-03-15T07:19:12Z

Mind you this just adds the implementation to the NodeGraph. the RPC call needs to be updated. I'll need to replace the current implementation in the NodeConnectionManager with the one I just made. and update the RPC handler as well.

Relates to #212

tegefaulkes · 2022-03-17T01:26:45Z

Copy of old spec for reference

Specification

Currently, getClosestLocalNodes in NodeGraph performs an exhaustive search over all of the buckets to find the k closest local nodes to a given target node. This is based off the following comment from https://stackoverflow.com/questions/30654398/implementing-find-node-on-torrent-kademlia-routing-table/30655403#30655403:

I figure that many people simply iterate over the whole routing table because for regular nodes it will only contain a few dozen buckets at most and a DHT node does not see much traffic, so it only has to execute this operation a few times per second and if you implement this in a dense, cache-friendly datastructure then the lion's share might actually be the memory traffic and not the CPU instructions doing a few XORs and comparisons.

However, recall that the buckets are organised according to the bitwise "distance" from the current node to another. Each key in the database represents a bucket index, i, where 2^i <= distance (from current node) < 2^(i+1). That is, for example, nodes in the bucket at index 20 are known to have a distance between 2^20 and 2^21 from the current node.
* Recall distance is the bitwise XOR operator from one node ID to another node ID.
* For example: IY2M0YTI5YzdhMTUzNGFjYWIxNmNiNGNmNTFiYTBjYTg XOR IY2M0YTI5YzdhMTUzNGFjYWIxNmNiNGNmNTFiYTBjYTh = 15

However, we need to remember how Kademlia functions. Not all of our buckets are going to have nodes in them (that is, there may not even be created buckets in the database at some of these indexes).

Therefore, it's not just a simple case of looking at the adjacent buckets then - because you're pretty unlikely to find an immediately adjacent bucket that already has a node in it. So then the bigger question is, how do you inform the search to find the closest nodes?

Additional context

discussion thread https://gitlab.com/MatrixAI/Engineering/Polykey/js-polykey/-/merge_requests/195#note_612883452 (text above copied from a previous comment of mine)

Tasks

After prior discussion with @CMCDragonkai, we believe the following process should provide a sufficient improvement over the current exhaustive search. The following should be implemented (or improved):

For a node n that would appear in bucket index i:

Get the nodes in the adjacent right bucket (i + 1th bucket) - if it exists
Combine these nodes with the nodes in your current bucket (if it exists)
Find the closest k nodes in this combined bucket.
Start iterating down the 'left' buckets if we haven't found k nodes yet
1. Note, nodes within a bucket are not sorted according to distance. So, if there are more nodes in the bucket than we need, we'd need to iterate over all nodes in the bucket to find the "closest" ones.
Start iterating up the 'right' buckets if we still haven't found k nodes yet.

We could use a createReadStream to neatly iterate over all buckets (because there's going to be lots of bucket indexes that don't actually have a bucket created).

CMCDragonkai · 2022-03-21T03:57:28Z

Need to check the getClosestNode and compare it to the k-buckets implementation: https://github.com/tristanls/k-bucket/blob/master/index.js#L203-L228

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes · 2022-03-30T02:26:16Z

Reviewing how distance bucket index and distance maps over to a targetNodeId. The tables demonstrate this mapping. Note that each entry takes the format (bucket, distance).

Sorted by base distance

index	base	target
0	(5, 33)	(10, 1057)
1	(6, 65)	(10, 1089)
2	(7, 129)	(10, 1153)
3	(8, 257)	(10, 1281)
4	(9, 513)	(10, 1537)
5	(10, 1025)	(0, 1)
6	(11, 2049)	(11, 3073)
7	(12, 4097)	(12, 5121)
8	(13, 8193)	(13, 9217)
9	(14, 16385)	(14, 17409)

Sorted by target distance

index	base	target
0	(10, 1025)	(0, 1)
1	(5, 33)	(10, 1057)
2	(6, 65)	(10, 1089)
3	(7, 129)	(10, 1153)
4	(8, 257)	(10, 1281)
5	(9, 513)	(10, 1537)
6	(11, 2049)	(11, 3073)
7	(12, 4097)	(12, 5121)
8	(13, 8193)	(13, 9217)
9	(14, 16385)	(14, 17409)

Everything within the same bucket sorted by base distance

index	base	target
0	(10, 1025)	(0, 1)
1	(10, 1026)	(1, 2)
2	(10, 1027)	(1, 3)
3	(10, 1028)	(2, 4)
4	(10, 1029)	(2, 5)
5	(10, 1030)	(2, 6)
6	(10, 1031)	(2, 7)
7	(10, 1032)	(3, 8)
8	(10, 1033)	(3, 9)

So based on this observation we can conclude

Everything in the same bucket as the target maps to every bucket below that for the target. E.G. 10 -> 0,1,2,3..., 9
Every bucket 'lower' than the bucket the target is in is mapped to the same bucket for the target. E.G. 0,1,2,3... 9 -> 10
The ordering of the distance for the lower buckets remains unchanged. that is nodes in bucket 1 -> 10 are still closer than bucket 2 -> 10 nodes.
all the buckets 'above' the target's bucket remains unchanged.

This is the bucket switching behaviour we discussed when talking about the buckets resetBuckets implementation.

Knowing this, we should get nodes in the following order. where N is the target's bucket and K is the highest bucket...

N
iterate over 0 -> N -1
iterate over N+1 -> K

The code mostly followed this procedure, I just need to make a small fix.

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes · 2022-03-30T03:22:18Z

Applied a fix.

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

joshuakarp added enhancement New feature or request development Standard development labels Jul 21, 2021

CMCDragonkai mentioned this issue Feb 21, 2022

Testnet Deployment #326

Closed

29 tasks

CMCDragonkai assigned tegefaulkes Mar 11, 2022

CMCDragonkai added the epic Big issue with multiple subissues label Mar 11, 2022

CMCDragonkai mentioned this issue Mar 13, 2022

Tree-based Kademlia Data Structure - k-bucket library #158

Closed

tegefaulkes added a commit that referenced this issue Mar 15, 2022

feat: added NodeGraph.getClosestNodes()

114707d

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes added a commit that referenced this issue Mar 17, 2022

fix: Updated usage and tests of getClosestNodes()

366aa3f

Relates to #212

tegefaulkes mentioned this issue Mar 17, 2022

Setting a Node involves NodeManager and NodeGraph, where NodeManager handles bucket overflow policy #359

Closed

10 tasks

tegefaulkes added a commit that referenced this issue Mar 23, 2022

feat: added NodeGraph.getClosestNodes()

8b421ac

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes added a commit that referenced this issue Mar 23, 2022

feat: added NodeGraph.getClosestNodes()

0615154

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

CMCDragonkai pushed a commit that referenced this issue Mar 26, 2022

feat: added NodeGraph.getClosestNodes()

0133130

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

CMCDragonkai pushed a commit that referenced this issue Mar 26, 2022

feat: added NodeGraph.getClosestNodes()

b9f65ad

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes added a commit that referenced this issue Mar 28, 2022

feat: added NodeGraph.getClosestNodes()

670cab2

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes added a commit that referenced this issue Mar 28, 2022

feat: added NodeGraph.getClosestNodes()

31a2f55

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

CMCDragonkai pushed a commit that referenced this issue Mar 29, 2022

feat: added NodeGraph.getClosestNodes()

b10c9ee

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes added a commit that referenced this issue Mar 29, 2022

feat: added NodeGraph.getClosestNodes()

592b8e5

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes added a commit that referenced this issue Mar 30, 2022

feat: added NodeGraph.getClosestNodes()

c1b1533

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes added a commit that referenced this issue Apr 8, 2022

feat: added NodeGraph.getClosestNodes()

b15e041

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes mentioned this issue Apr 13, 2022

Decentralised NAT Signalling #365

Closed

CMCDragonkai removed the epic Big issue with multiple subissues label May 2, 2022

tegefaulkes added a commit that referenced this issue Jun 1, 2022

feat: added NodeGraph.getClosestNodes()

e93e9a1

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes mentioned this issue Jun 1, 2022

Preparing for Testnet Deployment #378

Merged

29 tasks

tegefaulkes added a commit that referenced this issue Jun 1, 2022

feat: added NodeGraph.getClosestNodes()

b177034

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes added a commit that referenced this issue Jun 2, 2022

feat: added NodeGraph.getClosestNodes()

51e2e5a

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes added a commit that referenced this issue Jun 6, 2022

feat: added NodeGraph.getClosestNodes()

cb92a66

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes added a commit that referenced this issue Jun 7, 2022

feat: added NodeGraph.getClosestNodes()

072b8ca

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

tegefaulkes added a commit that referenced this issue Jun 10, 2022

feat: added NodeGraph.getClosestNodes()

420e8cb

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

emmacasolin pushed a commit that referenced this issue Jun 14, 2022

feat: added NodeGraph.getClosestNodes()

c2b77b4

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

CMCDragonkai closed this as completed Jun 14, 2022

emmacasolin pushed a commit that referenced this issue Jun 14, 2022

feat: added NodeGraph.getClosestNodes()

d08bbe9

Implemented `getClosestNodes()` and relevant tests in `NodeGraph.test.ts`. Relates to #212

CMCDragonkai added the r&d:polykey:core activity 3 Peer to Peer Federated Hierarchy label Jul 24, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve node search over k-buckets (getClosestLocalNode) #212

Improve node search over k-buckets (getClosestLocalNode) #212

joshuakarp commented Jul 21, 2021 •

edited by tegefaulkes

Loading

CMCDragonkai commented Mar 11, 2022

CMCDragonkai commented Mar 12, 2022

CMCDragonkai commented Mar 12, 2022

CMCDragonkai commented Mar 12, 2022

tegefaulkes commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022

tegefaulkes commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022 •

edited

Loading

tegefaulkes commented Mar 15, 2022

tegefaulkes commented Mar 15, 2022

tegefaulkes commented Mar 17, 2022

CMCDragonkai commented Mar 21, 2022

tegefaulkes commented Mar 30, 2022

tegefaulkes commented Mar 30, 2022

Improve node search over k-buckets (getClosestLocalNode) #212

Improve node search over k-buckets (getClosestLocalNode) #212

Comments

joshuakarp commented Jul 21, 2021 • edited by tegefaulkes Loading

Specification

As implemented

Additional context

Tasks

CMCDragonkai commented Mar 11, 2022

CMCDragonkai commented Mar 12, 2022

CMCDragonkai commented Mar 12, 2022

CMCDragonkai commented Mar 12, 2022

tegefaulkes commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022

tegefaulkes commented Mar 15, 2022

CMCDragonkai commented Mar 15, 2022 • edited Loading

tegefaulkes commented Mar 15, 2022

tegefaulkes commented Mar 15, 2022

tegefaulkes commented Mar 17, 2022

Copy of old spec for reference

Specification

Additional context

Tasks

CMCDragonkai commented Mar 21, 2022

tegefaulkes commented Mar 30, 2022

tegefaulkes commented Mar 30, 2022

joshuakarp commented Jul 21, 2021 •

edited by tegefaulkes

Loading

CMCDragonkai commented Mar 15, 2022 •

edited

Loading