Batch neighbour retrieval in traversals #21809
Draft · +311 −137
We found that if a traversal has a limit but one of the visited nodes is a supernode, we first retrieve all of that supernode's neighbours and then discard most of them again because of the limit. This unnecessary retrieval can be very expensive.
Therefore we decided to batch the neighbour retrieval for each node into batches of 1000. For the `OneSidedEnumerator`, the neighbourhood retrieval is done in `computeNeighbourhoodOfNextVertex()`. On a single server, the `expand` method on the provider is the one that saves all the neighbours into memory. In the cluster case, already the fetching of the neighbour edges (via the provider) can be expensive and should be batched.
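The following is not code from this PR, just a minimal sketch of the intended behaviour: fetch neighbours in batches of 1000 and stop as soon as the limit is reached, so the rest of a supernode's neighbourhood is never materialised. All names and types here (`EdgeRef`, `NeighbourCursor`, `expandBatched`) are hypothetical.

```cpp
#include <cstddef>
#include <functional>
#include <string>
#include <vector>

using EdgeRef = std::string;  // hypothetical stand-in for the real edge type

// Hypothetical cursor over the neighbours of one vertex: yields at most
// n neighbours per call and an empty batch once exhausted.
struct NeighbourCursor {
  std::vector<EdgeRef> edges;
  std::size_t pos = 0;

  std::vector<EdgeRef> next(std::size_t n) {
    std::vector<EdgeRef> batch;
    while (pos < edges.size() && batch.size() < n) {
      batch.push_back(edges[pos++]);
    }
    return batch;
  }
};

constexpr std::size_t kBatchSize = 1000;

// Expand one vertex in batches and stop as soon as the limit is satisfied,
// so the remaining neighbours of a supernode are never fetched at all.
void expandBatched(NeighbourCursor& cursor, std::size_t limit,
                   std::function<void(EdgeRef const&)> const& callback) {
  std::size_t produced = 0;
  while (produced < limit) {
    auto batch = cursor.next(kBatchSize);
    if (batch.empty()) {
      break;  // neighbourhood exhausted
    }
    for (auto const& edge : batch) {
      callback(edge);
      if (++produced >= limit) {
        return;  // limit reached: skip all remaining batches
      }
    }
  }
}
```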
The current changes include a `SingleServerNeighbourProvider`, an iterator over the neighbours of one vertex. The vertex can be set (which resets the internal cursors) via `rearm`. The `next` method is supposed to return the next 1000 neighbours; currently it still returns all of them, so this needs more work. The behaviour of the `SingleServerProvider` has also not changed so far: it iterates over all batches. This has to change so that it executes the callback only on the next batch.
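A hypothetical sketch of the interface just described: only the class name, `rearm`, `next`, and the batch size of 1000 come from this description; all signatures and members below are guesses for illustration.

```cpp
#include <cstddef>
#include <optional>
#include <string>
#include <vector>

class SingleServerNeighbourProvider {
 public:
  static constexpr std::size_t kBatchSize = 1000;

  // Point the iterator at a new vertex; this resets the internal cursors.
  void rearm(std::string vertexId) {
    _vertexId = std::move(vertexId);
    _offset = 0;  // stand-in for resetting the real edge cursors
  }

  // Intended behaviour: return the next batch of up to 1000 neighbours, or
  // std::nullopt once the neighbourhood is exhausted. (Per the description
  // above, the current draft still returns all neighbours at once.)
  std::optional<std::vector<std::string>> next() {
    auto batch = readEdges(_vertexId, _offset, kBatchSize);
    if (batch.empty()) {
      return std::nullopt;
    }
    _offset += batch.size();
    return batch;
  }

 private:
  // Placeholder for the real storage-engine cursor access; stubbed out so
  // the sketch stays self-contained.
  std::vector<std::string> readEdges(std::string const& /*vertex*/,
                                     std::size_t /*offset*/,
                                     std::size_t /*n*/) {
    return {};
  }

  std::string _vertexId;
  std::size_t _offset = 0;
};
```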
Another thing that has to be investigated: the `TraversalStats` have to be given to the `SingleServerNeighbourProvider`, possibly as a `shared_ptr` (?); using just a bare reference to the traversal stats of the `SingleServerProvider` crashes arangod (a sketch of the shared-ownership idea follows below). Also, a similar provider has to be implemented for the cluster case, where we fetch edges.
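On the stats question, a hedged sketch of why a `shared_ptr` might help, assuming the crash is a lifetime issue (the bare reference dangling once the owning object is moved or destroyed); the types and members below are invented for illustration.

```cpp
#include <cstdint>
#include <memory>

// Hypothetical minimal stand-in for the real TraversalStats in arangod.
struct TraversalStats {
  std::uint64_t scannedIndexEntries = 0;
};

// If the neighbour provider can outlive (or be used after a move of) the
// owning provider's stats object, a bare reference dangles, which would
// explain the observed crash. Shared ownership keeps the stats alive for
// as long as either side still holds them.
class NeighbourProviderSketch {
 public:
  explicit NeighbourProviderSketch(std::shared_ptr<TraversalStats> stats)
      : _stats(std::move(stats)) {}

  void countScan() { ++_stats->scannedIndexEntries; }

 private:
  std::shared_ptr<TraversalStats> _stats;  // shared, not a bare reference
};
```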