fix: stewardship api doesn't check final leaves #4735

martinconic · 2024-07-26T08:43:20Z

Checklist

I have read the coding guide.
My change requires a documentation update, and I have done it.
I have added tests to cover my changes.
I have filled out the description and linked the related issues.

Description

ldeffenb

I'm just not sure how to do this efficiently without having the BMT traverser somehow informing the callback if the netGetter was used or not yet for the address being iterated. Otherwise I would have proposed a solution...

ldeffenb · 2024-07-26T09:09:35Z

pkg/steward/steward.go

-	noop := func(leaf swarm.Address) error { return nil }
-	switch err := s.netTraverser.Traverse(ctx, root, noop); {
+	fn := func(a swarm.Address) error {
+		_, err := s.netGetter.RetrieveChunk(ctx, a, swarm.ZeroAddress)


I considered this naive approach, but what it ends up doing is retrieving all non-leaf chunks TWICE from the swarm doubling the bandwidth cost of a /stewardship check. The first netGetter call happens while expanding the BMT for the non-leaf chunks, but ALL chunks end up doing this callback resulting in the netGetter (which bypasses the local cache) redundantly pulling most of the chunks a second time.

Good catch. By my opinion, this can be a comment in the code for the improvement, as the functionality is here with minimal changes.

If that's the case, then we might as well allow the traverser to use the normal getter method that uses the localstore and do ALL of the netgetter checks here in the callback? Not many more changes, and eliminates the redundancy.

The net getter is to be able to get the chunk even if it is not locally available. It would be good to cache the chunk upon retrieval in localstore, though.

As I understand the netGetter, it ONLY retrieves from the swarm, and doesn't even try the localstore (try it on a locally pinned, but stamp-expired reference, it will fail, while the /bytes and /bzz will still work). My point is that since we're going to netGet every single traversed reference in the callback, then don't make a netTraverser using the netGetter, but use a normal traverser that will use localstore and only go to the swarm if necessary. That will reduce the redundancy of hitting the swarm at least.

Yes, as long as it is certain that the getter based on localstore will fallback to retrieve the chunk from the network.

The retrieval protocol has a singleflight mechanism so this improves efficiency of requests that are fired in succession.

fix: check for leaf on bmt traversal

1c86421

ldeffenb suggested changes Jul 26, 2024

View reviewed changes

martinconic requested review from janos, istae and acha-bill July 26, 2024 09:42

istae approved these changes Jul 26, 2024

View reviewed changes

janos approved these changes Jul 26, 2024

View reviewed changes

acha-bill approved these changes Jul 28, 2024

View reviewed changes

martinconic merged commit 7fad4ab into master Jul 29, 2024
14 checks passed

martinconic deleted the fix/stewardship-leaf-check branch July 29, 2024 13:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: stewardship api doesn't check final leaves #4735

fix: stewardship api doesn't check final leaves #4735

martinconic commented Jul 26, 2024

ldeffenb left a comment

ldeffenb Jul 26, 2024

janos Jul 26, 2024

ldeffenb Jul 26, 2024 •

edited

Loading

janos Jul 27, 2024

ldeffenb Jul 27, 2024

janos Jul 27, 2024

martinconic Jul 29, 2024

fix: stewardship api doesn't check final leaves #4735

fix: stewardship api doesn't check final leaves #4735

Conversation

martinconic commented Jul 26, 2024

Checklist

Description

ldeffenb left a comment

Choose a reason for hiding this comment

ldeffenb Jul 26, 2024

Choose a reason for hiding this comment

janos Jul 26, 2024

Choose a reason for hiding this comment

ldeffenb Jul 26, 2024 • edited Loading

Choose a reason for hiding this comment

janos Jul 27, 2024

Choose a reason for hiding this comment

ldeffenb Jul 27, 2024

Choose a reason for hiding this comment

janos Jul 27, 2024

Choose a reason for hiding this comment

martinconic Jul 29, 2024

Choose a reason for hiding this comment

ldeffenb Jul 26, 2024 •

edited

Loading