feat: handle incoming blob filters #956

EvanHahn · 2024-11-07T20:52:02Z

I recommend reviewing this PR one commit at a time.

When you receive a blob filter from another peer, this updates their sync states. For example, if you receive a blob filter that says "I only want photo thumbnails", that peer's "wants" bitfield will be reduced.

Closes #682 and #905.
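To make the "I only want photo thumbnails" example concrete, here is a minimal sketch of how a filter could be turned into block ranges to want. The filter shape (`{ photo: ['thumbnail'] }`), the entry shape (`key` plus `blob.blockOffset` / `blob.blockLength`), and both helper names are assumptions for illustration, not code from this PR.

```javascript
// Hypothetical sketch: filter shape, entry shape, and names are assumptions.

/** Returns true if an entry key like "/photo/thumbnail/a.jpg" passes the filter. */
function entryMatchesFilter(key, filter) {
  if (filter === null) return true // assumed: no filter means "wants everything"
  const [type, variant] = key.split('/').filter(Boolean)
  const variants = filter[type]
  return Array.isArray(variants) && variants.includes(variant)
}

/** Collects [start, length] block ranges for every entry that passes the filter. */
function wantRangesForFilter(entries, filter) {
  const ranges = []
  for (const { key, blob } of entries) {
    if (entryMatchesFilter(key, filter)) {
      ranges.push([blob.blockOffset, blob.blockLength])
    }
  }
  return ranges
}

const entries = [
  { key: '/photo/original/a.jpg', blob: { blockOffset: 0, blockLength: 8 } },
  { key: '/photo/thumbnail/a.jpg', blob: { blockOffset: 8, blockLength: 1 } },
  { key: '/audio/original/b.mp3', blob: { blockOffset: 9, blockLength: 20 } },
]

// Only the thumbnail's blocks are wanted.
const ranges = wantRangesForFilter(entries, { photo: ['thumbnail'] })
console.log(ranges)
```

Each resulting range would then be handed to something like `addWantRange(start, length)` for the peer that sent the filter.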

src/blob-store/downloader.js

This change should have no impact on functionality. It makes the following changes to `PeerState`:

- Instead of a `#wants` bitfield and a `#wantAll` boolean, we only store one value: `#wants`, which can be `null` or a bitfield. This helps avoid states where you want everything *and* the bitfield has data inside.
- `setWantRange` is renamed to `addWantRange`.
- `addWantRange` takes two numbers (a start and length) instead of an object. This is for performance and consistency with future code.
- `clearWantRanges` is a new method that resets the bitfield.

I think these are useful changes on their own, but will make upcoming changes smaller too.
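The `PeerState` changes listed above could look roughly like the sketch below. It is not the code from the PR: it uses a `Set` in place of a real bitfield, the class name and `wantsBlock` are hypothetical, and it assumes that `null` means "wants everything" and that `clearWantRanges` resets to an empty set of wants.

```javascript
// Hypothetical sketch of the single-value `#wants` design; not the PR's code.
class PeerStateSketch {
  /**
   * `null` is assumed to mean "wants everything". Otherwise, only the listed
   * block indexes are wanted. A Set stands in for the real bitfield.
   * @type {Set<number> | null}
   */
  #wants = null

  /** Adds `length` blocks starting at `start` to the wanted blocks. */
  addWantRange(start, length) {
    if (this.#wants === null) this.#wants = new Set()
    for (let i = start; i < start + length; i++) this.#wants.add(i)
  }

  /** Assumed behavior: after clearing, nothing is wanted until ranges are added. */
  clearWantRanges() {
    this.#wants = new Set()
  }

  /** Hypothetical helper for inspecting the state. */
  wantsBlock(index) {
    return this.#wants === null ? true : this.#wants.has(index)
  }
}

const peer = new PeerStateSketch()
peer.addWantRange(2, 3) // wants blocks 2, 3, and 4
```

Because there is only one field, "wants everything" and "wants a specific set" cannot both be true at once, which is the inconsistent state the first bullet describes.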

src/blob-store/index.js

gmaclennan

OK, looks good. Good work on this; it just needs some work managing the lifecycle of the entries stream.

src/mapeo-project.js

gmaclennan · 2024-11-18T15:05:58Z

src/sync/core-sync-state.js

   */
-  setPeerWants(peerId, ranges) {
+  addWantRange(peerId, start, length) {


I was worried about performance (because of the cost of calculating state), but I think that, the way this is implemented, the throttle on SyncState should keep state calculation from happening too frequently, so it is OK to just call this with every range (there will be a lot of calls to this).

I wrote a simple benchmark script with Deno. On my machine, I could call setRange 7,364 times per second, at least with my simple benchmark. I realize my computer is probably much faster than most phones, but even a 100x slowdown is probably okay here.

I'm going to leave this as is, but let me know if that's wrong.
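For reference, a micro-benchmark along these lines can be sketched as follows. This is not the script mentioned above: the `Uint8Array`-backed `setRange` is a stand-in for the real bitfield, `callsPerSecond` is a hypothetical helper, and the numbers it prints will differ from the figure quoted in this thread.

```javascript
// Hypothetical micro-benchmark; the bitfield here is a simple stand-in.

/** Sets `length` bits starting at bit index `start`. */
function setRange(bits, start, length) {
  for (let i = start; i < start + length; i++) {
    bits[i >> 3] |= 1 << (i & 7)
  }
}

/** Calls `fn` repeatedly for roughly `durationMs` and returns the call rate. */
function callsPerSecond(fn, durationMs = 200) {
  const startedAt = performance.now()
  let calls = 0
  while (performance.now() - startedAt < durationMs) {
    fn(calls)
    calls++
  }
  const elapsedMs = performance.now() - startedAt
  return (calls / elapsedMs) * 1000
}

const bits = new Uint8Array(1 << 20) // room for 2^23 bits
const rate = callsPerSecond((i) => setRange(bits, (i * 64) % (1 << 22), 64))
console.log(`${Math.round(rate)} calls per second`)
```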

Sorry for the lack of clarity, I was thinking out loud. I was concerned about performance not because of the cost of this function, but because it triggers a state update, and reading state is expensive (when cores get large, which they will for the blob core). However, the way we currently trigger state updates, we don't actually calculate state as part of the update event, and in SyncState we throttle the handler for update events so that getState() is only called every 200ms, so with the current implementation this is all fine. The reason this originally took ranges was to avoid triggering a state recalculation for every range set, but our throttle in SyncState solves that problem for now. We just need to remember this in the future.
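The throttling idea described above can be illustrated with a minimal sketch. It is not the SyncState implementation: `throttleTrailing`, the injectable `schedule` parameter, and the counter are hypothetical, and the real throttle may fire on a different edge. The point is only that many cheap update events collapse into one expensive state calculation per interval.

```javascript
// Hypothetical sketch of coalescing update events; not the SyncState code.

/**
 * Returns a function that runs `fn` at most once per `waitMs`, no matter how
 * often it is called. `schedule` defaults to setTimeout and is injectable so
 * the behavior can be exercised without real timers.
 */
function throttleTrailing(fn, waitMs, schedule = setTimeout) {
  let pending = false
  return function throttled() {
    if (pending) return // a run is already scheduled, so coalesce this call
    pending = true
    schedule(() => {
      pending = false
      fn()
    }, waitMs)
  }
}

let stateCalculations = 0
// Stands in for the expensive getState() call.
const calculateState = () => {
  stateCalculations++
}
const onUpdate = throttleTrailing(calculateState, 200)

// Many cheap update events (for example, one per added want range)...
for (let i = 0; i < 10_000; i++) onUpdate()
// ...schedule a single state calculation about 200ms later.
```

With this shape, the cost of calling something like `addWantRange` for every range stays proportional to the cheap bitfield work, while the expensive calculation runs on the throttle's schedule.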

src/discovery/local-discovery.js

gmaclennan

great, fantastic to have this working.

EvanHahn commented Nov 7, 2024

EvanHahn added 7 commits November 7, 2024 22:36

chore: move HyperdriveIndex into its own file

345d80b

This change should have no impact.

chore: explicit return type to `BlobStore.prototype.createEntriesReadStream`

2b5bfdc

This is a types-only change that should have no functionality impact.

chore: DRY out blob download filter conditional

fd5a07d

This should have no functionality impact.

chore: MapeoProject should update SyncApi's blob download filter

b433efa

chore: SyncApi, tell already-connected peers about your blob filters

c189f79

We weren't doing this when `SyncApi` was instantiated. Now we are.

feat: handle incoming blob filters

7c810c2

This is the bulk of the work required in [#905]. It takes incoming blob filters and converts them to want bitfields. It starts in `MapeoProject` and gets all the way down to `PeerState`.

[#905]: #905

EvanHahn force-pushed the want-bitfield branch from 70f65b4 to 7c810c2 Compare November 12, 2024 16:45

EvanHahn marked this pull request as ready for review November 12, 2024 16:52

EvanHahn requested a review from gmaclennan November 12, 2024 16:52

gmaclennan reviewed Nov 12, 2024

EvanHahn requested a review from gmaclennan November 14, 2024 16:21

gmaclennan requested changes Nov 18, 2024

EvanHahn added 2 commits November 19, 2024 16:11

Merge branch 'main' into want-bitfield

5b6ab32

Address code review comments

13f9a3d

EvanHahn requested a review from gmaclennan November 19, 2024 21:55

Merge branch 'media-manager-v1' into want-bitfield

3c8a71d

EvanHahn commented Nov 19, 2024

gmaclennan approved these changes Nov 20, 2024

EvanHahn merged commit 5ae541d into media-manager-v1 Nov 20, 2024
9 checks passed

EvanHahn deleted the want-bitfield branch November 20, 2024 14:34

EvanHahn mentioned this pull request Nov 20, 2024

feat: media manager v1 #969

Merged

EvanHahn added a commit that referenced this pull request Nov 20, 2024

feat: receive blob filters from archive devices (#969)

3d1c94b

This is a squashed commit of:

- #940
- #957
- #956

Co-authored-by: Gregor MacLennan <gmaclennan@digital-democracy.org>
