
bitswap/server: wantlist overflow fails in a toxic manner preventing any data transfer #527

Closed
Tracked by #10398 ...
Jorropo opened this issue Dec 19, 2023 · 1 comment · Fixed by #629
Labels
kind/bug A bug in existing code (including security flaws) P1 High: Likely tackled by core team if no one steps up

Comments

@Jorropo
Contributor

Jorropo commented Dec 19, 2023

This is a bug I introduced in 9cb5cb5 when fixing CVE-2023-25568.

Here is a screenshot of our gateway's wantlists:
[Screenshot: Grafana panel "View panel - Gateway - Main - IPFS Gateway", 2023-12-19, showing per-instance wantlist sizes]

You can see that many gateway instances have more than 1024 entries.
This causes issues with:

if wouldBe := s + uint(len(wants)); wouldBe > e.maxQueuedWantlistEntriesPerPeer {
	log.Debugw("wantlist overflow", "local", e.self, "remote", p, "would be", wouldBe)
	// truncate wantlist to avoid overflow
	available, o := bits.Sub(e.maxQueuedWantlistEntriesPerPeer, s, 0)
	if o != 0 {
		available = 0
	}
	wants = wants[:available]
}

e.peerRequestQueue.PushTasksTruncated(e.maxQueuedWantlistEntriesPerPeer, p, activeEntries...)

e.peerRequestQueue.PushTasksTruncated(e.maxQueuedWantlistEntriesPerPeer, entry.Peer, peertask.Task{

These three sections of code are needed to fix CVE-2023-25568: they make the server ignore any new queries that would push a peer's wantlist past 1024 entries. Previously the server would remember an unbounded number of CIDs, eventually OOMing.
However, the truncation code always prefers keeping existing entries over new ones.
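
To make the retention rule concrete, here is a minimal sketch of the current "truncate right" behavior (plain strings stand in for CIDs; the helper names are mine, not the engine's):

```go
package main

import "fmt"

const maxEntries = 4 // stand-in for the real 1024 limit

// truncateRight mirrors the current overflow handling: everything already
// queued is kept, and the incoming wants are cut down to whatever room is
// left, oldest-first.
func truncateRight(existing, incoming []string) []string {
	available := maxEntries - len(existing)
	if available < 0 {
		available = 0
	}
	if len(incoming) > available {
		incoming = incoming[:available]
	}
	return append(existing, incoming...)
}

func main() {
	queued := []string{"A", "B", "C"}
	// Only "D" fits; "E" is silently dropped, and nothing queued is evicted.
	fmt.Println(truncateRight(queued, []string{"D", "E"})) // [A B C D]
}
```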

This means we can get into a situation where we get stuck:

  • let's say the client sends a 2048-entry wantlist.
  • out of these we only have 1/3 of the CIDs, and these are uniformly distributed.
  • the server truncates the wantlist to 1024 entries.
  • the server starts serving these entries first; out of these, there are 683 CIDs we don't have.
  • we are now down to an effective wantlist size of 341, because the bitswap server never cleans up entries after sending DONT_HAVE.
    The point of this feature is -1 scaling: if the server is also downloading the same blocks, it might get them after having already sent DONT_HAVE, and can then send either the block or a HAVE message, overriding the previous DONT_HAVE.
  • this repeats, each time shrinking the usable wantlist on this connection, because the part of the wantlist the server is willing to keep fills up with CIDs it does not have.
    Eventually it reaches 0 (see the simulation sketch below).

(note: the client can send CANCEL or a message with the full flag and these 1024 "stuck" CIDs will be cleaned out properly, but the client isn't smart enough to realize this is happening)
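
A toy simulation of this death spiral, under the assumptions above (1024-entry cap, the server has 1/3 of the requested CIDs, DONT_HAVE entries are never cleaned up):

```go
package main

import "fmt"

func main() {
	const capacity = 1024
	stuck := 0 // DONT_HAVE entries that are never cleaned up
	for round := 1; stuck < capacity; round++ {
		free := capacity - stuck // slots the truncation leaves for new wants
		served := free / 3       // blocks we actually have and can send
		stuck += free - served   // the rest turn into sticky DONT_HAVEs
		fmt.Printf("round %d: served %d, usable wantlist now %d\n",
			round, served, capacity-stuck)
	}
}
```

The usable wantlist shrinks 1024 → 341 → 113 → 37 → ... and hits 0 within a handful of rounds.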


I see three possible solutions:

Truncate left instead of truncating right.

This means newer entries would override older ones, instead of older ones sticking around.
My original thinking when writing the fix was that the server currently does not apply back-pressure, so we should let existing entries go first so we can make some progress and clear them out as we send responses. If we always cancel what is already in flight, a client that is too fast would never get any blocks, because everything would be canceled before being sent.
This could be fixed by only canceling what is already queued, but not entries that are actively being worked on (sketched below).
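
A minimal sketch of that variant, assuming we can tell queued entries apart from in-flight ones (names hypothetical):

```go
package main

import "fmt"

const maxEntries = 4 // stand-in for the real 1024 limit

// truncateLeft evicts the oldest *queued* entries to make room for new ones,
// but never touches queue[:inFlight], the entries actively being worked on.
func truncateLeft(queue []string, inFlight int, incoming []string) []string {
	evict := len(queue) + len(incoming) - maxEntries
	if evict < 0 {
		evict = 0
	}
	if evictable := len(queue) - inFlight; evict > evictable {
		evict = evictable // can only evict queued, never in-flight, entries
	}
	queue = append(queue[:inFlight], queue[inFlight+evict:]...)
	queue = append(queue, incoming...)
	if len(queue) > maxEntries {
		queue = queue[:maxEntries] // still full of in-flight work; drop the rest
	}
	return queue
}

func main() {
	q := []string{"A", "B", "C"} // "A" is in flight, "B" and "C" are queued
	// "B" (the oldest queued entry) is evicted to admit the new wants.
	fmt.Println(truncateLeft(q, 1, []string{"D", "E"})) // [A C D E]
}
```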

Completely rewrite the server and architect it as request-response

If we handled messages in a request-response manner, we could apply back-pressure when the client is sending too many queries.
We could still have a global CID → [PeerID] wantlist map, but it would be limited to CIDs we don't have, for -1 scaling. CIDs we already have would skip this flow and the queue entirely and be handled purely in the message handler.
We would still need a limit on how many -1-tracked CIDs we keep, but that means this bug would only impact blocks we don't have, which SGTM.
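
A rough sketch of what the bounded CID → [PeerID] table could look like (names and the FIFO eviction policy are my assumptions, not an actual design):

```go
package main

import "fmt"

type wantIndex struct {
	limit int
	order []string            // insertion order, for simple FIFO eviction
	wants map[string][]string // CID -> peers still waiting on it
}

func newWantIndex(limit int) *wantIndex {
	return &wantIndex{limit: limit, wants: make(map[string][]string)}
}

// add records that peer wants cid; when the table is full, the oldest
// tracked CID is dropped so memory stays bounded.
func (w *wantIndex) add(cid, peer string) {
	if _, ok := w.wants[cid]; !ok {
		if len(w.order) == w.limit {
			oldest := w.order[0]
			w.order = w.order[1:]
			delete(w.wants, oldest)
		}
		w.order = append(w.order, cid)
	}
	w.wants[cid] = append(w.wants[cid], peer)
}

// onBlockReceived returns the peers to notify now that we have cid,
// and stops tracking it.
func (w *wantIndex) onBlockReceived(cid string) []string {
	peers := w.wants[cid]
	delete(w.wants, cid)
	for i, c := range w.order {
		if c == cid {
			w.order = append(w.order[:i], w.order[i+1:]...)
			break
		}
	}
	return peers
}

func main() {
	idx := newWantIndex(2)
	idx.add("Qm1", "peerA")
	idx.add("Qm2", "peerB")
	idx.add("Qm3", "peerC")                 // evicts Qm1, keeping memory bounded
	fmt.Println(idx.onBlockReceived("Qm2")) // [peerB]
}
```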

Make the client aware: have it handle rotation itself by truncating its own wantlist and canceling overwritten entries.

This is nice because it does not require updating the servers, only the client, so fixed clients can download from buggy servers.
It sounds like a PITA to code and I don't really want to do it tbh. It also means we commit to having broken the protocol.
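
For what it's worth, a sketch of the client-side rotation (sendWant/sendCancel are hypothetical stand-ins for the real wire messages):

```go
package main

import "fmt"

const serverLimit = 3 // stand-in for the real 1024

type client struct {
	outstanding []string // wants currently held by the server, oldest first
}

// want enforces the server's cap client-side: when full, explicitly CANCEL
// the oldest entry before sending a new WANT, so nothing gets stuck even on
// unfixed servers.
func (c *client) want(cid string) {
	if len(c.outstanding) == serverLimit {
		oldest := c.outstanding[0]
		c.outstanding = c.outstanding[1:]
		sendCancel(oldest) // free the slot before reusing it
	}
	c.outstanding = append(c.outstanding, cid)
	sendWant(cid)
}

func sendCancel(cid string) { fmt.Println("CANCEL", cid) }
func sendWant(cid string)   { fmt.Println("WANT", cid) }

func main() {
	c := &client{}
	for _, cid := range []string{"Qm1", "Qm2", "Qm3", "Qm4"} {
		c.want(cid) // the 4th want rotates out Qm1
	}
}
```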

@lidel
Member

lidel commented Feb 19, 2024

I don't think it was written down anywhere, but @aschmahmann and I had a verbal conversation agreeing that, as a compromise to avoid rewriting everything, we should try:

  • implementing "Truncate left instead of truncating right" (make new entries override old ones)
    • with a metric that shows how full the queue is (if we don't already have one)
    • with a configuration option to override the default MaxQueuedWantlistEntriesPerPeer, allowing operators to tune their setup so the queue matches increased load (see the sketch below)
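
For illustration, the override could be wired up roughly like this (the option name follows the comment above; package paths and exact signatures are assumptions, not confirmed boxo API):

```go
package main

import (
	"context"

	bsnet "github.com/ipfs/boxo/bitswap/network"
	"github.com/ipfs/boxo/bitswap/server"
	blockstore "github.com/ipfs/boxo/blockstore"
)

// newTunedServer raises the per-peer wantlist cap above the 1024 default so
// heavily loaded operators can match the queue to their traffic (assumed API).
func newTunedServer(ctx context.Context, net bsnet.BitSwapNetwork, bs blockstore.Blockstore) *server.Server {
	return server.New(ctx, net, bs,
		server.MaxQueuedWantlistEntriesPerPeer(4096), // assumed option name
	)
}
```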

gammazero added a commit that referenced this issue Jun 21, 2024
bitswap wantlist overflow handling now selects newer entries when truncating wantlist. This fix prevents the retained portion of the wantlist from filling up with CIDs that the server does not have.

Fixes #527
gammazero added a commit that referenced this issue Jun 23, 2024
wantlist overflow handling now cancels existing entries to make room for newer requests. This fix prevents the wantlist from filling up with CIDs that the server does not have.

Fixes #527
@lidel lidel closed this as completed in 42c0c86 Jul 30, 2024