Fix excessive receive buffer allocation #1611
Conversation
Force-pushed from ac6e864 to e58224e
});
let mut iovs = unsafe { iovs.assume_init() };
let iovs = unsafe { slice_assume_init_mut(&mut iovs[..recv_count]) };
If GRO is available, then recv_count is 1. But AIUI, in a single iovec, you can only receive messages from a single source. So this means that in each syscall you can receive messages from at most one remote.
Would it make more sense to split this more dynamically? E.g. if we detect burst activity, create 1 big iovec with 64 GRO segments, but if activity is more spread out, use many iovecs with one GRO segment each.
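A rough illustration of that idea (the names, the burst heuristic, and the 32-slot batch size are hypothetical, not quinn's actual code): keep the total buffer budget constant, but trade iovec count against GRO segments per iovec.

```rust
/// Hypothetical layout choice for a single receive syscall.
struct RecvLayout {
    /// Distinct remotes we can receive from in one recvmmsg() call.
    iovec_count: usize,
    /// GRO segments each iovec must be sized to hold.
    segments_per_iovec: usize,
}

const MAX_GRO_SEGMENTS: usize = 64;
const BATCH_SIZE: usize = 32; // assumed recvmmsg batch size

fn choose_layout(bursty: bool, gro_supported: bool) -> RecvLayout {
    if gro_supported && bursty {
        // A few senders transmitting hard: one big iovec sized for a full
        // 64-segment GRO batch, maximizing coalescing per syscall.
        RecvLayout { iovec_count: 1, segments_per_iovec: MAX_GRO_SEGMENTS }
    } else {
        // Many lightly active senders: many small iovecs, one segment each,
        // so a single syscall can drain several remotes.
        RecvLayout { iovec_count: BATCH_SIZE, segments_per_iovec: 1 }
    }
}
```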
Good catch; this is probably a regression for endpoints with many active connections. However, per #1354, if we're using GRO we must accept exactly 64 segments, or else we'll get inadvertent packet truncation. I guess we'll have to bite the bullet and lower the max datagram size...
Why is the current default 64k? Is this useful outside of communication on localhost?
The default is from RFC 9000, though we're free to pick something else. Whatever value we choose will impose a hard upper limit on MTU detection, and it'd be nice to Just Work on networks with jumbo frames. However, AFAIK such networks are rare, so I'm willing to drop this to a best-case internet MTU and let jumbo frame users adjust their configurations if necessary.
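For concreteness, a back-of-the-envelope comparison. The 32-slot batch size and the 64 KiB datagram size below are assumptions, chosen because they reproduce the ~128 MiB and ~4 MiB figures quoted in this PR; the 64-segment count is from #1354.

```rust
const BATCH_SIZE: usize = 32;       // assumed recvmmsg batch size
const MAX_GRO_SEGMENTS: usize = 64; // per #1354

fn main() {
    let max_datagram = 64 * 1024; // the current 64 KiB default
    let internet_mtu = 1_500;     // rough best-case internet MTU

    // Sizing every batch slot for a full GRO batch at 64 KiB (the reported bug):
    println!("{} MiB", BATCH_SIZE * MAX_GRO_SEGMENTS * max_datagram >> 20); // 128 MiB
    // One GRO-sized slot per syscall, as in this PR:
    println!("{} MiB", MAX_GRO_SEGMENTS * max_datagram >> 20); // 4 MiB
    // The same slot if the default dropped to an internet MTU:
    println!("{} KiB", MAX_GRO_SEGMENTS * internet_mtu >> 10); // ~93 KiB
}
```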
Do we have to allocate per segment, or can we allocate one large buffer, which would allow some jumbo frames provided that most frames are smaller?
If we receive a GRO batch consisting of many jumbo frame packets, and we did not allocate enough space for all of them, packets may be truncated (i.e. lost) per #1354.
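To put rough numbers on that risk (the 1500-byte and 9000-byte sizes below are purely illustrative assumptions):

```rust
const MAX_GRO_SEGMENTS: usize = 64;

fn main() {
    // A shared buffer sized for "typical" 1500-byte segments...
    let shared_buf = MAX_GRO_SEGMENTS * 1_500;  // 96_000 bytes
    // ...cannot absorb a worst-case batch of 9000-byte jumbo frames.
    let jumbo_batch = MAX_GRO_SEGMENTS * 9_000; // 576_000 bytes delivered
    // Per #1354 the excess is truncated rather than redelivered, so roughly
    // 53 of the 64 packets in such a batch would be lost.
    println!("bytes lost: {}", jumbo_batch - shared_buf); // 480_000
}
```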
Consensus in the implementers' Slack seems to be that recvmmsg isn't adding a lot here, i.e. we probably won't see much regression from doing one syscall per sender as in this PR. This is in principle workload dependent... it's tempting to leverage that to simplify our APIs a bunch, though.
Yeah, sounds like ditching recvmmsg() could make sense.
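For reference, a minimal sketch of what a one-recvmsg()-per-sender receive path could look like. This assumes the libc crate, omits the UDP_GRO segment-size cmsg and most error handling, and is not quinn's actual implementation.

```rust
use std::mem;
use std::net::UdpSocket;
use std::os::unix::io::AsRawFd;

const MAX_GRO_SEGMENTS: usize = 64; // per #1354

/// Receive one (possibly GRO-coalesced) batch from a single remote.
fn recv_one(sock: &UdpSocket, max_datagram_size: usize) -> std::io::Result<usize> {
    // The buffer must hold a full worst-case GRO batch, or the tail segments
    // get silently truncated.
    let mut buf = vec![0u8; MAX_GRO_SEGMENTS * max_datagram_size];
    let mut iov = libc::iovec {
        iov_base: buf.as_mut_ptr().cast(),
        iov_len: buf.len(),
    };
    let mut addr: libc::sockaddr_storage = unsafe { mem::zeroed() };
    let mut hdr: libc::msghdr = unsafe { mem::zeroed() };
    hdr.msg_name = (&mut addr as *mut libc::sockaddr_storage).cast();
    hdr.msg_namelen = mem::size_of::<libc::sockaddr_storage>() as libc::socklen_t;
    hdr.msg_iov = &mut iov;
    hdr.msg_iovlen = 1;
    // (control buffer for the UDP_GRO segment-size cmsg omitted for brevity)

    let n = unsafe { libc::recvmsg(sock.as_raw_fd(), &mut hdr, 0) };
    if n < 0 {
        Err(std::io::Error::last_os_error())
    } else {
        Ok(n as usize) // total bytes across all coalesced segments
    }
}
```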
Allocation logic previously considered a GRO batch to be one datagram, rather than (when supported) 64. This led to an unintended ~128MiB allocation, now scaled back down to ~4MiB.
Force-pushed from e58224e to 130de25
Closing this, as we decided to merge #1615 instead.
Allocation logic previously considered a GRO batch to be one datagram, rather than (when supported) 64. This led to an unintended 128MiB allocation, now scaled back down to ~4MiB.
Fixes #1608.