artifact store: fix correctness bug; remove copy read timeout #7866
Conversation
This looks fine to me, but it's heavily using some sharp-edged stuff that I haven't used a lot (spawn_blocking, blocking_recv, etc.), so it might be nice to get another set of eyes on it. Maybe @jgallagher or @sunshowers?
sled-agent/src/artifact_store.rs (outdated)

        OverwriteBehavior::AllowOverwrite,
        temp_dir,
    );
    let (tx, mut rx) = mpsc::channel(16); // TODO
Is there something in particular you plan to do for this // TODO? If it's just that this seems like an arbitrary number, that's fair -- I'd just document that.
It's unfortunate that the buffering here is per-message and not based on the amount of bytes. I don't really see an easy way around that, though, and I'm not sure it's worth doing much work to improve it. Maybe some day it'll be worth creating an async atomicwrites.
It's maybe worth noting that this is the number of chunks from dropshot's StreamingBody we're willing to buffer up if the writing task(s) are slow? I'm not sure how big those chunks are in practice, nor how likely a slow writer is.
When I was implementing it, the typical chunk sizes I saw were around 50-100 KiB.
I meant to figure out what the expected chunk sizes are (we have both dropshot's StreamingBody and reqwest's Body to take into account here) and then go from there, but haven't yet.
StreamingBody just forwards the Bytes chunks returned from reqwest's Body -- there's no rechunking or additional copies.
As far as I can tell, the maximum chunk size from Nexus (which StreamingBody wraps) is bounded by the buffer sizes in Nexus, which by default are ~400 KB for both HTTP/1 and HTTP/2 clients. reqwest presumably uses hyper's defaults for clients, which are ~400 KB for HTTP/1 and 1 MB for HTTP/2.
I am not really sure what a good number here is, other than maybe "1". If we're only pulling bytes off the wire as fast as we can write them to both M.2s, we're not going to buffer a bunch of network traffic and then have backpressure later.
I do think I should adjust the code to write the chunk to both M.2s simultaneously, though; right now it's done serially. Roughly along the lines of the sketch below.
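For illustration, here is a minimal sketch of fanning one chunk out to both write tasks concurrently, assuming one mpsc sender per M.2 write task (the helper name is made up):

    use bytes::Bytes;
    use tokio::sync::mpsc;

    /// Hypothetical helper: hand one chunk to both M.2 write tasks at once
    /// instead of one after the other. `Bytes::clone` only bumps a refcount,
    /// so the chunk data itself is not copied.
    async fn send_to_both(
        tx_a: &mpsc::Sender<Bytes>,
        tx_b: &mpsc::Sender<Bytes>,
        chunk: Bytes,
    ) -> Result<(), mpsc::error::SendError<Bytes>> {
        // Both sends make progress concurrently, so we wait only for the
        // slower of the two channels rather than the sum of the two.
        tokio::try_join!(tx_a.send(chunk.clone()), tx_b.send(chunk))?;
        Ok(())
    }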
I think a max memory use of up to 64 MB is fine, so a capacity of 64 (assuming the 1 MB upper bound per chunk) seems reasonable.
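If the capacity does end up pinned like that, one way to address the earlier "I'd just document that" note is to name the constant and record the reasoning next to it (a sketch; the constant and function names are invented):

    use bytes::Bytes;
    use tokio::sync::mpsc;

    /// Number of `Bytes` chunks each write task will buffer. Chunk sizes are
    /// bounded by the hyper/dropshot buffer sizes discussed above (~400 KB
    /// for HTTP/1, up to ~1 MB for HTTP/2), so 64 chunks caps buffered data
    /// at roughly 64 MB per transfer.
    const CHUNK_CHANNEL_CAPACITY: usize = 64;

    fn chunk_channel() -> (mpsc::Sender<Bytes>, mpsc::Receiver<Bytes>) {
        mpsc::channel(CHUNK_CHANNEL_CAPACITY)
    }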
The async bits LGTM.
Makes uploading a TUF repo into a4x2 take 15 minutes instead of N hours, and doesn't fill up the disk once that's done. Ship it!
This fixes the two main problems identified in #7796:
In the previous implementation, we always wrote temporary files to tmp/{sha256} and returned an error if that file was already present. Due to the Drop behavior of Utf8TempFile, what I think was an attempt to simplify the code resulted in deleting another task's temporary file when returning an "already in progress" error. (This PR adds a regression test for this issue, which I've verified fails as expected on the current implementation.)

This replaces the temporary-file persisting logic with the atomicwrites crate, as suggested by @sunshowers, which is resistant to trivial mistakes like this. AtomicFile::write takes a function that must perform the entire write operation; if the function returns an error, the file is not renamed to the final path. To make it work in an asynchronous context, AtomicFile::write is placed on a blocking task and bytes are sent to it over an mpsc channel. The write task also computes a checksum of the data, returning an error to prevent persisting the file if the checksum is invalid.
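As a rough sketch of that shape (not the actual omicron code; the function name, signature, and error handling here are simplified for illustration):

    use std::io::Write;
    use std::path::Path;

    use atomicwrites::{AtomicFile, OverwriteBehavior};
    use bytes::Bytes;
    use sha2::{Digest, Sha256};
    use tokio::{sync::mpsc, task};

    /// Sketch of the write path: chunks arrive over an mpsc channel from the
    /// async handler, and `AtomicFile::write` runs on a blocking task. The
    /// temporary file is only renamed to `final_path` if the closure returns
    /// `Ok`, i.e. every write succeeded and the digest matched.
    /// (The PR's code also passes an explicit temp dir; plain `new` keeps
    /// this sketch short.)
    async fn write_artifact(
        final_path: &Path,
        expected_sha256: [u8; 32],
        mut chunks: mpsc::Receiver<Bytes>,
    ) -> std::io::Result<()> {
        let file = AtomicFile::new(final_path, OverwriteBehavior::AllowOverwrite);
        let writer = task::spawn_blocking(move || {
            file.write(|f| {
                let mut hasher = Sha256::new();
                // `blocking_recv` parks this blocking-pool thread until the
                // async side sends another chunk or drops the sender.
                while let Some(chunk) = chunks.blocking_recv() {
                    hasher.update(&chunk);
                    f.write_all(&chunk)?;
                }
                if hasher.finalize().as_slice() != expected_sha256.as_slice() {
                    // Returning an error here aborts the rename, so a corrupt
                    // artifact is never persisted at `final_path`.
                    return Err(std::io::Error::other("sha256 mismatch"));
                }
                Ok(())
            })
        });
        writer
            .await
            .expect("blocking writer task panicked")
            .map_err(|e| std::io::Error::other(e.to_string()))
    }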
atomicwrites also syncs the file as well as the parent directories, meaning we can remove that code from our implementation.
Since the logic for detecting multiple in-progress transfers relied on the predictable naming of temporary files, I removed that. The original reasoning for checking this was to avoid doing unnecessary work. The current implementation allows writing an artifact that already exists, though, so long as someone else isn't trying to write it at the same time. (I intentionally allowed overwriting an existing artifact so that Nexus could work around an incorrectly-written artifact in the store if it noticed such a thing happening; it doesn't currently.)

I don't know which is better; in theory, allowing multiple writers means that if one fails spuriously, the other that's running could still succeed. It would be simple enough to add the proposed change in #7860 to this PR as well.
The 15-second read timeout on copy requests was in place to keep a stalled transfer from holding up any other attempts to replicate the artifact. Since we no longer stop those other attempts, the read timeout can be removed.