Chunker: Always seek on the uncompressed stream. #15669

benjaminp · 2022-06-13T21:11:08Z

The `WriteRequest.write_offset` field has bizarre semantics during compressed uploads, as documented in the remote API protos: https://github.com/bazelbuild/remote-apis/blob/3b4b6402103539d66fcdd1a4d945f660742665ca/build/bazel/remote/execution/v2/remote_execution.proto#L241-L248. In particular, the write offset of the first `WriteRequest` refers to the offset in the uncompressed source.

This change ensures we always seek the uncompressed stream to the correct offset when starting an upload call. The old code could fail to resume compressed uploads under some conditions. The `progressiveCompressedUploadShouldWork` test purported to exercise this situation. The test, however, contained the same logic error as the code under test.
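To illustrate the idea of seeking on the uncompressed stream, here is a minimal sketch, not the actual `Chunker` code. It assumes the resume offset is a position in the uncompressed source, so the skip is applied to the raw stream and compression is layered on afterwards. The class and method names are hypothetical, and `DeflaterInputStream` from the JDK stands in for whatever compressor the real upload uses.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.zip.DeflaterInputStream;
import java.util.zip.InflaterInputStream;

/** Hypothetical helper; names are illustrative and not taken from Bazel. */
final class ResumeCompressedUpload {
  /**
   * Returns a compressed view of {@code source} that starts at
   * {@code uncompressedOffset}. The skip is done on the raw bytes, before
   * any compression is applied.
   */
  static InputStream openCompressedFrom(byte[] source, long uncompressedOffset)
      throws IOException {
    InputStream raw = new ByteArrayInputStream(source);
    long remaining = uncompressedOffset;
    while (remaining > 0) {
      long skipped = raw.skip(remaining);
      if (skipped <= 0) {
        throw new IOException("offset is past the end of the source");
      }
      remaining -= skipped;
    }
    // Assumption: Deflate stands in for the real compressor.
    return new DeflaterInputStream(raw);
  }

  /** Inflates a compressed stream fully; used only to check the result. */
  static byte[] inflate(InputStream compressed) throws IOException {
    try (InputStream in = new InflaterInputStream(compressed)) {
      return in.readAllBytes();
    }
  }

  public static void main(String[] args) throws IOException {
    byte[] source = "0123456789abcdefghij".getBytes(StandardCharsets.US_ASCII);
    // Resume after 10 uncompressed bytes have already been committed.
    byte[] tail = inflate(openCompressedFrom(source, 10));
    System.out.println(new String(tail, StandardCharsets.US_ASCII)); // prints "abcdefghij"
  }
}
```

Skipping on the compressed side instead would treat the offset as a count of compressed bytes, which is the kind of mismatch the description above warns about.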


brentleyjones · 2022-06-13T23:50:57Z

Would this fully fix #14654 then? Might be worth an inclusion in a 5.3 if one gets made.

brentleyjones · 2022-06-15T18:26:30Z

@bazel-io flag

ckolli5 · 2022-06-17T15:18:42Z

@bazel-io fork 5.3.0

ckolli5 · 2022-06-30T16:42:09Z

Hello @benjaminp, I am trying to cherry-pick these changes to release-5.3.0, but presubmit checks are failing. Could you please help me cherry-pick these changes with the appropriate commits? Thanks!

* Chunker: Always seek on the uncompressed stream.

  The `WriteRequest.write_offset` field has bizarre semantics during compressed uploads as documented in the remote API protos: https://github.com/bazelbuild/remote-apis/blob/3b4b6402103539d66fcdd1a4d945f660742665ca/build/bazel/remote/execution/v2/remote_execution.proto#L241-L248 In particular, the write offset of the first `WriteRequest` refers to the offset in the uncompressed source.

  This change ensures we always seek the uncompressed stream to the correct offset when starting an upload call. The old code could fail to resume compressed uploads under some conditions. The `progressiveCompressedUploadShouldWork` test purported to exercise this situation. The test, however, contained the same logic error as the code under test.

  Closes #15669.

  PiperOrigin-RevId: 455083727
  Change-Id: Ie22dacf31f15644c7a83f49776e7a633d8bb4bca

* Updated chunker.java file.

* Update src/test/java/com/google/devtools/build/lib/remote/ByteStreamUploaderTest.java

  Co-authored-by: Benjamin Peterson <benjamin@locrian.net>

* Update src/test/java/com/google/devtools/build/lib/remote/ByteStreamUploaderTest.java

  Co-authored-by: Benjamin Peterson <benjamin@locrian.net>

* Update src/test/java/com/google/devtools/build/lib/remote/ByteStreamUploaderTest.java

  Co-authored-by: Benjamin Peterson <benjamin@locrian.net>

Co-authored-by: Benjamin Peterson <benjamin@engflow.com>
Co-authored-by: Benjamin Peterson <benjamin@locrian.net>

jgao54 · 2022-08-25T08:59:54Z

src/main/java/com/google/devtools/build/lib/remote/Chunker.java

   */
  public void seek(long toOffset) throws IOException {
-    if (toOffset < offset) {
+    if (initialized && toOffset >= offset && !compressed) {
+      ByteStreams.skipFully(data, toOffset - offset);


I was just reviewing the release notes for Bazel 5.3.0 today and came across this.

It looks like, with this change, `offset` is no longer updated here when `skipFully` is called. I just want to sanity-check that this is the intended behavior. (I am not very familiar with Bazel internals or how `seek` is called; I am just worried that extra bytes would be discarded if `offset` is not updated.)

benjaminp · 2022-08-25

Thanks; this is a bug. I think it's unlikely to be triggered in practice, since seeking an initialized chunker forward is rare.

jgao54 · 2022-08-26

Got it, thanks for clarifying!
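The concern raised in this thread can be made concrete with a small sketch. This is not the Bazel `Chunker`; `SeekableSource` and its members are hypothetical and cover only an uncompressed source. The sketch shows a forward seek that records the new position after skipping, so that a second forward seek skips only the difference.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;

/** Hypothetical, simplified stand-in for a chunker over an uncompressed source. */
final class SeekableSource {
  private final byte[] bytes;
  private InputStream data;
  private long offset;

  SeekableSource(byte[] bytes) {
    this.bytes = bytes;
    reset();
  }

  /** Reopens the source at position 0. */
  private void reset() {
    data = new ByteArrayInputStream(bytes);
    offset = 0;
  }

  /** Positions the stream so the next read returns the byte at {@code toOffset}. */
  void seek(long toOffset) throws IOException {
    if (toOffset < offset) {
      reset(); // a plain InputStream cannot rewind, so start over
    }
    long toSkip = toOffset - offset;
    while (toSkip > 0) {
      long skipped = data.skip(toSkip);
      if (skipped <= 0) {
        throw new IOException("seek past end of source");
      }
      toSkip -= skipped;
    }
    // Record the new position. Without this line, a later forward seek
    // would compute its skip from a stale offset and discard extra bytes.
    offset = toOffset;
  }

  /** Reads one byte, or returns -1 at end of source. */
  int read() throws IOException {
    int b = data.read();
    if (b >= 0) {
      offset++;
    }
    return b;
  }

  long offset() {
    return offset;
  }
}
```

For example, on the source `abcdefghij`, calling `seek(2)` and then `seek(5)` leaves the next `read()` at `f`. If `offset` were not updated after the first skip, the second call would skip five more bytes and land on `h` instead.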

benjaminp requested a review from a team as a code owner June 13, 2022 21:11

sgowroji added the team-Remote-Exec and awaiting-review labels Jun 14, 2022

coeuvre approved these changes Jun 14, 2022


copybara-service bot closed this in dd57d41 Jun 15, 2022

benjaminp deleted the chunker-seeking branch June 15, 2022 14:32

bazel-io added the potential release blocker label Jun 15, 2022

bazel-io mentioned this pull request Jun 17, 2022

[5.3.0] Chunker: Always seek on the uncompressed stream. #15697

Closed

bazel-io removed the potential release blocker label Jun 17, 2022

ckolli5 mentioned this pull request Jun 22, 2022

Chunker: Always seek on the uncompressed stream. #15720

Closed

jgao54 reviewed Aug 25, 2022


ShreeM01 removed the awaiting-review label Sep 15, 2022


benjaminp Aug 25, 2022

jgao54 Aug 26, 2022
