RCORE-2160 Make upload completion reporting multiprocess-compatible #7796

tgoyne · 2024-06-10T19:02:20Z

Rather than tracking a bunch of derived state in memory, check for upload completion by checking whether there are any unuploaded changesets. This is both multiprocess-compatible and more precise than the old checks, which had some false negatives and minor inconsistencies. For example, previously creating local commits which produced empty changesets and then calling wait_for_upload_completion() would complete immediately, but pausing and then resuming the session would make it wait until the new session performed the upload scan, which didn't happen until after download completion.

The synchronous completion waits (which are hopefully only used in tests) are now just thin wrappers around the async waits. This exposed a small inconsistency around when completion happened when the sync client is stopped, which is something we don't expose publicly so changing it should be fine.
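As a rough illustration of what "thin wrapper around the async wait" means here, the following is a minimal sketch and not the code in this PR. `wait_blocking` and `Completion` are hypothetical names, and the async operation is modeled as any callable that accepts a `bool` completion callback.

```cpp
#include <cassert>
#include <condition_variable>
#include <functional>
#include <memory>
#include <mutex>

// Hypothetical callback shape: true on completion, false if the wait was
// canceled (for example because the client was stopped).
using Completion = std::function<void(bool)>;

// Blocks the calling thread until `async_wait` invokes the completion it is
// given, then returns the value passed to that completion.
template <class AsyncWait>
bool wait_blocking(AsyncWait&& async_wait)
{
    struct State {
        std::mutex mutex;
        std::condition_variable cv;
        bool done = false;
        bool ok = false;
    };
    // Shared ownership keeps the state alive even if the callback outlives
    // this stack frame.
    auto state = std::make_shared<State>();

    async_wait([state](bool ok) {
        {
            std::lock_guard<std::mutex> lock(state->mutex);
            state->done = true;
            state->ok = ok;
        }
        state->cv.notify_one();
    });

    std::unique_lock<std::mutex> lock(state->mutex);
    state->cv.wait(lock, [&] { return state->done; });
    return state->ok;
}
```

With this shape the blocking call has no completion logic of its own, so it can only return when the async completion fires, including the cancellation case.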

tgoyne · 2024-06-10T19:05:51Z

src/realm/sync/client.cpp

        REALM_ASSERT(self->m_actualized);
+        if (!status.is_ok()) {


If post() itself failed we previously never called the completion callback, while now we report the error to the callback. The event loop being able to fail is sort of weird and I'm not sure it can actually happen?

I'd expect the only error we could get here to be OperationAborted if the event loop were shut down before the sync client.
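A minimal sketch of the behavior change described in this thread, under stated assumptions: `Status`, `EventLoop`, and `async_wait` below are hypothetical stand-ins rather than the realm-core types, and the "stopped loop" case only models the OperationAborted scenario mentioned above.

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <utility>

// Hypothetical minimal status type; not realm::Status.
struct Status {
    bool ok;
    std::string reason;
};

// Hypothetical event loop which hands an error to the handler if it has
// already been stopped.
struct EventLoop {
    bool stopped = false;

    void post(const std::function<void(Status)>& handler)
    {
        handler(stopped ? Status{false, "operation aborted"} : Status{true, ""});
    }
};

// Always invokes `on_complete` exactly once, including when post() fails.
void async_wait(EventLoop& loop, std::function<void(Status)> on_complete)
{
    loop.post([on_complete = std::move(on_complete)](Status status) {
        if (!status.ok) {
            // Report the failure instead of silently dropping the callback.
            on_complete(std::move(status));
            return;
        }
        // Real code would register the wait here; the sketch completes at once.
        on_complete(Status{true, ""});
    });
}
```

The point of the early return is that a caller blocked on the completion is never left waiting forever when the event loop is gone.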

tgoyne · 2024-06-10T19:07:15Z

src/realm/sync/client.cpp

@@ -1564,6 +1497,23 @@ void SessionWrapper::force_close()
    m_sess = nullptr;
    // Everything is being torn down, no need to report connection state anymore
    m_connection_state_change_listener = {};
+
+    // All outstanding wait operations must be canceled


Moving this from finalize() to force_close() means that in tests we send the notifications when the client is shut down rather than when the session is abandoned, matching the old behavior of blocking wait for completion or client stop. I think this is clearly correct for tests and should have no effect outside of test code.

tgoyne · 2024-06-10T19:09:44Z

test/object-store/sync/flx_sync.cpp

@@ -730,6 +730,7 @@ TEST_CASE("flx: client reset", "[sync][flx][client reset][baas]") {
                REQUIRE(mode == ClientResyncMode::Recover);
                auto subs = local_realm->get_latest_subscription_set();
                subs.get_state_change_notification(sync::SubscriptionSet::State::Complete).get();
+                subs.refresh();


This test was relying on wait_for_upload_completion() waiting for subscription changes to be uploaded (which it would only sometimes do and wasn't actually guaranteed), which happened to result in the subscription state being Complete before the call to get_state_change_notification() so this worked without the refresh.

Has this been flaky lately or something? I think wait_for_upload_completion() used to guarantee this, but maybe that's changed from under me.

check_for_upload_completion() has had specific logic to report completion even if there are unuploaded changesets as long as it had scanned all of the changesets (i.e. as long as all of the remaining ones are empty or from the server) since 2018, and I'm pretty sure there was equivalent behavior achieved differently before that.

The change in functionality that broke this test is that we don't scan the changesets to see if any needed to be uploaded until after the first DOWNLOAD is received, so previously empty changesets made wait_for_uploads() wait for the first DOWNLOAD message and now it doesn't. It's probably possible to preserve that behavior, but it seems really weird and inconsistent (particularly because the presence of empty changesets may not be directly related to anything the developer did).

Either way, the test was incorrect; it should either be waiting on the state change notification and then calling refresh or simply asserting the state without waiting. Waiting then asserting without the refresh in between doesn't really make any sense.
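To make the rule in this thread concrete (changesets that are empty or that came from the server never block upload completion), here is a minimal sketch over a simplified in-memory history. `HistoryEntry` and `has_pending_uploads` are hypothetical names, and treating an origin file ident of 0 as "produced locally" is an assumption of the sketch, not a statement about the actual storage format.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

// Hypothetical, simplified model of one sync history entry.
struct HistoryEntry {
    std::string changeset;           // serialized changeset, possibly empty
    std::uint64_t origin_file_ident; // assumption: 0 means produced locally
};

// `history` holds one entry per version, ending at `current_version`.
// Returns true if any version after `uploaded_version` still needs uploading.
bool has_pending_uploads(const std::vector<HistoryEntry>& history,
                         std::uint64_t uploaded_version,
                         std::uint64_t current_version)
{
    assert(uploaded_version <= current_version);
    if (uploaded_version == current_version)
        return false;

    // Only the trailing `count` entries are newer than the upload cursor.
    const std::uint64_t count = current_version - uploaded_version;
    assert(count <= history.size());

    for (std::size_t i = history.size() - static_cast<std::size_t>(count); i < history.size(); ++i) {
        const HistoryEntry& entry = history[i];
        // Empty changesets and changesets received from the server are skipped.
        if (entry.origin_file_ident == 0 && !entry.changeset.empty())
            return true;
    }
    return false;
}
```

Because the answer is derived only from persisted history plus the upload cursor, any process that can read the file could compute it, which is the multiprocess-compatibility argument made in the PR description.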

tgoyne · 2024-06-10T19:16:12Z

test/object-store/util/sync/sync_test_utils.cpp

@@ -737,7 +737,7 @@ struct BaasFLXClientReset : public TestClientReset {
        if (m_on_post_local) {
            m_on_post_local(realm);
        }
-        wait_for_upload(*realm);
+        wait_for_download(*realm);


This was relying on wait_for_upload() waiting for subscription changes after resume(). The thing we're actually waiting for here is a server roundtrip so that we receive the client reset error, which wait_for_download() does guarantee.

coveralls-official · 2024-06-12T20:11:39Z

Pull Request Test Coverage Report for Build thomas.goyne_416

Details

205 of 210 (97.62%) changed or added relevant lines in 10 files are covered.
73 unchanged lines in 15 files lost coverage.
Overall coverage decreased (-0.006%) to 90.941%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
src/realm/sync/noinst/client_history_impl.cpp	34	36	94.44%
src/realm/sync/client.cpp	59	62	95.16%

Files with Coverage Reduction	New Missed Lines	%
src/realm/array_string.cpp	1	87.23%
src/realm/object-store/sync/async_open_task.cpp	1	88.36%
src/realm/sort_descriptor.cpp	1	94.06%
src/realm/util/serializer.cpp	1	90.43%
test/fuzz_tester.hpp	1	57.73%
test/test_util_network.cpp	1	95.56%
src/realm/cluster.cpp	2	75.6%
test/test_all.cpp	2	75.82%
src/realm/sync/client.cpp	3	91.26%
src/realm/sync/noinst/client_impl_base.cpp	6	81.93%

Totals
Change from base Build thomas.goyne_415:	-0.006%
Covered Lines:	214551
Relevant Lines:	235923

💛 - Coveralls

coveralls-official · 2024-06-18T21:02:47Z

Pull Request Test Coverage Report for Build thomas.goyne_419

Details

112 of 117 (95.73%) changed or added relevant lines in 8 files are covered.
No unchanged relevant lines lost coverage.
Overall first build on tg/upload-completion at 90.955%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
src/realm/sync/noinst/client_history_impl.cpp	34	36	94.44%
src/realm/sync/client.cpp	59	62	95.16%

Totals
Change from base Build 2430:	91.0%
Covered Lines:	214681
Relevant Lines:	236031

💛 - Coveralls

coveralls-official · 2024-06-20T21:30:40Z

Pull Request Test Coverage Report for Build thomas.goyne_420

Details

112 of 117 (95.73%) changed or added relevant lines in 8 files are covered.
46 unchanged lines in 16 files lost coverage.
Overall coverage decreased (-0.002%) to 90.964%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
src/realm/sync/noinst/client_history_impl.cpp	34	36	94.44%
src/realm/sync/client.cpp	59	62	95.16%

Files with Coverage Reduction	New Missed Lines	%
src/realm/array_mixed.cpp	1	91.94%
src/realm/sort_descriptor.cpp	1	94.06%
src/realm/sync/noinst/client_impl_base.cpp	1	81.93%
src/realm/sync/noinst/server/server_history.cpp	1	63.7%
src/realm/util/compression.cpp	1	89.62%
test/fuzz_tester.hpp	1	57.73%
test/test_query2.cpp	1	98.73%
test/test_lang_bind_helper.cpp	2	93.2%
src/realm/sync/client.cpp	3	91.26%
src/realm/table.cpp	3	90.42%

Totals
Change from base Build 2432:	-0.002%
Covered Lines:	214645
Relevant Lines:	235966

💛 - Coveralls

danieltabacaru · 2024-06-26T14:41:42Z

src/realm/sync/noinst/client_history_impl.cpp

+    if (uploaded_version == current_client_version)
+        return;
+
+    BinaryColumn changesets(db.get_alloc());


You can use m_arrays->changesets and m_arrays->origin_file_idents.

This is a static function.

danieltabacaru · 2024-06-26T14:42:22Z

src/realm/sync/noinst/client_history_impl.cpp

+    // empty changesets and did not need to be uploaded. If this is less than
+    // uploaded_version, we have changesets which have been uploaded but the
+    // server has not yet told us we can delete and we may need to use for merging.
+    auto base_version = current_client_version - changesets.size();


I think you should use m_sync_history_base_version here instead

danieltabacaru · 2024-06-26T14:43:39Z

src/realm/sync/noinst/client_history_impl.cpp

+    }
+
+    auto count = size_t(current_client_version - uploaded_version);
+    for (size_t i = changesets.size() - count; i < changesets.size(); ++i) {


you can take a look at the loop in trim_sync_history() since you're doing something similar

I don't know what this comment means. I have indeed looked at that loop?

I missed that the function is static. I meant that you could make the loop pretty much the same.

danieltabacaru · 2024-06-26T15:45:40Z

src/realm/sync/client.cpp

-
-    void on_upload_completion();
+    version_type m_upload_completion_requested_version = -1;
+
    void on_download_completion();


It'd be nice to align all completion handlers at some point given the current refactoring.

The end state of all this does need to be that download completion is also determinable by inspecting the Realm file, but it's significantly more complicated to get there for downloads (as the server is the source of truth for download completion rather than the client). I'm trying to split off each of the separate pieces to avoid having another monster PR that changes everything.

coveralls-official · 2024-07-01T16:48:51Z

Pull Request Test Coverage Report for Build thomas.goyne_425

Details

112 of 117 (95.73%) changed or added relevant lines in 8 files are covered.
91 unchanged lines in 18 files lost coverage.
Overall coverage decreased (-0.01%) to 90.99%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
src/realm/sync/noinst/client_history_impl.cpp	34	36	94.44%
src/realm/sync/client.cpp	59	62	95.16%

Files with Coverage Reduction	New Missed Lines	%
src/realm/sync/instructions.hpp	1	76.03%
test/test_table.cpp	1	99.51%
src/realm/array_blobs_big.cpp	2	98.58%
src/realm/query_expression.hpp	2	93.81%
src/realm/mixed.cpp	3	86.46%
src/realm/sync/noinst/protocol_codec.hpp	3	74.07%
src/realm/util/future.hpp	3	95.94%
src/realm/util/fifo_helper.cpp	4	85.11%
test/object-store/util/sync/baas_admin_api.cpp	5	84.93%
src/realm/bplustree.cpp	6	72.55%

Totals
Change from base Build 2454:	-0.01%
Covered Lines:	215141
Relevant Lines:	236444

💛 - Coveralls

tgoyne self-assigned this Jun 10, 2024

cla-bot bot added the cla: yes label Jun 10, 2024

tgoyne force-pushed the tg/upload-completion branch from 961d9d7 to 61c2bed Compare June 10, 2024 19:49

tgoyne force-pushed the tg/download-progress branch from 5680af0 to caff9c2 Compare June 12, 2024 17:21

tgoyne force-pushed the tg/upload-completion branch 2 times, most recently from 6446366 to 8213071 Compare June 12, 2024 18:10

tgoyne force-pushed the tg/download-progress branch from caff9c2 to 6501c2c Compare June 12, 2024 19:16

tgoyne force-pushed the tg/upload-completion branch from 8213071 to f29c23c Compare June 12, 2024 19:16

tgoyne force-pushed the tg/download-progress branch from 6501c2c to 5931242 Compare June 18, 2024 19:14

Base automatically changed from tg/download-progress to master June 18, 2024 20:08

realm deleted a comment from coveralls-official bot Jun 18, 2024

tgoyne force-pushed the tg/upload-completion branch from f29c23c to 3657ee6 Compare June 18, 2024 20:25

tgoyne marked this pull request as ready for review June 18, 2024 21:23

tgoyne requested review from jbreams and danieltabacaru June 18, 2024 21:26

tgoyne force-pushed the tg/upload-completion branch from 3657ee6 to b9cd461 Compare June 20, 2024 20:38

danieltabacaru reviewed Jun 26, 2024

danieltabacaru approved these changes Jun 27, 2024

tgoyne force-pushed the tg/upload-completion branch from b9cd461 to c1921b1 Compare June 27, 2024 17:43

jbreams approved these changes Jun 27, 2024

tgoyne added 2 commits July 1, 2024 08:58

Make wait_for_(upload|download)_complete_or_client_stopped() thin wrappers around async completion

c84fcb6

tgoyne force-pushed the tg/upload-completion branch from c1921b1 to c6b7d3d Compare July 1, 2024 15:58

tgoyne merged commit fb46803 into master Jul 1, 2024
40 checks passed

tgoyne deleted the tg/upload-completion branch July 1, 2024 16:59

kiburtse mentioned this pull request Jul 8, 2024

download progress estimate is always 1.0 #7869

Closed

github-actions bot locked as resolved and limited conversation to collaborators Jul 31, 2024
