Check structural equality in QueryInfo#setDiff, again. #7997

benjamn · 2021-04-16T15:34:06Z

Revisiting PR #6891, which was reverted by commit c2ef68f.

With the introduction of canonical cache results (#7439), the strict equality check in setDiff is about to become mostly synonymous with deep equality checking (but much faster to check), so we need to deal with the consequences of #6891 one way or another.

As evidence that this change now (after #7439) has no observable impact, notice that we were able to switch back to using !equal(a, b) without needing to fix/update any failing tests, compared to the handful of tests that needed updating in #6891.

However, by switching to deep equality checking, we allow ourselves the option to experiment with disabling (or periodically clearing) the ObjectCanon, which would presumably lead to broadcasting more !== (but deeply equal) results to cache.watch watchers.

With deep equality checking in setDiff, those !== results will get past the setDiff filter in the same cases canonical objects would, so there will be less of a discrepancy in client behavior with/without canonization enabled.

brainkim · 2021-04-17T00:23:34Z

@benjamn
I’m okay with this if you think it’s a good idea, but I’m concerned about this whole file (QueryInfo.ts) in general. It feels a little bit like we are using equality checks here with InMemoryCache.watch() to implement behavior which would better be encapsulated in a place like QueryManager.fetchQueryByPolicy(), where the fetch policy logic is mostly implemented. The fact of the matter is that setDiff() is only called in one place in this file (in a callback), and in other places QueryInfo.diff is assigned manually, so the main purpose of this method seems to be to notify callback listeners in a setTimeout() callback. Therefore, it feels deeply unclear to me why referential equality, or even structural equality, has semantic meaning here, or what processes it’s supposed to kick off. In my wine-addled opinion, the behavior of this code feels emergent rather than intentional, and I think we should consider refactoring this stuff to clarify why a local cache miss here would trigger a network request somewhere else.

The other concerns are more practical. The studio team indicated that they were having issues with cache.evict not re-triggering network requests (PR 3097 in studio). Is this a use-case we’re supporting? Because I have no idea what cache eviction has to do with queries re-firing. I would like to make sure we have a better handle on their use-case.

Additionally, my current understanding of these optimism caches is that they are performance optimizations, equality checks aside. What is the impact of using structural equality checking here? The unknowns are compounded by the fact that this new canonicalization is layered on top of the optimism-based caching so I am very much in the dark about whether there are good reasons to switch from referential equality to structural equality here.

For these reasons, I am unable to approve or disapprove of this PR. Of course, I’m still somewhat new to this codebase and defer to your expertise.

benjamn · 2021-04-19T15:08:24Z

Therefore, it feels deeply unclear to me why referential equality, or even structural equality, has semantic meaning here, or what processes it’s supposed to kick off.

@brainkim I agree this pipeline could be simplified, but I think it would be wise to separate the following two questions:

How should a watched query react upon receiving new data? This is a deep and open problem space, and it's safe to say our current collection of built-in fetch policies does not come anywhere close to covering all possible use cases / desired behaviors. For example, you could imagine a sophisticated policy that attempts to refetch individual objects with missing fields, to avoid refetching the whole query. The sky is the limit, though simplicity/predictability is at a premium for the common use cases.
Should a watched query ever receive the same data twice in a row? Regardless of how the query is supposed to react to new data, it's probably unnecessary and potentially a performance problem to notify the query twice in a row with the "same" data. I put "same" in quotes because sameness could mean either strict referential equality or some notion of deep equality. However, I believe the canonization trick (Efficiently canonicalize InMemoryCache result objects. #7439) forces our hand here, since strict equality checking with canonical objects amounts to deep equality checking. If someone were to disable canonization for performance or debugging reasons, or cleared the ObjectCanon to reclaim memory, it would be unfortunate if this code suddenly started letting more !== results through. If we stick to deep equality checking here, the behavior will be the same whether canonization is happening or not.

My goal with this distinction is to suggest that the second question is much easier to answer (no, watched queries should never receive the same deeply equal data twice in a row), and justifies the changes in this PR, without requiring a full answer to the first question (though I would love to keep thinking about that too).

As for the Studio cache.evict issues, PR #8000 introduces an API we will be encouraging them to use instead of just calling cache.evict to trigger changes, so I'm confident this change won't be a problem for them this time, with guidance from us about how to use cache.refetchQueries. That guidance will be necessary anyway (even without this PR), since the introduction of canonization is likely to trigger the same problems with cache.evict that deep equality checking revealed in #6891, so we have to get to the bottom of those regressions before Studio can use v3.4 anyway.

brainkim · 2021-04-19T15:28:31Z

I believe the canonization trick (#7439) forces our hand here, since strict equality checking with canonical objects amounts to deep equality checking. If someone were to disable canonization for performance or debugging reasons, or cleared the ObjectCanon to reclaim memory, it would be unfortunate if this code suddenly started letting more !== results through.

Maybe a unit test would be in order, if this is the case? What exactly is the public expectation which would fail if we didn’t use structural equality here?

Similar to #7997, but further upstream.

benjamn · 2021-04-30T15:39:10Z

@brainkim I'm all for a test, but that will have to wait on making it possible to opt out of canonization (which is something I'm planning for v3.4 but haven't gotten to yet).

Similar to #7997, but further upstream.

Revisiting PR #6891, which was reverted by commit c2ef68f. With the introduction of canonical cache results (#7439), the strict equality check in setDiff is about to become mostly synonymous with deep equality checking (but much faster to check), so we need to deal with the consequences of #6891 one way or another. As evidence that this change now (after #7439) has no observable impact, notice that we were able to switch back to using !equal(a, b) without needing to fix/update any failing tests, compared to the few tests that needed updating in #6891. However, by switching to deep equality checking, we allow ourselves the option to experiment with disabling (or periodically clearing) the ObjectCanon, which would presumably lead to broadcasting more !== but deeply equal results to cache.watch watchers. With deep equality checking in setDiff, those !== results will get past the filter in the same cases canonical objects would, so there's less of a discrepancy in client behavior with/without canonization enabled.

#7997 (comment)

benjamn · 2021-05-12T21:02:23Z

@brainkim Alright, I added a regression test (that I can confirm fails if we use !== to test equality in setDiff) in e3ec93d. Look good to you?

brainkim

The pieces are falling into place it seems.

Similar to #7997, but further upstream.

benjamn added 🧩 implementation-detail 💸 caching labels Apr 16, 2021

benjamn added this to the Release 3.4 milestone Apr 16, 2021

benjamn self-assigned this Apr 16, 2021

benjamn requested a review from brainkim April 16, 2021 15:35

benjamn added a commit that referenced this pull request Apr 19, 2021

Use deep equality check before calling watch.callback.

dc2c1e3

Similar to #7997, but further upstream.

benjamn force-pushed the release-3.4 branch from f7caa87 to 4420c49 Compare April 29, 2021 18:49

benjamn added a commit that referenced this pull request May 1, 2021

Use deep equality check before calling watch.callback.

0572693

Similar to #7997, but further upstream.

benjamn added a commit that referenced this pull request May 5, 2021

Use deep equality check before calling watch.callback.

baa3e51

Similar to #7997, but further upstream.

benjamn added a commit that referenced this pull request May 11, 2021

Use deep equality check before calling watch.callback.

cca8d2b

Similar to #7997, but further upstream.

benjamn added 3 commits May 12, 2021 15:36

Improve subscribeAndCount type inference before writing more tests.

3dce4a8

Test that queryInfo.setDiff prevents duplicate results.

e3ec93d

#7997 (comment)

benjamn force-pushed the reinstate-pr-6891 branch from 4f1babc to e3ec93d Compare May 12, 2021 21:00

brainkim approved these changes May 12, 2021

View reviewed changes

benjamn merged commit ea3a05c into release-3.4 May 12, 2021

benjamn deleted the reinstate-pr-6891 branch May 12, 2021 21:53

benjamn added a commit that referenced this pull request May 12, 2021

Use deep equality check before calling watch.callback.

5b97a9c

Similar to #7997, but further upstream.

benjamn added a commit that referenced this pull request May 12, 2021

Use deep equality check before calling watch.callback.

38b162c

Similar to #7997, but further upstream.

benjamn added a commit that referenced this pull request May 13, 2021

Use deep equality check before calling watch.callback.

7befca3

Similar to #7997, but further upstream.

benjamn added a commit that referenced this pull request May 18, 2021

Use deep equality check before calling watch.callback.

4b66ff2

Similar to #7997, but further upstream.

hwillson added the 2021-06 label Jul 29, 2021

hwillson removed this from the MM-2021-06 milestone Jul 29, 2021

github-actions bot locked as resolved and limited conversation to collaborators Feb 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Check structural equality in QueryInfo#setDiff, again. #7997

Check structural equality in QueryInfo#setDiff, again. #7997

benjamn commented Apr 16, 2021

brainkim commented Apr 17, 2021 •

edited

Loading

benjamn commented Apr 19, 2021 •

edited

Loading

brainkim commented Apr 19, 2021

benjamn commented Apr 30, 2021

benjamn commented May 12, 2021

brainkim left a comment

Check structural equality in QueryInfo#setDiff, again. #7997

Check structural equality in QueryInfo#setDiff, again. #7997

Conversation

benjamn commented Apr 16, 2021

brainkim commented Apr 17, 2021 • edited Loading

benjamn commented Apr 19, 2021 • edited Loading

brainkim commented Apr 19, 2021

benjamn commented Apr 30, 2021

benjamn commented May 12, 2021

brainkim left a comment

Choose a reason for hiding this comment

brainkim commented Apr 17, 2021 •

edited

Loading

benjamn commented Apr 19, 2021 •

edited

Loading