Fix potential inefficient planning due to __typename #2137

pcmanus · 2022-09-12T13:41:19Z

This PR fixes a query planning performance issue discovered with some user data. Namely, planning for some particular queries was taking a very noticeable amount of time (~30 sec on my somewhat fancy laptop for reference).

Upon inspection, it was due to some large amount of plans being built and compared, but the only reason there were many plans considered was due to the consideration of multiple options for the __typename field in a number of places. Obviously, since __typename can be requested equally from any subgraph (as long as it knows the type), evaluating all those options was particularly wasteful.

The attaching commit avoids that problem by essentially making the query planning "ignore" __typename as it computes all the possible choices/plans to consider, and then add it back where it should at the very end when finalising the plan.

On the particular example mentioned above, this reduces the number of plans to consider/evaluate by the query planner to just one, and so query planner is reduced to some ~300ms (again, on my laptop, and it's a fairly large query/plan).

Testing this, I randomly run into a (very) minor bug with aliasing __typename, in that if one was to query __typename twice in the same selection by using an alias, the aliased version was discarded. This PR also opportunistically fixes this. Tbc, I can't think of a reason why anyone would request such a thing but it's valid graphQL and was definitively not handled properly so...

netlify · 2022-09-12T13:41:21Z

👷 Deploy request for apollo-federation-docs pending review.

Visit the deploys page to approve it

Name	Link
🔨 Latest commit	`5462b2d`

codesandbox-ci · 2022-09-12T13:42:11Z

This pull request is automatically built and testable in CodeSandbox.

To see build info of the built libraries, click here or the icon next to each commit SHA.

pcmanus · 2022-09-12T13:47:14Z

Note: I'm not absolutely in love with my solution here in principle: the "attachement" thing feels a bit of a hack. But it's the simplest solution that I could convince myself would work, and it's relatively simple and "contained", so figure this might do, at least until some better idea comes in.

pcmanus · 2022-09-12T13:51:39Z

internals-js/src/operations.ts

@@ -86,7 +107,6 @@ export class Field<TArgs extends {[key: string]: any} = {[key: string]: any}> ex
    readonly alias?: string
  ) {
    super(definition.schema(), variablesInArguments(args));
-    this.validate();


Note to reviewers: somewhat unrelated minor optimisation. While starting to look at this issue, I fired up profiling to see if anything specific showed up. While nothing egregious showed up, this validation came up higher than it should have, and upon reflexion, doing that validation in the constructor was a bit heavy. And since we have a validate() method on SelectionSet that we call when needed (but less aggressively), simply made it so this Field.validate was called only at the same time (outside of performance, this even feel a bit more expected anyway).

pcmanus · 2022-09-12T13:55:13Z

internals-js/src/operations.ts

@@ -1261,7 +1287,11 @@ export class SelectionSet extends Freezable<SelectionSet> {
  clone(): SelectionSet {
    const cloned = new SelectionSet(this.parentType);
    for (const selection of this.selections()) {
-      cloned.add(selection.clone());
+      const clonedSelection = selection.clone();


Notes to reviewer: another minor stuff that profiling showed up. I could probably have put those into a separate commit, but I was lazy. Sorry.

pcmanus · 2022-09-12T13:58:17Z

query-planner-js/src/buildPlan.ts

          operationName,
        );

+    operation = operation.optimize(fragments);


Note to reviewers: a very minor optimistic refactor. The optimize method was called in each of operationForEntitiesFetch and operationForQueryFetch which was both less DRY, but also less readable in the sense that we always want to call that method, and it has no reason to depend on the kind of operation we deal with.

The code from apollographql#2137 is optimize some use of __typename when other fields are queried alongside it. But when __typename had only inline fragments alongside it, the logic was flawed and was discarding any potential instance of the optimization done somewhere more nested. This fixes that problem.

The optimization of apollographql#2137 is meant to have no impact on the generated plans: it's an optimization that ensure the query planner does not needlessly evaluate ineffient options just because __typename happens to be resolvable from any subgraph. But this make testing that optimization a bit more complicated. To work around this, this patch adds a new behaviour to the query planner whereby along the generation of the plan, it also exposes some statistics on the plan generation. As of this commit, the only thing exposed is the number of plan that were evaluated under the hood, as that is what we care to check here (but it is also one of the main contributor to time spent query planning, so arguably an important statistic). There was probably more hacky ways to test this optimization, but it while this statistic is currently only use for testing, this feels like something that could have other uses and be useful to enrich later.

The code from #2137 is optimizing some use of __typename when other fields are queried alongside it. But when __typename had only inline fragments alongside it, the logic was flawed and was discarding any potential instance of the optimization done somewhere more nested. This fixes that problem. Additionally, this patch adds a test for the optimization of #2137, but to do that, this patch adds a new behaviour to the query planner whereby along the generation of the plan, it also exposes some statistics on the plan generation. As of this commit, the only thing exposed is the number of plan that were evaluated under the hood, as that is what we care to check here (but it is also one of the main contributor to time spent query planning, so arguably an important statistic).

Fix potential inefficient planning due to __typename

5462b2d

pcmanus self-assigned this Sep 12, 2022

pcmanus commented Sep 12, 2022

View reviewed changes

clenfest approved these changes Sep 12, 2022

View reviewed changes

clenfest merged commit 081351d into apollographql:main Sep 12, 2022

abernix mentioned this pull request Sep 12, 2022

fix(deps): update rust crate router-bridge to 0.1.5 apollographql/router#1756

Merged

1 task

pcmanus mentioned this pull request Sep 13, 2022

Fix bug in patch for #2137 #2140

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix potential inefficient planning due to __typename #2137

Fix potential inefficient planning due to __typename #2137

pcmanus commented Sep 12, 2022

netlify bot commented Sep 12, 2022

codesandbox-ci bot commented Sep 12, 2022

pcmanus commented Sep 12, 2022

pcmanus Sep 12, 2022

pcmanus Sep 12, 2022

pcmanus Sep 12, 2022

Fix potential inefficient planning due to __typename #2137

Fix potential inefficient planning due to __typename #2137

Conversation

pcmanus commented Sep 12, 2022

netlify bot commented Sep 12, 2022

👷 Deploy request for apollo-federation-docs pending review.

codesandbox-ci bot commented Sep 12, 2022

pcmanus commented Sep 12, 2022

pcmanus Sep 12, 2022

Choose a reason for hiding this comment

pcmanus Sep 12, 2022

Choose a reason for hiding this comment

pcmanus Sep 12, 2022

Choose a reason for hiding this comment