
Support batched queries (fix #1812) #3490

Merged 5 commits into hasura:master on Dec 20, 2019

Conversation

@paf31 (Contributor) commented Dec 4, 2019

Description

This adds the GQLBatchedReqs type, which supports multiple batched queries (like the one described in the example linked from the original issue) as well as single queries, exactly as before.

It's unclear what to do with forwarded headers in the case where we have multiple queries, so to keep things simple for now, batched queries will not forward any headers. We can revisit this later if necessary.
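
For illustration, these are the two payload shapes the endpoint accepts after this change (the queries themselves are just examples):

# A single query, exactly as before:
curl -X POST 'http://localhost:8080/v1/graphql' -d '{"query":"query { user { id } }"}'

# A batch of operations, newly accepted via GQLBatchedReqs:
curl -X POST 'http://localhost:8080/v1/graphql' -d '[{"query":"query { user { id } }"},{"query":"query { role { id } }"}]'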

Affected components

  • Server
  • Docs

Related Issues

#1812

Catalog upgrade

Does this PR change Hasura Catalog version?

  • No
  • Yes
    • Updated docs with SQL for downgrading the catalog

Metadata

Does this PR add a new Metadata feature?

  • No
  • Yes
    • Does run_sql auto-manage the new metadata through schema diffing?
      • Yes
      • Not required
    • Does run_sql auto-manage the definitions of metadata on renaming?
      • Yes
      • Not required
    • Does export_metadata/replace_metadata support the new metadata added?
      • Yes
      • Not required

GraphQL

  • No new GraphQL schema is generated
  • New GraphQL schema is being generated:
    • New types and typenames are correlated

Breaking changes

  • No Breaking changes

  • There are breaking changes:

    1. Metadata API

      Existing query types:

      • Modify args payload which is not backward compatible
      • Behavioural change of the API
      • Change in response JSON schema
      • Change in error code
    2. GraphQL API

      Schema Generation:

      • Change in any NamedType
      • Change in table field names

      Schema Resolve:

      • Change in treatment of null value for any input fields
    3. Logging

      • Log JSON schema has changed
      • Log type names have changed

Steps to test and verify

I've tested this on the command line as follows:

$ curl -X POST 'http://localhost:8080/v1/graphql' -d '[{"query":"query first { user { email id name } }"},{"query":"query second { role { id name } }","variables":null}]' | jq -S
[
  {
    "data": {
      "user": [
        {
          "email": "test@test.com",
          "id": 1,
          "name": "test user"
        }
      ]
    }
  },
  {
    "data": {
      "role": [
        {
          "id": 1,
          "name": "admin"
        }
      ]
    }
  }
]

I haven't tested this with Apollo itself yet. Edit: I was able to test batching from Apollo with this small Node script:

const ApolloClient = require('apollo-client').default;
const BatchHttpLink = require('apollo-link-batch-http').BatchHttpLink;
const InMemoryCache = require('apollo-cache-inmemory').InMemoryCache;

const fetch = require('node-fetch');
const gql = require('graphql-tag').default;

// Batch queries issued close together into a single HTTP request
const link = new BatchHttpLink({
  uri: 'http://localhost:8080/v1/graphql',
  fetch
});

const client = new ApolloClient({ link, cache: new InMemoryCache() });

// These two queries are started in the same tick of the event loop,
// so the link sends them to the server as one batched request
client.query({
  query: gql`query first { user { id } }`,
}).then(user => console.log(user));

client.query({
  query: gql`query second { role { id } }`,
}).then(role => console.log(role));

Limitations, known bugs & workarounds


@lexi-lambda added the c/server (Related to server) and s/wip (Status: work in progress) labels on Dec 4, 2019
@lexi-lambda (Contributor) left a comment:

This LGTM, but we should probably update the docs as well. At the very least, we should update the API documentation for /v1/graphql, but I’m not sure if a mention elsewhere makes sense. @marionschleifer, do you have any thoughts?


@ecthiender (Member) commented:

I have a question. I think the convention around executing multiple queries is to gather all errors during execution (if there are any) and then return a response with both the data and the errors. The spec doesn't talk about this behaviour explicitly AFAICT, but the sub-sections of the Response section have examples that look like that, particularly example 185. The spec also mentions that if there are validation errors, the server can reject the entire query. I think most libraries follow the convention of executing all queries even if a previous one had execution errors, and returning data and errors together. We can also probably verify this with something like apollo-server (I think they support batched queries).

With the current implementation, it looks like if there's an execution error we get back only the error, and not the data for the other queries: if the first query fails, the results of the second query are not returned.

E.g.:
curl -X POST https://hge-ci-pull-3490.herokuapp.com/v1/graphql -d '[{"query":"mutation first { insert_user(objects: [{email: \"a@a.com\", name: \"alice\"}]) { returning { email id name } }}"},{"query":"query second { role { id name } }","variables":null}]' | jq -S
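
Under that convention, the response to a request like the one above would look roughly like this even when the first operation fails (a sketch only; the error object is abbreviated):

[
  {
    "errors": [
      {
        "message": "..."
      }
    ]
  },
  {
    "data": {
      "role": [
        {
          "id": 1,
          "name": "admin"
        }
      ]
    }
  }
]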

So the question is, shouldn't we follow the convention?


@paf31 (Contributor, Author) commented Dec 5, 2019

Since the spec doesn't mention batching, I think it'd be reasonable to treat batched requests as if each individual request were run in order. If the first request fails, I think it's fair to not run the second. Especially in the case where the first request is a mutation, it probably doesn't make sense to carry on to the second request. If the second is a query, it might return incorrect data, and if it is a mutation, its effects might depend on the effects of the first being successful.

Now it might be more reasonable to run batches of queries in parallel, and return errors in parallel too, allowing one but not all to succeed, but I think there's a case to be made that it's better to keep the semantics uniform across queries and mutations. Also, in the case of remote schemas, I'm not sure to what extent we can expect queries to be entirely non-side-effecting.

The example you linked suggests parallel semantics for fields within a single query, but I think that's fine because a single query can't include mutations inside it, whereas a batch of requests can mix queries and mutations.


lexi-lambda previously approved these changes on Dec 9, 2019
@@ -28,9 +28,60 @@ The following types of requests can be made using the GraphQL API:
- :doc:`Query / Subscription <query>`
- :doc:`Mutation <mutation>`

Batching requests
A reviewer (Contributor) commented:

I'm a bit confused by the title "Batching requests" and then, further down, the sentence "...we can send two queries in one request". Is it multiple queries in a request? Or multiple requests in a request? 😄

@paf31 (Contributor, Author) replied Dec 17, 2019:

Well, the example is two queries, but it's not restricted to just queries, because you can mix queries and mutations. So I need a word that covers both of those. Perhaps "request" isn't ideal, since there is also "the request", but just saying "query" would be incomplete.

@paf31 followed up:

I switched it to "operations" since that's the word the GraphQL spec uses.


@lexi-lambda merged commit c766881 into hasura:master on Dec 20, 2019

@jorroll commented Dec 22, 2019

> I think it'd be reasonable to treat batched requests as if each individual request were run in order. If the first request fails, I think it's fair to not run the second. Especially in the case where the first request is a mutation, it probably doesn't make sense to carry on to the second request. If the second is a query, it might return incorrect data, and if it is a mutation, its effects might depend on the effects of the first being successful.

@paf31 @ecthiender

Ooh, I very much disagree with this; it goes against the conventions established by other GraphQL servers. Apollo allows automatic batching of queries made within a certain timeframe, e.g. batching all queries made within 10 milliseconds of each other (a configuration sketch follows the example below). If Hasura implements batching as described above, unrelated queries in an app may fail depending on when they happen to execute, which is very unexpected and very undesirable!

  • Example: if a query for contact records happens to get batched with a query for addresses (because of timing), and the address query fails, I still want to see the contact record information. I certainly don't want to see contact record information only sometimes, depending on whether the contact record query happened to execute before the address query.
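
For reference, this is roughly how that automatic batching window is configured with apollo-link-batch-http (a sketch; the option values shown are, as far as I know, the library's defaults):

const { BatchHttpLink } = require('apollo-link-batch-http');
const fetch = require('node-fetch');

// Operations started within batchInterval ms of each other are sent
// to the server in a single HTTP request, up to batchMax per batch.
const link = new BatchHttpLink({
  uri: 'http://localhost:8080/v1/graphql',
  fetch,
  batchInterval: 10, // ms to wait before dispatching a batch
  batchMax: 10       // maximum number of operations per batch
});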

If someone wants queries/mutations to be executed as part of the same transaction (or if they want the order to matter), they should send them as part of a single operation, which is easy to do with aliases (see the sketch below). Batched queries (and mutations) should always be handled in parallel. I know this is the way graphql-ruby handles them, and I just verified that this is the way apollo-server handles them too.
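
As a sketch of that single-operation alternative, reusing the insert_user mutation from the earlier curl example: the two aliased fields below travel as one operation, so their ordering (and, I believe, their transactional grouping in Hasura) is well defined:

client.mutate({
  mutation: gql`
    mutation insertTwo {
      # Aliases let the same mutation field appear twice in one operation
      first: insert_user(objects: [{ email: "a@a.com", name: "alice" }]) {
        affected_rows
      }
      second: insert_user(objects: [{ email: "b@b.com", name: "bob" }]) {
        affected_rows
      }
    }
  `
}).then(result => console.log(result.data));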

@paf31 deleted the 1812 branch on December 26, 2019 at 18:16
@lexi-lambda (Contributor) commented:

@thefliik I wasn’t really familiar with the way Apollo’s batching works, but I just read up on it a little bit. It seems like the purpose of batching is really just to reduce the number of concurrent in-flight HTTP requests… is that accurate?

If it is, then I admit I’m a little skeptical of its value. It seems like there are already two better solutions to that problem:

  1. HTTP/2 makes concurrent requests to the same server basically free, so HTTP/2 is a much more general solution to this problem at the transport layer. Now, admittedly, graphql-engine doesn’t support HTTP/2 right now… but maybe it could?

  2. In the meantime you can use a websocket if you’re sending tons and tons of separate queries.

In any case, I agree that this PR is inconsistent with the way the Apollo client evidently expects batching to work, so I think we should probably back this change out or change the way it works (as right now it implements a protocol no client is likely to implement). But the above points make me wonder if it’s worth supporting at all. Is there any reason you think that HTTP/2 support wouldn’t be enough to subsume this mechanism?

@paf31 (Contributor, Author) commented Dec 26, 2019

I'm also unclear on the benefits of batching vs a transport layer solution, but after some thought I've convinced myself that if we support batching at all, it should have the semantics you describe. While it's possible to send some nonsensical requests, it's no worse than what you could do with non-serialized requests without batching (e.g. multiple threads of execution in JS sending requests independently), and I think it would be fine for the semantics to emulate that behavior. That is, we pretend we received N independent requests in order.

I was previously concerned because for some reason I thought the different operations could come from the same thread of execution, and therefore could depend on each other. However, the only way I think you could get into this situation is something like this:

client.mutate({
  ...
}).then(...);
  
client.mutate({
  ...
}).then(...);

or

client.mutate({
  ...
}).then(x => client.mutate({
  ...
}).then(...));

In the first case, there is no way for them to depend on each other's results, and in the second, the two requests would never occur in the same batch, due to the data dependency.

@jorroll commented Dec 27, 2019

> It seems like the purpose of batching is really just to reduce the number of concurrent in-flight HTTP requests… is that accurate?
>
> If it is, then I admit I’m a little skeptical of its value. It seems like there are already two better solutions to that problem:

@lexi-lambda @paf31 I believe you are correct. For reference, you can read this Apollo blog post about batching, which specifically cites HTTP/2 as an alternative to batching: https://blog.apollographql.com/batching-client-graphql-queries-a685f5bcd41b.

  • That blog post also lists some reasons why HTTP/2 might be preferable to (and faster than) batching: a batch is as slow as its slowest query, whereas HTTP/2 returns each result separately.
    • Though I do wonder how HTTP/2 would perform on a mobile connection, where one larger query might be preferable to many smaller ones.

I'm speculating, but I think batching exists for folks who want to use plain old HTTP (for whatever reason). It's also important to know that batching was introduced by Apollo back in 2016, well before the GraphQL spec included subscriptions and, I think, well before most folks were using GraphQL over websockets. I just did a cursory check, and it looks like a draft of the HTTP/2 spec was introduced in 2015, so it's also unlikely anyone was using that when batching was introduced. (I think I read somewhere that HTTP/2 support wasn't included in Node until Node 10, released in 2018. I'll also point out that Google Firebase Functions don't even officially support Node 10 yet; support is currently in beta.)

Anyway, I'm not familiar with the HTTP/2 spec, but from my perspective there still might be cause to support ApolloClient batching if there are reasons why Hasura users couldn't use HTTP/2 or websockets. In general, I could imagine websockets being problematic for someone who needs a stateless connection. For example, if the Hasura server were invoked like a serverless function (using something like Google Cloud Run), then websockets wouldn't be an option (I'm not sure about HTTP/2).
