do not reparse JSON responses in a loop #172

Geal · 2021-11-22T10:44:44Z

Potential fix for #144
related: #33 #80

with very large responses, we were looping over HTTP chunks by
accumulating them, trying to parse the JSON response, then go for
another iteration if it was not enough, so we end up parsing the same
data very frequently

This commit first accumulates the data entirely, then parses it

⚠️ this potentially breaks @stream: the previous code was trying to recognize an entire json response, return it, then try parsing another json response if there's still some data, and return a stream of responses. From what I see the code could not handle stream responses anyway since multipart is expected: https://github.com/graphql/graphql-over-http/blob/main/rfcs/IncrementalDelivery.md#content-type-multipartmixed
We should investigate that
Edit: @stream is not currently supported, so this can be merged right now, and we'll modify once we do it (we're keeping the Stream of responses to that end)

performance results

main `d9b3c43`

Summary:                                                     
  Total:        200.0049 secs                                
  Slowest:      20.0006 secs                                 
  Fastest:      9.0386 secs                                  
  Average:      19.1225 secs                                 
  Requests/sec: 2.4999                                       
                                                             
                                                             
Response time histogram:                                     
  9.039 [1]     |                                            
  10.135 [1]    |                                            
  11.231 [1]    |                                            
  12.327 [11]   |■                                           
  13.423 [9]    |■                                           
  14.520 [13]   |■                                           
  15.616 [10]   |■                                           
  16.712 [15]   |■                                           
  17.808 [18]   |■■                                          
  18.904 [9]    |■                                           
  20.001 [412]  |■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■    
                                                             
                                                             
Latency distribution:                                        
  10% in 15.9425 secs                                        
  25% in 20.0001 secs                                        
  50% in 20.0002 secs                                        
  75% in 20.0002 secs                                        
  90% in 20.0002 secs                                        
  95% in 20.0002 secs                                        
  99% in 20.0003 secs                                        
                                                             
Details (average, fastest, slowest):                         
  DNS+dialup:   0.0003 secs, 9.0386 secs, 20.0006 secs       
  DNS-lookup:   0.0000 secs, 0.0000 secs, 0.0000 secs        
  req write:    0.0001 secs, 0.0000 secs, 0.0008 secs        
  resp wait:    6.8858 secs, 0.0001 secs, 19.0536 secs       
  resp read:    12.2363 secs, 0.9462 secs, 19.9999 secs      
                                                             
Status code distribution:                                    
  [200] 500 responses

spending most of the time deserializing strings (I was testing a products subgraph where the name is 40MB long)

this PR

Summary:                                                  
  Total:        26.1397 secs                              
  Slowest:      5.3922 secs                               
  Fastest:      0.2809 secs                               
  Average:      2.4454 secs                               
  Requests/sec: 19.1280                                   
                                                          
                                                          
Response time histogram:                                  
  0.281 [1]     |                                         
  0.792 [6]     |■■                                       
  1.303 [17]    |■■■■■                                    
  1.814 [78]    |■■■■■■■■■■■■■■■■■■■■■■                   
  2.325 [145]   |■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■ 
  2.837 [115]   |■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■■         
  3.348 [78]    |■■■■■■■■■■■■■■■■■■■■■■                   
  3.859 [35]    |■■■■■■■■■■                               
  4.370 [14]    |■■■■                                     
  4.881 [8]     |■■                                       
  5.392 [3]     |■                                        
                                                          
                                                          
Latency distribution:                                     
  10% in 1.5380 secs                                      
  25% in 1.9118 secs                                      
  50% in 2.3362 secs                                      
  75% in 2.9391 secs                                      
  90% in 3.4465 secs                                      
  95% in 3.8630 secs                                      
  99% in 4.6872 secs                                      
                                                          
Details (average, fastest, slowest):                      
  DNS+dialup:   0.0002 secs, 0.2809 secs, 5.3922 secs     
  DNS-lookup:   0.0000 secs, 0.0000 secs, 0.0000 secs     
  req write:    0.0001 secs, 0.0000 secs, 0.0072 secs     
  resp wait:    0.4656 secs, 0.0002 secs, 2.0409 secs     
  resp read:    1.9795 secs, 0.2806 secs, 4.6103 secs     
                                                          
Status code distribution:                                 
  [200] 500 responses

There's definitely an improvement on large responses. It probably won't have a big impact on small responses that can fit in one chunk

with very large responses, we were looping over HTTP chunks by accumulating them, trying to parse the JSON response, then go for another iteration if it was not enough, so we end up parsing the same data very frequently This commit first accumulates the data entirely, then parses it

BrynCooke

LGTM

o0Ignition0o

lgtm!

o0Ignition0o · 2021-11-22T15:23:02Z

crates/apollo-router/src/http_subgraph.rs

-                    None
-                }
-            },
+                serde_json::from_slice::<graphql::Response>(&current_payload_bytes).unwrap_or_else(


yay for unwrap or else :D

Geal force-pushed the handle-large-responses branch 2 times, most recently from 7f6cf45 to 7c609ad Compare November 22, 2021 13:43

Geal force-pushed the handle-large-responses branch from 7c609ad to 87259f8 Compare November 22, 2021 13:53

Geal marked this pull request as ready for review November 22, 2021 14:14

Geal requested review from o0Ignition0o and BrynCooke November 22, 2021 14:14

Geal mentioned this pull request Nov 22, 2021

Performance of JSON manipulation #173

Open

8 tasks

avoid a match

e15507e

BrynCooke approved these changes Nov 22, 2021

View reviewed changes

o0Ignition0o approved these changes Nov 22, 2021

View reviewed changes

Geal merged commit 61b5a8d into main Nov 22, 2021

Geal deleted the handle-large-responses branch November 22, 2021 15:24

This was referenced Nov 22, 2021

Slow response times with large documents #144

Closed

avoid cloning entities when building the response #132

Merged

Geal self-assigned this Dec 1, 2021

tinnou pushed a commit to Netflix-Skunkworks/router that referenced this pull request Oct 16, 2023

release: router-bridge@v0.1.3 (apollographql#172)

f0c3799

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

do not reparse JSON responses in a loop #172

do not reparse JSON responses in a loop #172

Geal commented Nov 22, 2021 •

edited

Loading

BrynCooke left a comment

o0Ignition0o left a comment

o0Ignition0o Nov 22, 2021

do not reparse JSON responses in a loop #172

do not reparse JSON responses in a loop #172

Conversation

Geal commented Nov 22, 2021 • edited Loading

performance results

main d9b3c43

this PR

BrynCooke left a comment

Choose a reason for hiding this comment

o0Ignition0o left a comment

Choose a reason for hiding this comment

o0Ignition0o Nov 22, 2021

Choose a reason for hiding this comment

Geal commented Nov 22, 2021 •

edited

Loading

main `d9b3c43`