Fix race condition in AdaptiveServerSelection and misc fixes #13104

Merged
merged 1 commit into apache:master on May 7, 2024

Conversation

@vvivekiyer (Contributor) commented May 7, 2024

Contains 2 fixes:

1. Adaptive Server Selection - race condition:

A race condition between Jetty threads and Netty threads can set numInFlightRequests to a negative value for a server. This can result in that server being overloaded compared to the others.

It's difficult to reproduce, but the race condition is apparent from reading the code.

The race condition is explained below. Let's say a query is routed to 2 servers, S1 and S2, and the query has a timeout of 1s. The race condition timeline is as follows:
T1: Query is routed to S1 and S2. The ADSS stats will look as follows:
S1 Stats = { numInFlightRequests = 1 } S2 Stats = { numInFlightRequests = 1 }

T2: S1 responds with the results (dataTable). The ADSS stats will be updated to look as follows. Note that this update is made by the Netty thread that receives the response.
S1 Stats = { numInFlightRequests = 0 } S2 Stats = { numInFlightRequests = 1 }

T3: Let's say the query times out. Per the existing code, the Jetty thread will update the ADSS stats for S2 to look as follows:
S1 Stats = { numInFlightRequests = 0 } S2 Stats = { numInFlightRequests = 0 }

T4: Before the Jetty thread removes the QueryResponse object for the request, server S2 could respond, and the corresponding Netty thread would incorrectly update the ADSS stats again to look as follows (see the sketch after the list of fixes below for an illustration):
S1 Stats = { numInFlightRequests = 0 } S2 Stats = { numInFlightRequests = -1 }

2. Updates the client error list to add a few more exceptions.
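
As an illustration of fix 1, here is a minimal sketch of a once-only guard around the in-flight counter. The names (QueryTracker, onQueryRouted, onQueryFinished) are hypothetical and not the actual Pinot AsyncQueryResponse/ADSS code; the idea is simply that whichever thread reaches the guard first (the Netty response handler or the Jetty timeout path) records the completion, so the counter can never be decremented twice and go negative.

import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical sketch, not Pinot code: record query completion for a server exactly once.
public class QueryTracker {
  // numInFlightRequests per server, as tracked by the adaptive server selection stats.
  private final ConcurrentMap<String, AtomicInteger> _numInFlightRequests = new ConcurrentHashMap<>();
  // One flag per routed server for the current query; the first thread to flip it wins.
  private final ConcurrentMap<String, AtomicBoolean> _completionRecorded = new ConcurrentHashMap<>();

  // Called when the query is routed to a server (T1 in the timeline above).
  public void onQueryRouted(String server) {
    _numInFlightRequests.computeIfAbsent(server, s -> new AtomicInteger()).incrementAndGet();
    _completionRecorded.put(server, new AtomicBoolean(false));
  }

  // Called by the Netty thread on a response (T2/T4) or by the Jetty thread on a timeout (T3).
  // Without the compareAndSet guard, both threads can decrement and the count can reach -1.
  public void onQueryFinished(String server) {
    AtomicBoolean recorded = _completionRecorded.get(server);
    if (recorded != null && recorded.compareAndSet(false, true)) {
      _numInFlightRequests.get(server).decrementAndGet();
    }
  }
}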

@codecov-commenter commented May 7, 2024

Codecov Report

Attention: Patch coverage is 50.00000%, with 1 line in your changes missing coverage. Please review.

Project coverage is 62.11%. Comparing base (59551e4) to head (b2724d6).
Report is 416 commits behind head on master.

Files | Patch % | Lines
...pache/pinot/core/transport/AsyncQueryResponse.java | 50.00% | 0 Missing and 1 partial ⚠️
Additional details and impacted files
@@             Coverage Diff              @@
##             master   #13104      +/-   ##
============================================
+ Coverage     61.75%   62.11%   +0.36%     
+ Complexity      207      198       -9     
============================================
  Files          2436     2514      +78     
  Lines        133233   137786    +4553     
  Branches      20636    21319     +683     
============================================
+ Hits          82274    85583    +3309     
- Misses        44911    45787     +876     
- Partials       6048     6416     +368     
Flag | Coverage Δ
custom-integration1 | <0.01% <0.00%> (-0.01%) ⬇️
integration | <0.01% <0.00%> (-0.01%) ⬇️
integration1 | <0.01% <0.00%> (-0.01%) ⬇️
integration2 | 0.00% <0.00%> (ø)
java-11 | 62.11% <50.00%> (+0.40%) ⬆️
java-21 | <0.01% <0.00%> (-61.63%) ⬇️
skip-bytebuffers-false | 62.11% <50.00%> (+0.36%) ⬆️
skip-bytebuffers-true | 0.00% <0.00%> (-27.73%) ⬇️
temurin | 62.11% <50.00%> (+0.36%) ⬆️
unittests | 62.10% <50.00%> (+0.36%) ⬆️
unittests1 | 46.77% <50.00%> (-0.12%) ⬇️
unittests2 | 27.74% <0.00%> (+0.01%) ⬆️

Flags with carried forward coverage won't be shown.


@jackjlli (Member) left a comment

LGTM. Thanks for making the hotfix!

@somandal (Contributor) left a comment

lgtm

@@ -152,12 +153,6 @@ void receiveDataTable(ServerRoutingInstance serverRoutingInstance, DataTable dat
ServerResponse response = _responseMap.get(serverRoutingInstance);
response.receiveDataTable(dataTable, responseSize, deserializationTimeMs);

// Record query completion stats immediately after receiving the response from the server instead of waiting
@jasperjiaguo (Contributor) commented May 7, 2024

IIUC, if we remove this discount upon each receiveDataTable and rely only on the one in getFinalResponses, the performance of all fanned-out servers is determined by the slowest one among them. Do you think it would cause inaccuracy where we overestimate the load on some servers, whether or not a timeout happens?

@vvivekiyer (Contributor, Author) replied

Good observation.

I see this resulting in more time taken to warm up/ramp up - that's the reason we had this piece of code earlier. With this approach, we'll be more conservative and avoid overloading servers (because we assume that no server has responded until the last server responds).

Achieving both would be a hairier change, considering the interaction between Netty/Jetty threads. We can revisit this logic depending on the behavior we see in our environment.
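
To make the trade-off above concrete, here is a toy timeline (hypothetical counters only, not the actual AsyncQueryResponse logic) showing that with accounting deferred to the end of the query, a fast server's in-flight count stays elevated until the slowest server responds or the query times out:

import java.util.concurrent.atomic.AtomicInteger;

public class DeferredAccountingTimeline {
  public static void main(String[] args) {
    // Hypothetical counters mirroring the T1-T4 timeline from the PR description.
    AtomicInteger s1InFlight = new AtomicInteger(0);
    AtomicInteger s2InFlight = new AtomicInteger(0);

    // T1: query routed to both servers.
    s1InFlight.incrementAndGet();
    s2InFlight.incrementAndGet();

    // T2: S1 responds early. With accounting deferred to the end of the query,
    // nothing is decremented yet, so S1 still looks busy for this window -- the
    // over-estimation the reviewer is asking about.

    // T3: the query completes (last response or timeout); every routed server is
    // decremented exactly once, so no counter can ever go negative.
    s1InFlight.decrementAndGet();
    s2InFlight.decrementAndGet();

    System.out.println("S1 = " + s1InFlight.get() + ", S2 = " + s2InFlight.get()); // S1 = 0, S2 = 0
  }
}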

@vvivekiyer merged commit 6a73450 into apache:master May 7, 2024
20 checks passed
@jadami10 (Contributor) commented

@vvivekiyer thank you for working on this fix! This actually bit us a month ago (only time in ~2 years), but we restarted the brokers before grabbing the routing stats, so we couldn't root cause.
