[timeseries] Part-4: Complete Support for Multi-Server Queries #14676

ankitsultana · 2024-12-17T19:08:13Z

Finishes the ongoing work to support Time Series queries on tables that can have numInstancesPerReplicaGroup > 1.

The Timeseries Quickstart also now starts with 2 servers.

Also tested in one of our smaller clusters and we are able to get 50-100 QPS consistently.

codecov-commenter · 2024-12-17T19:47:27Z

Codecov Report

Attention: Patch coverage is 3.44828% with 196 lines in your changes missing coverage. Please review.

Project coverage is 63.97%. Comparing base (59551e4) to head (245d1ff).
Report is 1505 commits behind head on master.

Files with missing lines	Patch %	Lines
...va/org/apache/pinot/query/runtime/QueryRunner.java	0.00%	37 Missing ⚠️
.../pinot/query/service/dispatch/QueryDispatcher.java	2.85%	34 Missing ⚠️
...ispatch/timeseries/TimeSeriesDispatchObserver.java	0.00%	28 Missing ⚠️
...imeseries/PhysicalTimeSeriesBrokerPlanVisitor.java	8.00%	23 Missing ⚠️
...pinot/tsdb/planner/TimeSeriesQueryEnvironment.java	0.00%	23 Missing ⚠️
.../pinot/tsdb/planner/physical/TableScanVisitor.java	0.00%	15 Missing ⚠️
...b/planner/physical/TimeSeriesDispatchablePlan.java	0.00%	15 Missing ⚠️
...imeseries/PhysicalTimeSeriesServerPlanVisitor.java	0.00%	10 Missing ⚠️
...roker/requesthandler/TimeSeriesRequestHandler.java	0.00%	4 Missing ⚠️
...runtime/timeseries/TimeSeriesExecutionContext.java	50.00%	4 Missing ⚠️
... and 3 more

Additional details and impacted files

@@             Coverage Diff              @@
##             master   #14676      +/-   ##
============================================
+ Coverage     61.75%   63.97%   +2.22%     
- Complexity      207     1608    +1401     
============================================
  Files          2436     2707     +271     
  Lines        133233   149242   +16009     
  Branches      20636    22871    +2235     
============================================
+ Hits          82274    95484   +13210     
- Misses        44911    46763    +1852     
- Partials       6048     6995     +947

Flag	Coverage Δ
custom-integration1	`100.00% <ø> (+99.99%)`	⬆️
integration	`100.00% <ø> (+99.99%)`	⬆️
integration1	`100.00% <ø> (+99.99%)`	⬆️
integration2	`0.00% <ø> (ø)`
java-11	`63.93% <3.44%> (+2.22%)`	⬆️
java-21	`63.87% <3.44%> (+2.24%)`	⬆️
skip-bytebuffers-false	`63.95% <3.44%> (+2.20%)`	⬆️
skip-bytebuffers-true	`63.84% <3.44%> (+36.12%)`	⬆️
temurin	`63.97% <3.44%> (+2.22%)`	⬆️
unittests	`63.97% <3.44%> (+2.22%)`	⬆️
unittests1	`56.28% <3.51%> (+9.39%)`	⬆️
unittests2	`34.43% <1.47%> (+6.70%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

tibrewalpratik17

LGTM!

raghavyadav01 · 2024-12-26T18:48:05Z

pinot-query-runtime/src/main/java/org/apache/pinot/query/runtime/QueryRunner.java

-                  StandardCharsets.UTF_8))
-              .build();
-          responseObserver.onNext(response);
+          for (int index = 0; index < fragmentOpChains.size(); index++) {


Will this be blocking till all the fragments are executed? What happens when query times out?

raghavyadav01 · 2024-12-26T18:51:01Z

...main/java/org/apache/pinot/query/runtime/timeseries/PhysicalTimeSeriesBrokerPlanVisitor.java

+    if (planNode instanceof LeafTimeSeriesPlanNode) {
+      throw new IllegalStateException("Found leaf time series plan node in broker");
+    } else if (planNode instanceof TimeSeriesExchangeNode) {
+      int numInputServers = numInputServersByExchangeNode.get(planNode.getId());


How will the input servers be computed?

raghavyadav01 · 2024-12-26T18:54:27Z

pinot-query-runtime/src/main/java/org/apache/pinot/query/service/dispatch/QueryDispatcher.java

+    }
+  }
+
+  TimeSeriesBlock submitAndGet(long requestId, TimeSeriesDispatchablePlan plan, long timeoutMs,


Can you share short blurb about this method?

raghavyadav01 · 2024-12-26T18:55:12Z

...main/java/org/apache/pinot/query/service/dispatch/timeseries/TimeSeriesDispatchObserver.java

+   * buffer the data sent by the sender. This is set large enough that we should never hit this for any practical
+   * use-case, while guarding us against bugs.
+   */
+  public static final int MAX_QUEUE_CAPACITY = 4096;


Is this a per server Queue ?

raghavyadav01 · 2024-12-26T18:57:26Z

...main/java/org/apache/pinot/query/service/dispatch/timeseries/TimeSeriesDispatchObserver.java

@@ -30,37 +35,57 @@
 *   engine integration.
 */
 public class TimeSeriesDispatchObserver implements StreamObserver<Worker.TimeSeriesResponse> {


Now as Broker is doing reduce work ? Can broker become bottleneck? Did you perform any analysis for broker in your cluster testing?

raghavyadav01 · 2024-12-26T18:59:38Z

...meseries-planner/src/main/java/org/apache/pinot/tsdb/planner/TimeSeriesQueryEnvironment.java

+      List<BaseTimeSeriesPlanNode> planNodes, Map<String, Map<String, List<String>>> leafIdToSegmentsByInstanceId) {
+    // TODO(timeseries): Handle this gracefully and return an empty block.
+    Preconditions.checkState(!serverInstances.isEmpty(), "No servers selected for the query");
+    if (serverInstances.size() == 1) {


Why do we need to differentiate between single server vs multi server? Shouldn't it be transparent?

[timeseries] Part-4: Complete Support for Multi-Server Queries

d668f3a

ankitsultana added the timeseries-engine Tracking tag for generic time-series engine work label Dec 17, 2024

Fix test + Init TimeSeriesBuilderFactoryProvider in the Broker

9b6e5a0

ankitsultana changed the title ~~[WIP] [timeseries] Part-4: Complete Support for Multi-Server Queries~~ [timeseries] Part-4: Complete Support for Multi-Server Queries Dec 19, 2024

ankitsultana added 2 commits December 19, 2024 20:18

avoid linked queue because GC

1d5fa8a

cleanup code + fix single server handling

245d1ff

ankitsultana marked this pull request as ready for review December 20, 2024 02:09

tibrewalpratik17 approved these changes Dec 24, 2024

View reviewed changes

ankitsultana merged commit a6a3b40 into apache:master Dec 25, 2024
21 checks passed

raghavyadav01 reviewed Dec 26, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[timeseries] Part-4: Complete Support for Multi-Server Queries #14676

[timeseries] Part-4: Complete Support for Multi-Server Queries #14676

ankitsultana commented Dec 17, 2024 •

edited

Loading

codecov-commenter commented Dec 17, 2024 •

edited

Loading

tibrewalpratik17 left a comment

raghavyadav01 Dec 26, 2024

raghavyadav01 Dec 26, 2024

raghavyadav01 Dec 26, 2024

raghavyadav01 Dec 26, 2024

raghavyadav01 Dec 26, 2024

raghavyadav01 Dec 26, 2024

[timeseries] Part-4: Complete Support for Multi-Server Queries #14676

[timeseries] Part-4: Complete Support for Multi-Server Queries #14676

Conversation

ankitsultana commented Dec 17, 2024 • edited Loading

codecov-commenter commented Dec 17, 2024 • edited Loading

Codecov Report

tibrewalpratik17 left a comment

Choose a reason for hiding this comment

raghavyadav01 Dec 26, 2024

Choose a reason for hiding this comment

raghavyadav01 Dec 26, 2024

Choose a reason for hiding this comment

raghavyadav01 Dec 26, 2024

Choose a reason for hiding this comment

raghavyadav01 Dec 26, 2024

Choose a reason for hiding this comment

raghavyadav01 Dec 26, 2024

Choose a reason for hiding this comment

raghavyadav01 Dec 26, 2024

Choose a reason for hiding this comment

ankitsultana commented Dec 17, 2024 •

edited

Loading

codecov-commenter commented Dec 17, 2024 •

edited

Loading