feat(stream): add two-phase stateless simple approx percentile #17873

kwannoel · 2024-07-30T09:57:40Z

I hereby agree to the terms of the RisingWave Labs, Inc. Contributor License Agreement.

What's changed and what's your intention?

We add stateless simple approx percentile executors (local and global). This is the barebones version, without any caching, and which only works when its used alone without other aggregations.

Subsequent PRs will take care of:

Cache
Keyed Merge (incl. changes to simple agg executor, to flush once per epoch).
Performance test
Fuzz test (test relative error vs percentile_cont vs shuffle approx percentile)

LocalApproxPercentile is stateless, and just construct the buckets and corresponding counts. For negative numbers we need to get the absolute value, and log that instead. Then we pass the sign in a separate column. The schema will look like:

sign	bucket_id	count
-1	1	100
0	doesn't matter	10
1	1	10

GlobalApproxPercentile stores the bucket_ids, prefixed with a sign. On barrier, we will iterate over all buckets, and output the approximated percentile.

Checklist

I have written necessary rustdoc comments
I have added necessary unit tests and integration tests
I have added test labels as necessary. See details.
I have added fuzzing tests or opened an issue to track them. (Optional, recommended for new SQL features Sqlsmith: Sql feature generation #7934).
My PR contains breaking changes. (If it deprecates some features, please create a tracking issue to remove them in the future).
All checks passed in ./risedev check (or alias, ./risedev c)
My PR changes performance-critical code. (Please run macro/micro-benchmarks and show the results.)

My PR contains critical fixes that are necessary to be merged into the latest release. (Please check out the details)

Documentation

My PR needs documentation updates. (Please use the Release note section below to summarize the impact on users)

Release note

This is an experimental feature. The interface may change in the future. Only Streaming Approx percentile is supported at the moment.

Approx Percentile is an aggregation with the following interface:

approx_percentile(DOUBLE percentile [, DOUBLE relative_error]) within group (order by percentile_column)

percentile refers to the percentile to approximate. For example, 50th, 90th will be 0.5, 0.9 respectively.
relative_error refers to how far the approximated percentile value can stray from the actual percentile value. If unspecified, it will default to 0.01 (1%).
percentile_column refers to the column we will maintain the approx percentile for. It must be of a numeric type.

github-actions

license-eye has totally checked 5253 files.

Valid	Invalid	Ignored	Fixed
2253	2	2998	0

Click to see the invalid file list

src/stream/src/executor/approx_percentile/global.rs
src/stream/src/executor/keyed_merge.rs

github-actions

license-eye has totally checked 5257 files.

Valid	Invalid	Ignored	Fixed
2256	2	2999	0

Click to see the invalid file list

src/stream/src/executor/approx_percentile/global.rs
src/stream/src/executor/keyed_merge.rs

github-actions

license-eye has totally checked 5259 files.

Valid	Invalid	Ignored	Fixed
2259	1	2999	0

Click to see the invalid file list

src/stream/src/executor/keyed_merge.rs

github-actions

license-eye has totally checked 5259 files.

Valid	Invalid	Ignored	Fixed
2259	1	2999	0

Click to see the invalid file list

src/stream/src/executor/keyed_merge.rs

src/stream/src/executor/approx_percentile/global.rs

… use reverse iterator

kwannoel · 2024-08-03T01:56:01Z

src/stream/src/executor/approx_percentile/global.rs

+        // Just iterate over the singleton vnode.
+        // TODO(kwannoel): Should we just use separate state tables for
+        // positive and negative counts?
+        // Reverse iterator is not as efficient.


TBH I'm not totally sure about this design decision.
The alternative is to split into 3 tables:

zeros

neg

pos

Then we don't have to use reverse iteration.

The current design is to group everyting in a single table, and add a sign column.
Because everything is ascending, this means that when iterating negative values, the order will be:

- 1 - 10000 - 10000000

When we want this instead:

- 10000000 - 10000 - 1

So we will need to use a separate reverse iterator for the negative values. This approach is simpler, but may be less efficient.

Can we just use an empty prefix to scan the state table? The order should start from neg to pos.

No it doesn't work. Because we aren't just iterating on negative values.

Consider the following is in the state table (with the count column elided)

sign bucket_id

-1 -3

-1 -1

-1 1

-1 2

0 0

1 1

1 2

Our iteration order will be:

(-1, -3), (-1,-1), (-1, 1), (-1, 2), (0,0), (+1, 1), (+1, 2)

But we actually want

(-1, 2), (-1,1), (-1, -1), (-1, -3), (0,0), (+1, 1), (+1, 2)

Because as an example:

-(base^2) < -(base ^1)

I see. Actually, we can do a math trick here, for example, encoding bucket_id as - bucket_id for sign with -1. Then we will get the order we want.

I see. Actually, we can do a math trick here, for example, encoding bucket_id as - bucket_id for sign with -1. Then we will get the order we want.

That's a good idea for optimization, but I intend to keep this suggestion as a future optimization, rather than change it in this PR. Currently this is just a few lines in one location:

for keyed_row in bucket_state_table .rev_iter_with_prefix(&[Datum::None; 0], &neg_bounds, PrefetchOptions::default()) .await?

Adding -1 sign, means at all areas where we encode / decode, we will need to add the logic to reverse it. And probably have to add comments there to explain why as well. Here we can just add some comments in a single spot.

Once we add caching, I think we won't do as much state table iter anymore, and so it will mitigate any performance hit from the reverse iteration of negative values.

If we add an abstraction over state_table and cache in the caching PR, we can consider adding this optimization there.

kwannoel · 2024-08-05T02:57:36Z

Bump, PTAL, the tests have passed.

src/frontend/src/optimizer/plan_node/stream_global_approx_percentile.rs

src/stream/src/executor/approx_percentile/local.rs

chenzl25 · 2024-08-07T07:13:33Z

src/frontend/src/optimizer/plan_node/stream_global_approx_percentile.rs

@@ -95,8 +97,42 @@ impl PlanTreeNodeUnary for StreamGlobalApproxPercentile {
 impl_plan_tree_node_for_unary! {StreamGlobalApproxPercentile}

 impl StreamNode for StreamGlobalApproxPercentile {
-    fn to_stream_prost_body(&self, _state: &mut BuildFragmentGraphState) -> PbNodeBody {
-        todo!()
+    fn to_stream_prost_body(&self, state: &mut BuildFragmentGraphState) -> PbNodeBody {


It seems we don't have any shuffle between the global approx percentile and the local approx percentile?

Didn't quite get it. We do have shuffle between local -> global approx percentile. May take a look at the output inside agg.yaml. There we have:

├─StreamGlobalApproxPercentile { quantile: 0.8:Float64, relative_error: 0.01:Float64 } │ └─StreamExchange { dist: Single } │ └─StreamLocalApproxPercentile { percentile_col: $expr1, quantile: 0.8:Float64, relative_error: 0.01:Float64 }

Oh, I see it. My main branch is out of date.

BTW, I think StreamMerge inputs could be changed from binary inputs to multi inputs like StreamUnion. Maybe the in next PR.

src/stream/src/executor/approx_percentile/global.rs

chenzl25

LGTM!

github-actions bot added the Invalid PR Title label Jul 30, 2024

kwannoel changed the title ~~interim commit: add local approx percentile~~ feat(stream): add two-phase stateless simple approx percentile Jul 30, 2024

github-actions bot added type/feature and removed Invalid PR Title labels Jul 30, 2024

github-actions bot reviewed Jul 30, 2024

View reviewed changes

kwannoel force-pushed the kwannoel/approx-percentile-simple-two-phase branch from 6a0aa64 to 689ec0d Compare August 1, 2024 01:48

github-actions bot reviewed Aug 1, 2024

View reviewed changes

kwannoel commented Aug 1, 2024

View reviewed changes

src/stream/src/executor/approx_percentile/global.rs Outdated Show resolved Hide resolved

kwannoel force-pushed the kwannoel/approx-percentile-simple-two-phase branch from c5a8387 to b15d957 Compare August 2, 2024 01:35

kwannoel added 16 commits August 2, 2024 22:10

support local stateless approx percentile

1be08ab

handle chunk

3d8f85b

handle barrier

1d2626d

add local approx percentile proto

c8c900f

from_proto for global

1170acc

convert plans to proto

19060d6

fmt

48b3b46

defer keyed merge

0bd820d

interim commit: adding tests but failing

356763b

revert some debug in global

a437469

minor

768a70a

add more test, fix bugs in calculating percentile

5b0af78

support negative, but needs some fixes still, specifically we need to…

c362ed2

… use reverse iterator

properly handle neg

b2d92ba

revert debug stmts

5178950

remove some fixme

c134477

kwannoel force-pushed the kwannoel/approx-percentile-simple-two-phase branch from d0e8852 to c134477 Compare August 2, 2024 14:10

kwannoel requested review from stdrc and fuyufjh August 2, 2024 14:14

kwannoel requested review from BugenZhao and st1page August 2, 2024 14:14

kwannoel marked this pull request as ready for review August 2, 2024 14:14

kwannoel added the user-facing-changes Contains changes that are visible to users label Aug 2, 2024

kwannoel mentioned this pull request Aug 2, 2024

feat(expr): support shuffle approx_percentile #17814

Merged

9 tasks

kwannoel added 2 commits August 2, 2024 22:23

fmt

dbe8018

more fmt

a4d68b3

graphite-app bot requested a review from a team August 2, 2024 14:46

drop table and mv

6c93832

graphite-app bot requested a review from a team August 3, 2024 00:20

kwannoel commented Aug 3, 2024

View reviewed changes

stdrc reviewed Aug 6, 2024

View reviewed changes

src/frontend/src/optimizer/plan_node/stream_global_approx_percentile.rs Outdated Show resolved Hide resolved

src/frontend/src/optimizer/plan_node/stream_global_approx_percentile.rs Show resolved Hide resolved

src/stream/src/executor/approx_percentile/local.rs Show resolved Hide resolved

fix comments

84c656f

kwannoel mentioned this pull request Aug 6, 2024

Tracking: Approx Percentile #17531

Open

26 tasks

kwannoel requested a review from chenzl25 August 7, 2024 02:41

chenzl25 reviewed Aug 7, 2024

View reviewed changes

src/stream/src/executor/approx_percentile/global.rs Outdated Show resolved Hide resolved

ignore watermarks

66ecdea

kwannoel force-pushed the kwannoel/approx-percentile-simple-two-phase branch from bc3a768 to 66ecdea Compare August 7, 2024 09:16

kwannoel requested review from chenzl25 and stdrc August 8, 2024 02:50

This comment was marked as duplicate.

Sign in to view

chenzl25 approved these changes Aug 8, 2024

View reviewed changes

kwannoel added this pull request to the merge queue Aug 8, 2024

Merged via the queue into main with commit c315942 Aug 8, 2024
32 of 33 checks passed

kwannoel deleted the kwannoel/approx-percentile-simple-two-phase branch August 8, 2024 05:25

BugenZhao mentioned this pull request Sep 18, 2024

risingwave 2.0.0 risingwavelabs/homebrew-risingwave#44

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(stream): add two-phase stateless simple approx percentile #17873

feat(stream): add two-phase stateless simple approx percentile #17873

kwannoel commented Jul 30, 2024 •

edited

Loading

github-actions bot left a comment

github-actions bot left a comment

github-actions bot left a comment

github-actions bot left a comment

kwannoel Aug 3, 2024 •

edited

Loading

chenzl25 Aug 7, 2024

kwannoel Aug 7, 2024 •

edited

Loading

chenzl25 Aug 7, 2024

kwannoel Aug 8, 2024 •

edited

Loading

kwannoel commented Aug 5, 2024

chenzl25 Aug 7, 2024

kwannoel Aug 7, 2024

chenzl25 Aug 7, 2024

chenzl25 Aug 7, 2024 •

edited

Loading

This comment was marked as duplicate.

chenzl25 left a comment

feat(stream): add two-phase stateless simple approx percentile #17873

feat(stream): add two-phase stateless simple approx percentile #17873

Conversation

kwannoel commented Jul 30, 2024 • edited Loading

What's changed and what's your intention?

Checklist

Documentation

Release note

github-actions bot left a comment

Choose a reason for hiding this comment

github-actions bot left a comment

Choose a reason for hiding this comment

github-actions bot left a comment

Choose a reason for hiding this comment

github-actions bot left a comment

Choose a reason for hiding this comment

kwannoel Aug 3, 2024 • edited Loading

Choose a reason for hiding this comment

chenzl25 Aug 7, 2024

Choose a reason for hiding this comment

kwannoel Aug 7, 2024 • edited Loading

Choose a reason for hiding this comment

chenzl25 Aug 7, 2024

Choose a reason for hiding this comment

kwannoel Aug 8, 2024 • edited Loading

Choose a reason for hiding this comment

kwannoel commented Aug 5, 2024

chenzl25 Aug 7, 2024

Choose a reason for hiding this comment

kwannoel Aug 7, 2024

Choose a reason for hiding this comment

chenzl25 Aug 7, 2024

Choose a reason for hiding this comment

chenzl25 Aug 7, 2024 • edited Loading

Choose a reason for hiding this comment

This comment was marked as duplicate.

chenzl25 left a comment

Choose a reason for hiding this comment

kwannoel commented Jul 30, 2024 •

edited

Loading

kwannoel Aug 3, 2024 •

edited

Loading

kwannoel Aug 7, 2024 •

edited

Loading

kwannoel Aug 8, 2024 •

edited

Loading

chenzl25 Aug 7, 2024 •

edited

Loading