planner: eliminate aggregation with distinct #16581

SeaRise · 2020-04-19T07:55:40Z

Description

For table t (a int, b int, key int) with rows:

(10, 20, 10)
(10, 10, 10)
(20, 10, 13)

select count(distinct a), sum(b) from t group by key

==>

select count(a), sum(sum_b)
from (
	select a, sum(b) as sum_b, gid, key from (
		Expand(
			projections = [
				(a, null, key, 0),
				(null, b, key, 1)
			],
			t)
	)
	group by a, gid, key
)
group by key

Expand extends t to t_extend(a int, b int, key int, gid int):

(10, null, 10, 0)
(null, 20, 10, 1)

(10, null, 10, 0)
(null, 10, 10, 1)

(20, null, 13, 0)
(null, 10, 13, 1)
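
To make the row layout above concrete, here is a minimal sketch of what an Expand step does to the sample rows. The function name `expand` and the tuple layout are assumptions made for illustration only; this is not TiDB or Spark code.

```python
from typing import Iterable, Iterator, Optional, Tuple

Row = Tuple[int, int, int]  # (a, b, key)
ExpandedRow = Tuple[Optional[int], Optional[int], int, int]  # (a, b, key, gid)


def expand(rows: Iterable[Row]) -> Iterator[ExpandedRow]:
    """Emit one output row per projection for every input row (hypothetical helper)."""
    for a, b, key in rows:
        yield (a, None, key, 0)  # projection 0 keeps a, nulls out b
        yield (None, b, key, 1)  # projection 1 keeps b, nulls out a


t = [(10, 20, 10), (10, 10, 10), (20, 10, 13)]
t_extend = list(expand(t))
for row in t_extend:
    print(row)
```

Each input row produces exactly two output rows, so t_extend has twice as many rows as t.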

Spark SQL uses an Expand operator to extend t to t_extend.
We can get the same result with SQL as follows:

	mem_table(gid int)
	(0)
	(1)

select if(gid = 0, a, null), if(gid = 1, b, null), key, gid from t join mem_table

or implement a more efficient operator like Expand to extend t to t_extend.
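
The join-based emulation can be checked on the three sample rows. The sketch below uses SQLite through Python's `sqlite3` module purely as a stand-in engine (an assumption, not TiDB); `CASE` expressions replace `if(...)`, and the column name `key` is double-quoted because KEY is a keyword.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE t (a INT, b INT, "key" INT);
    INSERT INTO t VALUES (10, 20, 10), (10, 10, 10), (20, 10, 13);
    CREATE TABLE mem_table (gid INT);
    INSERT INTO mem_table VALUES (0), (1);
""")

# Original query with a DISTINCT aggregate.
original = conn.execute("""
    SELECT "key", COUNT(DISTINCT a), SUM(b)
    FROM t GROUP BY "key" ORDER BY "key"
""").fetchall()

# Rewritten query: cross join emulates Expand, then two plain aggregations.
rewritten = conn.execute("""
    SELECT "key", COUNT(a), SUM(sum_b)
    FROM (
        SELECT a, SUM(b) AS sum_b, gid, "key"
        FROM (
            SELECT CASE WHEN gid = 0 THEN a END AS a,
                   CASE WHEN gid = 1 THEN b END AS b,
                   "key", gid
            FROM t CROSS JOIN mem_table
        )
        GROUP BY a, gid, "key"
    )
    GROUP BY "key" ORDER BY "key"
""").fetchall()

print(original)
print(rewritten)
```

On this data both queries return the same rows. That only confirms the rewrite for the sample table; it says nothing about how either plan would perform in TiDB.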

Score

1200

SIG slack channel（must）:

Contact us in channel #sig-planner of TiDB Community

Mentor(must)

@winoros

Recommended Skills：

Relational algebra

iontang · 2020-10-03T16:43:01Z

/pick-up

ti-challenge-bot · 2020-10-03T16:43:05Z

Pick up success.

ti-challenge-bot · 2020-10-10T17:38:34Z

@william0423 You did not submit PR within 7 days, so give up automatically.

iontang · 2020-10-11T13:30:25Z

/pick-up

ti-challenge-bot · 2020-10-11T13:30:30Z

Pick up success.

ti-challenge-bot · 2020-10-18T13:38:36Z

@william0423 You did not submit PR within 7 days, so give up automatically.

pingyu · 2020-11-10T13:12:25Z

/pick-up

ti-challenge-bot · 2020-11-10T13:12:34Z

Pick up success.

ti-challenge-bot · 2020-11-17T13:59:14Z

@pingyu You did not submit PR within 7 days, so give up automatically.

pingyu · 2020-11-17T14:15:42Z

/pick-up

ti-challenge-bot · 2020-11-17T14:15:50Z

Pick up success.

ti-challenge-bot · 2020-11-24T14:59:16Z

@pingyu You did not submit PR within 7 days, so give up automatically.

hidehalo · 2020-11-29T08:47:56Z

/pick-up

ti-challenge-bot · 2020-11-29T08:48:03Z

Pick up success.

ti-challenge-bot · 2020-12-06T09:35:18Z

@hidehalo You did not submit PR within 7 days, so give up automatically.

Enochack · 2021-03-23T03:52:25Z

Sorry, I didn't get the point. Why expand the table? I found that the following SQL

select key, count(tmp.a), sum(tmp.sum_b)
from (
  select key, a, sum(b) as sum_b from t
  group by key, a
) as tmp
group by key

has equivalent semantics and is likely to perform better.
However, if we add an Expand operator like Spark SQL does, we could use it to implement the WITH ROLLUP and WITH CUBE clauses.
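
As a quick check of the two-phase query above, the following sketch runs it next to the original query on the three sample rows from the issue description. SQLite via Python's `sqlite3` is used only as a convenient stand-in (an assumption, not TiDB), and `key` is double-quoted because KEY is a keyword.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE t (a INT, b INT, "key" INT);
    INSERT INTO t VALUES (10, 20, 10), (10, 10, 10), (20, 10, 13);
""")

# Original query from the issue description.
original = conn.execute("""
    SELECT "key", COUNT(DISTINCT a), SUM(b)
    FROM t GROUP BY "key" ORDER BY "key"
""").fetchall()

# Two-phase aggregation without Expand: group by (key, a) first, then by key.
two_phase = conn.execute("""
    SELECT tmp."key", COUNT(tmp.a), SUM(tmp.sum_b)
    FROM (
        SELECT "key", a, SUM(b) AS sum_b
        FROM t
        GROUP BY "key", a
    ) AS tmp
    GROUP BY tmp."key" ORDER BY tmp."key"
""").fetchall()

print(original)
print(two_phase)
```

The results agree on this data, which covers a single distinct column only. The sketch does not measure performance.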

Tangruilin · 2021-12-21T06:43:50Z

I'm interested in this problem. Has it already been solved by someone else? /cc @winoros

Tangruilin · 2021-12-28T13:16:25Z

/assign

SeaRise added the type/enhancement The issue or PR belongs to an enhancement. label Apr 19, 2020

zz-jason added the sig/planner SIG: Planner label Apr 26, 2020

winoros added challenge-program high-performance labels Sep 10, 2020

ti-challenge-bot bot added the picked label Oct 3, 2020

ti-challenge-bot bot removed the picked label Oct 10, 2020

ti-challenge-bot bot added the picked label Oct 11, 2020

ti-challenge-bot bot removed the picked label Oct 18, 2020

ti-challenge-bot bot added the picked label Nov 10, 2020

ti-challenge-bot bot removed the picked label Nov 17, 2020

ti-challenge-bot bot added the picked label Nov 17, 2020

ti-challenge-bot bot removed the picked label Nov 24, 2020

ti-challenge-bot bot added the picked label Nov 29, 2020

ti-challenge-bot bot removed the picked label Dec 6, 2020

tisonkun removed the high-performance label Sep 1, 2021

ti-chi-bot assigned Tangruilin Dec 28, 2021

This was referenced Dec 15, 2022

planner, expression: support multi-distinct agg under MPP mode #39973

Merged

executor: support the grouping sets pingcap/tipb#283

Merged

AilinKid mentioned this issue Dec 26, 2022

plan, executor: implement Expand operator for grouping sets pingcap/tiflash#6545

Merged

AilinKid closed this as completed in pingcap/tipb#283 Dec 26, 2022

ti-chi-bot pushed a commit to pingcap/tiflash that referenced this issue Feb 22, 2023

plan, executor: implement Expand operator for grouping sets (#6545)

b2a445d

close #6852, ref pingcap/tidb#16581, ref pingcap/tidb#34704
