Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Do not multiply AggregateRandom in JOINs #11633

Merged
merged 3 commits into from
Nov 8, 2022
Merged

Conversation

frouioui
Copy link
Member

@frouioui frouioui commented Nov 3, 2022

Description

This Pull Request fixes the issue described in #11625. When pushing aggregation on a join, we were skipping the multiplication of results for min and max aggregations, however, we also need to skip the multiplication for random aggregations. Random aggregations simply need to get the column form one of the join's sides without multiplying it to the other side of the join.

Related Issue(s)

Checklist

  • "Backport me!" label has been added if this change should be backported
  • Tests were added or are not required
  • Documentation was added or is not required

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>
@vitess-bot
Copy link
Contributor

vitess-bot bot commented Nov 3, 2022

Review Checklist

Hello reviewers! 👋 Please follow this checklist when reviewing this Pull Request.

General

  • Ensure that the Pull Request has a descriptive title.
  • If this is a change that users need to know about, please apply the release notes (needs details) label so that merging is blocked unless the summary release notes document is included.

If a new flag is being introduced:

  • Is it really necessary to add this flag?
  • Flag names should be clear and intuitive (as far as possible)
  • Help text should be descriptive.
  • Flag names should use dashes (-) as word separators rather than underscores (_).

If a workflow is added or modified:

  • Each item in Jobs should be named in order to mark it as required.
  • If the workflow should be required, the maintainer team should be notified.

Bug fixes

  • There should be at least one unit or end-to-end test.
  • The Pull Request description should include a link to an issue that describes the bug.

Non-trivial changes

  • There should be some code comments as to why things are implemented the way they are.

New/Existing features

  • Should be documented, either by modifying the existing documentation or creating new documentation.
  • New features should have a link to a feature request issue or an RFC that documents the use cases, corner cases and test cases.

Backward compatibility

  • Protobuf changes should be wire-compatible.
  • Changes to _vt tables and RPCs need to be backward compatible.
  • vtctl command output order should be stable and awk-able.
  • RPC changes should be compatible with vitess-operator
  • If a flag is removed, then it should also be removed from VTop, if used there.

Copy link
Contributor

@arthurschreiber arthurschreiber left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

❤️

@tokikanno
Copy link
Contributor

@frouioui : A quick question about AggregateRandom, does that mean we'll get random rows from non full group by queries?

BTW: 你會說中文? :-P

@frouioui
Copy link
Member Author

frouioui commented Nov 4, 2022

A quick question about AggregateRandom, does that mean we'll get random rows from non full group by queries?

@tokikanno, the AggregateRandom does not apply a random function to the result, it simply takes the first row that MySQL returns. The results of the AggregateRandom will not be "flaky". You can see in the following code snippet how the engine primitive AggregateRandom is used:

https://github.com/planetscale/vitess/blob/adeccd2ae1e5b34da0da2fbf89debdc5ccefde2c/go/vt/vtgate/engine/ordered_aggregate.go#L531-L532

BTW: 你會說中文? :-P

我会一点点 不过我的中文不好 我的大学在北京

@frouioui
Copy link
Member Author

frouioui commented Nov 4, 2022

@tokikanno, regarding doing a patch release: we will wait until at least the 3rd week of November to do a patch release. Since we recently released v15.0.0 we want to wait until we get more bug fixes.

@systay systay added the release notes (needs details) This PR needs to be listed in the release notes in a dedicated section (deprecation notice, etc...) label Nov 4, 2022
Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>
@frouioui
Copy link
Member Author

frouioui commented Nov 7, 2022

@systay, you added the release notes (needs details) label. I have modified the release notes for all current versions of v14 and v15 to include the known issue, and modified v16's release summary to include a note concerning this fix. I'll let you re-review and unlabel the PR if everything looks good.

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>
@systay systay removed the release notes (needs details) This PR needs to be listed in the release notes in a dedicated section (deprecation notice, etc...) label Nov 8, 2022
Copy link
Member

@deepthi deepthi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍 on the release notes changes.

@frouioui frouioui merged commit d8268bf into vitessio:main Nov 8, 2022
@frouioui frouioui deleted the fix-11625 branch November 8, 2022 19:58
@vitess-bot
Copy link
Contributor

vitess-bot bot commented Nov 8, 2022

I was unable to backport this Pull Request to the following branches: release-14.0, release-15.0.

frouioui added a commit to planetscale/vitess that referenced this pull request Nov 8, 2022
* Do not multiply AggregateRandom in JOIN

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

* Update the releases notes with new known issue

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>
frouioui added a commit to planetscale/vitess that referenced this pull request Nov 8, 2022
* Do not multiply AggregateRandom in JOIN

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

* Update the releases notes with new known issue

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>
frouioui added a commit that referenced this pull request Nov 9, 2022
* Do not multiply `AggregateRandom` in `JOIN`s (#11633)

* Do not multiply AggregateRandom in JOIN

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

* Update the releases notes with new known issue

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

* Addition of the release notes for v14.0.4

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

* set sql mode in TestAggregateRandom

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

* update plan test to remove table used json object

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>
frouioui added a commit that referenced this pull request Nov 9, 2022
* Do not multiply `AggregateRandom` in `JOIN`s (#11633)

* Do not multiply AggregateRandom in JOIN

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

* Update the releases notes with new known issue

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

* Addition of the release notes for v15.0.1

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

* Fix format issue in the release notes summary

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

* set sql mode in TestAggregateRandom

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>
notfelineit pushed a commit to planetscale/vitess that referenced this pull request Nov 16, 2022
…io#1308)

* Do not multiply `AggregateRandom` in `JOIN`s (vitessio#11633)

* Do not multiply AggregateRandom in JOIN

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

* Update the releases notes with new known issue

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>
Signed-off-by: Andres Taylor <andres@planetscale.com>

* clean up all tables after running e2e tests

Signed-off-by: Andres Taylor <andres@planetscale.com>

* Remove TestOrderByCount test

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>

Signed-off-by: Florent Poinsard <florent.poinsard@outlook.fr>
Signed-off-by: Andres Taylor <andres@planetscale.com>
Co-authored-by: FlorentP <35779988+frouioui@users.noreply.github.com>
Co-authored-by: Florent Poinsard <florent.poinsard@outlook.fr>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Bug Report: vitess generates corrupted results on cross keyspace non full group by queries
5 participants