-
Notifications
You must be signed in to change notification settings - Fork 1.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Investigate performance drop for DISTINCT queries #5313
Comments
So here we have the answer - it's very inefficient because of inefficiently tracking memory (which was added some versions ago). FYI @alamb |
Nice sluthing -- thank you @comphead and @Dandandan - - I can file a ticket for the regression related to tracking memory if that would help |
so yeah, @Dandandan @alamb I'll file a ticket to optimize this part |
Yes, some quadratic complexity because of the growing state. |
(you could identify it from your flamegraph, it's just the longest bar(s) at the top consuming all the time) |
Right, the function was evident to identify, but it was introduced recently, took some time to figure out its purpose. |
Filed #5325 @Dandandan @alamb should we close this ticket? |
I agree the analysis is done -- thank you @comphead |
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
ClickBench reported a performance drop for
COUNT(DISTINCT)
computation #5276 (comment)Describe the solution you'd like
Its needed to investigate a performance drop root cause in DISTINCT queries and find out how is it possible to increase the speed.
Describe alternatives you've considered
Not doing this
Additional context
Metrics can be found #5276 (comment)
Reproduce the case #5276 (comment)
The text was updated successfully, but these errors were encountered: