Skip to content

Conversation

@JoshRosen
Copy link
Contributor

What changes were proposed in this pull request?

CollectLimit.execute() incorrectly omits per-partition limits, leading to performance regressions in case this case is hit (which should not happen in normal operation, but can occur in some cases (see #15068 for one example).

How was this patch tested?

Regression test in SQLQuerySuite that asserts the number of records scanned from the input RDD.

@SparkQA
Copy link

SparkQA commented Sep 13, 2016

Test build #65296 has finished for PR 15070 at commit ac01f87.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@hvanhovell
Copy link
Contributor

LGTM - merging to master. Thanks!

@asfgit asfgit closed this in 3f6a2bb Sep 13, 2016
asfgit pushed a commit that referenced this pull request Sep 13, 2016
CollectLimit.execute() incorrectly omits per-partition limits, leading to performance regressions in case this case is hit (which should not happen in normal operation, but can occur in some cases (see #15068 for one example).

Regression test in SQLQuerySuite that asserts the number of records scanned from the input RDD.

Author: Josh Rosen <joshrosen@databricks.com>

Closes #15070 from JoshRosen/SPARK-17515.

(cherry picked from commit 3f6a2bb)
Signed-off-by: Herman van Hovell <hvanhovell@databricks.com>
@JoshRosen JoshRosen deleted the SPARK-17515 branch September 13, 2016 15:21
wgtmac pushed a commit to wgtmac/spark that referenced this pull request Sep 19, 2016
## What changes were proposed in this pull request?

CollectLimit.execute() incorrectly omits per-partition limits, leading to performance regressions in case this case is hit (which should not happen in normal operation, but can occur in some cases (see apache#15068 for one example).

## How was this patch tested?

Regression test in SQLQuerySuite that asserts the number of records scanned from the input RDD.

Author: Josh Rosen <joshrosen@databricks.com>

Closes apache#15070 from JoshRosen/SPARK-17515.
@yhuai
Copy link
Contributor

yhuai commented Sep 21, 2016

Just a note. It is also in branch 2.0.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants