[SPARK-1822] SchemaRDD.count() should use query optimizer #841
Conversation
|
Can one of the admins verify this patch? |
|
@marmbrus I tried to implement the formula you gave on the mailing list. I'm not sure whether I missed anything, so please take a look. Note that I changed Count() to return Long to match RDD.count(). On the Python side, the original rdd.count() returns Int. |
|
Thanks for the contribution! I could use it in my own workflows. Python ints are signed 32-bit numbers, right? If so, that should be made a long.
|
|
@ash211 In Python 2.x, an int is automatically promoted to a long when it overflows, so the distinction only matters in doctests, where you have to be explicit about whether the expected result is 3 or 3L. |
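To illustrate the int/long point, here is a minimal sketch, assuming Python 3, where there is a single arbitrary-precision `int` type and no `3L` literal. The helper `count_rows` is hypothetical and is not part of PySpark; it only shows that a count crossing the signed 32-bit boundary keeps growing instead of wrapping around.

```python
# Largest value a signed 32-bit integer can hold.
INT32_MAX = 2**31 - 1


def count_rows(rows):
    """Hypothetical helper: count items one at a time as a plain Python int."""
    total = 0
    for _ in rows:
        total += 1
    return total


# Stepping past the 32-bit boundary does not overflow in Python.
beyond_int32 = INT32_MAX + 1
small_count = count_rows(range(5))

print(beyond_int32)
print(small_count)
```

On the JVM side the result type still has to be chosen explicitly, which is why the patch moves Count() to Long.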
|
He's on vacation this week so it might take a while for him to get back :) |
|
@rxin thanks for the heads up. I appreciate help from anyone to help burn down my open PRs, the oldest being over a month old. |
Do you mind adding Javadoc for this? Just explain that, unlike RDD's count, SchemaRDD's count actually invokes the optimizer.
Sure, will do.
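For readers unfamiliar with why routing count() through an optimizer can matter, here is a toy sketch in Python. It is not Spark's Catalyst API and not the code in this patch: `Scan`, `CountAll`, `prune_columns`, and `execute` are hypothetical names invented for illustration. The assumption is that a count expressed as a plan node gives an optimizer rule the chance to notice that no column values are needed.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Scan:
    """Leaf node: read the listed columns from an in-memory table."""
    table: Tuple[dict, ...]
    columns: Tuple[str, ...]


@dataclass
class CountAll:
    """Aggregate node: count the rows produced by the child scan."""
    child: Scan


def prune_columns(plan):
    """Toy optimizer rule: a row count needs no column values, so read none."""
    if isinstance(plan, CountAll):
        return CountAll(Scan(plan.child.table, columns=()))
    return plan


def execute(plan):
    """Evaluate a plan: a Scan yields row tuples, a CountAll yields an int."""
    if isinstance(plan, Scan):
        return [tuple(row[c] for c in plan.columns) for row in plan.table]
    if isinstance(plan, CountAll):
        return sum(1 for _ in execute(plan.child))
    raise TypeError(f"unknown plan node: {plan!r}")


rows = (
    {"name": "a", "age": 1},
    {"name": "b", "age": 2},
    {"name": "c", "age": 3},
)

# Naive count: materialize every column of every row, then count.
naive_count = len(execute(Scan(rows, ("name", "age"))))

# Planned count: build an aggregate node and let the rule rewrite it first.
optimized_plan = prune_columns(CountAll(Scan(rows, ("name", "age"))))
optimized_count = execute(optimized_plan)

print(naive_count, optimized_count, optimized_plan.child.columns)
```

Both paths return the same number; the difference is that the rewritten plan reads no columns at all, which is one kind of saving a real optimizer can make.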
|
Jenkins, add to whitelist. |
|
Merged build triggered. |
|
Merged build started. |
|
Merged build finished. All automated tests passed. |
|
All automated tests passed. |
|
Thanks. I've merged this into master & branch-1.0. |
Author: Kan Zhang <kzhang@apache.org>

Closes #841 from kanzhang/SPARK-1822 and squashes the following commits:

2f8072a [Kan Zhang] [SPARK-1822] Minor style update
cf4baa4 [Kan Zhang] [SPARK-1822] Adding Scaladoc
e67c910 [Kan Zhang] [SPARK-1822] SchemaRDD.count() should use optimizer

(cherry picked from commit 6052db9)
Signed-off-by: Reynold Xin <rxin@apache.org>
Minor cleanup following #841.

Author: Reynold Xin <rxin@apache.org>

Closes #868 from rxin/schema-count and squashes the following commits:

5442651 [Reynold Xin] SPARK-1822: Some minor cleanup work on SchemaRDD.count()
Minor cleanup following #841.

Author: Reynold Xin <rxin@apache.org>

Closes #868 from rxin/schema-count and squashes the following commits:

5442651 [Reynold Xin] SPARK-1822: Some minor cleanup work on SchemaRDD.count()

(cherry picked from commit d66642e)
Signed-off-by: Reynold Xin <rxin@apache.org>
|
@rxin thanks for the cleanup! |