-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-6743][SQL] Fix empty projections of cached data #6165
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #32759 has finished for PR 6165 at commit
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If attributes.isEmpty, we'll always return Row.empty, so we don't need to create ColumnAccessors, right? The code looks weird to me, why we handle attributes.isEmpty this way?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We need at least one column so that we know how many rows to produce, but we should not include it in the output otherwise we will not match the schema we are claiming to have. It would be better to just remember the number of rows in a partition and use that instead. This is just a quick fix so we don't return the wrong answer.
Conflicts: sql/core/src/main/scala/org/apache/spark/sql/DataFrame.scala
|
test this please |
|
Test build #33272 has finished for PR 6165 at commit
|
|
test this please |
|
Test build #33290 has finished for PR 6165 at commit
|
Author: Michael Armbrust <michael@databricks.com> Closes #6165 from marmbrus/wrongColumn and squashes the following commits: 4fad158 [Michael Armbrust] Merge remote-tracking branch 'origin/master' into wrongColumn aad7eab [Michael Armbrust] rxins comments f1e8df1 [Michael Armbrust] [SPARK-6743][SQL] Fix empty projections of cached data (cherry picked from commit 3b68cb0) Signed-off-by: Michael Armbrust <michael@databricks.com>
Author: Michael Armbrust <michael@databricks.com> Closes apache#6165 from marmbrus/wrongColumn and squashes the following commits: 4fad158 [Michael Armbrust] Merge remote-tracking branch 'origin/master' into wrongColumn aad7eab [Michael Armbrust] rxins comments f1e8df1 [Michael Armbrust] [SPARK-6743][SQL] Fix empty projections of cached data
Author: Michael Armbrust <michael@databricks.com> Closes apache#6165 from marmbrus/wrongColumn and squashes the following commits: 4fad158 [Michael Armbrust] Merge remote-tracking branch 'origin/master' into wrongColumn aad7eab [Michael Armbrust] rxins comments f1e8df1 [Michael Armbrust] [SPARK-6743][SQL] Fix empty projections of cached data
Author: Michael Armbrust <michael@databricks.com> Closes apache#6165 from marmbrus/wrongColumn and squashes the following commits: 4fad158 [Michael Armbrust] Merge remote-tracking branch 'origin/master' into wrongColumn aad7eab [Michael Armbrust] rxins comments f1e8df1 [Michael Armbrust] [SPARK-6743][SQL] Fix empty projections of cached data
No description provided.