-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-8501] [SQL] Avoids reading schema from empty ORC files (backport to 1.4) #7200
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Test build #36439 has finished for PR 7200 at commit
|
|
@liancheng Because in spark, we will not create the orc file if the record is empty. It is only happens with the ORC file created by hive, right? Forget it. Your test case answers the question. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In what situation, will the third case happen? If not exist, can we collapse the 2nd and 3rd case?
|
some minor comments. Overall, LGTM |
|
@zhzhan Thanks for the review! Updated. Actually while writing Javadoc of |
9538bff to
725e9e3
Compare
|
Test build #36455 has finished for PR 7200 at commit
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The numbers were increased to 100 to workaround SPARK-8501. Now it's fixed, so revert them back.
|
Test build #36458 has finished for PR 7200 at commit
|
|
Merging to branch-1.4. |
…rt to 1.4) This PR backports #7199 to branch-1.4 Author: Cheng Lian <lian@databricks.com> Closes #7200 from liancheng/spark-8501-for-1.4 and squashes the following commits: 725e9e3 [Cheng Lian] Addresses comments 0fa25af [Cheng Lian] Avoids reading schema from empty ORC files
|
This PR has already merged right? |
|
@sarutak Thanks for reminding, closing it :) |
This PR backports #7199 to branch-1.4