[SPARK-22348][SQL] The table cache providing ColumnarBatch should also do partition batch pruning #19569
Conversation
Good catch, thank you. Would it be possible to add a test case for pruning with table cache?
@kiszk Thanks. I've roughly checked existing tests. Seems that there are no related ones for pruning the table cache. Let me see if I can add one.
// Using these variables here to avoid serialization of entire objects (if referenced directly)
// within the map Partitions closure.
val relOutput: AttributeSeq = relation.output
val buffers = filteredCachedBatches()
this seems to be a leftover of copy-paste
Removed. Thanks.
also cc @cloud-fan for review.
Test build #83027 has finished for PR 19569 at commit
good catch! merging to master, thanks!
Test build #83031 has finished for PR 19569 at commit
Test build #83032 has finished for PR 19569 at commit
What changes were proposed in this pull request?
We enable table cache `InMemoryTableScanExec` to provide `ColumnarBatch` now. But the cached batches are retrieved without pruning. In this case, we still need to do partition batch pruning.
How was this patch tested?
Existing tests.
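For readers unfamiliar with the idea, partition batch pruning can be sketched as follows. This is a simplified, standalone illustration and not the code in this patch: `CachedBatch`, `BatchPruningSketch`, and all of their methods are hypothetical names, and the sketch assumes each cached batch keeps min/max statistics for a single integer column. A batch whose statistics show it cannot contain a matching row is skipped before any rows are read.

```scala
// Hypothetical, simplified model of a cached batch: the rows plus the min/max
// statistics of one integer column, collected when the batch was built.
// None of these names come from Spark.
final case class CachedBatch(min: Int, max: Int, rows: Seq[Int])

object BatchPruningSketch {
  // Build a batch and its statistics from raw rows (assumes `rows` is non-empty).
  def buildBatch(rows: Seq[Int]): CachedBatch =
    CachedBatch(rows.min, rows.max, rows)

  // Keep only the batches whose [min, max] range could contain `value`.
  // Batches outside the range are dropped without touching their rows.
  def pruneForEquals(batches: Seq[CachedBatch], value: Int): Seq[CachedBatch] =
    batches.filter(b => b.min <= value && value <= b.max)

  // Scan with pruning: skip impossible batches first, then apply the exact
  // filter to the rows of the batches that survived.
  def scanEquals(batches: Seq[CachedBatch], value: Int): Seq[Int] =
    pruneForEquals(batches, value).flatMap(_.rows.filter(_ == value))
}
```

With batches built from `Seq(1, 2, 3)`, `Seq(10, 11, 12)`, and `Seq(3, 4, 5)`, a lookup for `3` only needs to read the first and third batches. The point of the PR description is that this skipping step should apply regardless of whether the scan returns rows or columnar batches.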