[SPARK-6800][SQL] Update doc for JDBCRelation's columnPartition #5488
Test build #30157 has finished for PR 5488 at commit
|
|
This is not a bug; it is the intended behavior. I think the right fix here is to update the programming guide and other documentation to state this clearly. If users want to limit the scope of the data, they should add predicates, which will be pushed down to the database. |
|
ok. If I understand it correctly, |
|
That is correct. |
|
Test build #30296 has finished for PR 5488 at commit
|
The parameters minValue and maxValue are advisory in that incorrect values may cause the partitioning to be poor, but no data will fail to be represented.
The sentence above already explains that the filters are only used for partitioning and that all data will always be returned. I think the best place to update would be in the SQL programming guide, in the table under the section "JDBC To Other Databases".
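To make the "advisory" point concrete, here is a minimal sketch in plain Python of how per-partition WHERE clauses could be derived from the bounds. It is an illustration, not Spark's actual code: the function name `partition_where_clauses` and its parameters are hypothetical, and NULL values in the partition column are not considered. The first and last partitions are left open-ended, so the bounds only decide where the splits fall and never exclude rows outside them.

```python
from typing import List, Optional


def partition_where_clauses(column: str, lower: int, upper: int,
                            num_partitions: int) -> List[Optional[str]]:
    """Return one WHERE clause per partition (None means no predicate).

    Hypothetical helper for illustration only; lower/upper play the role of
    the advisory minValue/maxValue discussed above.
    """
    if num_partitions <= 1:
        # A single partition reads the whole table without any predicate.
        return [None]

    # Divide before subtracting to reduce the risk of overflow in
    # fixed-width integer languages (harmless in Python).
    stride = upper // num_partitions - lower // num_partitions

    clauses: List[Optional[str]] = []
    current = lower
    for i in range(num_partitions):
        # The first partition has no lower bound ...
        lower_pred = f"{column} >= {current}" if i != 0 else None
        current += stride
        # ... and the last partition has no upper bound.
        upper_pred = f"{column} < {current}" if i != num_partitions - 1 else None

        if lower_pred and upper_pred:
            clauses.append(f"{lower_pred} AND {upper_pred}")
        else:
            clauses.append(lower_pred or upper_pred)
    return clauses


clauses = partition_where_clauses("id", 0, 100, 4)
for clause in clauses:
    print(clause)
```

With these inputs, a row with `id = 500` still matches the last clause (`id >= 75`) and a row with `id = -3` still matches the first (`id < 25`). Poorly chosen bounds therefore skew partition sizes rather than filter data.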
Updated.
|
Test build #30307 timed out for PR 5488 at commit |
|
Hi.
This doc should be updated too. How can one add predicates to limit the scope of the data pulled from the DB using the SQLContext API? Would a SELECT that limits that scope prevent the entire table from being pulled? |
|
@micaelcapitao Thanks. I updated the doc too. I think you can use the JDBC data source API to create a temporary table and then use a WHERE clause to add predicates. |
|
That's correct. WHERE clause predicates and data frame filter operations
|
|
Test build #30354 has finished for PR 5488 at commit
|
JIRA https://issues.apache.org/jira/browse/SPARK-6800

Author: Liang-Chi Hsieh <viirya@gmail.com>

Closes #5488 from viirya/fix_jdbc_where and squashes the following commits:

51386c8 [Liang-Chi Hsieh] Update code comment.
1dcc929 [Liang-Chi Hsieh] Update document.
3eb74d6 [Liang-Chi Hsieh] Revert and modify doc.
df11783 [Liang-Chi Hsieh] Merge remote-tracking branch 'upstream/master' into fix_jdbc_where
3e7db15 [Liang-Chi Hsieh] Fix wrong logic to generate WHERE clause for JDBC.

(cherry picked from commit e3e4e9a)
Signed-off-by: Michael Armbrust <michael@databricks.com>
|
Thanks! Merged to master and branch-1.3 |