-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-16402] [SQL] JDBC Source: Implement save API of DataFrameWriter #14077
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Hi, @gatorsmile . Sorry for bothering you. :) |
|
@dongjoon-hyun I assume what you are saying is about the |
|
Test build #61867 has finished for PR 14077 at commit
|
|
I guess this is a duplicate of #12601. Maybe we should fix the title and add |
|
@HyukjinKwon uh... I did not realize there exists such a PR. I think the implementation of this PR is much simple. We can close #12601 after this is merged. Thanks! |
|
This may seem simpler, but that's because it seems to be taking some shortcuts to avoid having to refactor. This currently creates a cycle along the lines of |
|
@JustinPihony Thanks for your review! Let me try to answer your concerns.
|
|
@gatorsmile If |
|
@JustinPihony Thank you for confirming that it is a bug in another PR. Regarding the solution of this PR, it is not a true circular reference. The solution in this PR is to minimize the duplicate codes. I also think it make senses to move the common code logics from If you wants to do it in your PR, I am also fine. Please minimize the code changes and add the test cases introduced in this PR. Thanks! |
|
(Personally, I hope this does not get delayed because this usage was shown in Spark Summit PPT and I guess users would try to use this API.) |
|
Then the best course of action would be to use my current impl as it works no matter the position of copy. I can add the additional tests if that would make it more amenable? Otherwise I'll push a reduced code set in the morning, but it would rely on the copy location move PR
|
|
@JustinPihony You know, I do not care which PR is merged eventually. You can try to clean your PR at your best. I will review your PR when it is ready. Thanks for your work! Please continue to submit more PRs for improving Spark. To reduce the code changes in your PR, I think we should not extend |
|
Thanks. I will have to wait until SPARK-16401 is resolved or else the code will not pass tests, though. I also pinged Reynold in JIRA since he had suggested to implement the |
|
@JustinPihony How about you first moving the |
|
@gatorsmile As I said above, I actually think it might be better to keep the work that was already done and am waiting for Reynold's feedback. |
|
Test build #64805 has finished for PR 14077 at commit
|
|
Test build #64806 has finished for PR 14077 at commit
|
|
@gatorsmile Just a reminder that we might be able to close this. |
|
Sure, let me close it now |
What changes were proposed in this pull request?
Before this PR, we are unable to call the
saveAPI ofDataFrameWriterwhen the source is JDBC. For example,The error message users will get is like
At the same time, users can do it for the other data sources, like
parquetThis PR is to implement
createRelationofCreatableRelationProvider. After the changes, we can usesaveAPI ofDataFrameWriter.Closes #12601
How was this patch tested?
Added test cases