-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-16240][ML] ML persistence backward compatibility for LDA #15034
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
To reviewers: This code was taken and modified from [https://github.com//pull/14112]. @GayathriMurali should be the primary author when we merge this into master and branch-2.0 |
|
CC @hhbyyh Would you mind taking a look? thanks! |
|
Test build #65168 has finished for PR 15034 at commit
|
| */ | ||
| def getAndSetParams(model: LDAParams, metadata: Metadata): Unit = { | ||
| VersionUtils.majorMinorVersion(metadata.sparkVersion) match { | ||
| case (1, 6) => |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
we didn't support LDA serialization in 1.5, right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nope, it was 1.6
|
Thanks @jkbradley. This matches what I had in mind. LGTM. |
|
Thanks @hhbyyh ! |
|
Test build #3279 has finished for PR 15034 at commit
|
|
Jenkins, retest this please. |
|
Test build #65621 has finished for PR 15034 at commit
|
|
@jkbradley, it looks like this is legitimately failing MiMa (not sure why it passed on the first run...): |
|
@JoshRosen That is weird; MiMa passes for me locally, but I see that it shouldn't. I added a MiMaException; this should not be a problem for users. |
|
Test build #65789 has finished for PR 15034 at commit
|
|
I'll go ahead and merge this. Thanks @hhbyyh for reviewing it! |
Allow Spark 2.x to load instances of LDA, LocalLDAModel, and DistributedLDAModel saved from Spark 1.6. I tested this manually, saving the 3 types from 1.6 and loading them into master (2.x). In the future, we can add generic tests for testing backwards compatibility across all ML models in SPARK-15573. Author: Joseph K. Bradley <joseph@databricks.com> Closes apache#15034 from jkbradley/lda-backwards.
… backport ## What changes were proposed in this pull request? Allow Spark 2.x to load instances of LDA, LocalLDAModel, and DistributedLDAModel saved from Spark 1.6. Backport of #15034 for branch-2.0 ## How was this patch tested? I tested this manually, saving the 3 types from 1.6 and loading them into master (2.x). In the future, we can add generic tests for testing backwards compatibility across all ML models in SPARK-15573. Author: Gayathri Murali <gayathri.m.softie@gmail.com> Author: Joseph K. Bradley <joseph@databricks.com> Closes #15205 from jkbradley/lda-backward-2.0.
What changes were proposed in this pull request?
Allow Spark 2.x to load instances of LDA, LocalLDAModel, and DistributedLDAModel saved from Spark 1.6.
How was this patch tested?
I tested this manually, saving the 3 types from 1.6 and loading them into master (2.x). In the future, we can add generic tests for testing backwards compatibility across all ML models in SPARK-15573.