-
Notifications
You must be signed in to change notification settings - Fork 29k
[SPARK-6486] [MLlib] [Python] Add BlockMatrix to PySpark. #7761
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[SPARK-6486] [MLlib] [Python] Add BlockMatrix to PySpark. #7761
Conversation
|
@dusenberrymw Did you coordinate with @MechCoder about this? I thought he already started working on it. |
|
Test build #38916 has finished for PR 7761 at commit
|
|
@mengxr We've been discussing things on PR #7554, but I wasn't sure if Manoj was working on this one. It's a pretty natural extension of 7554, so I already had most of it completed. @MechCoder Apologies if we were both working on this one. I've appreciated the thoughts on #7554, and I'd definitely love any thoughts on this one as well. |
|
@dusenberrymw Next time, you should check and ping the JIRA page first before you start coding to avoid duplicate work. |
|
@mengxr Yes, certainly. As soon as you merge #7746, I can finish #7554 and this PR. I'd also like to work on SPARK-6488. |
|
Test build #38937 has finished for PR 7761 at commit
|
|
Jenkins, retest this please. |
|
Test build #38955 has finished for PR 7761 at commit
|
|
Test build #150 has finished for PR 7761 at commit
|
|
@dusenberrymw You could have pinged me on the JIRA, since I already started work on this. (But no hard feelings) I can give a pass on this after I get back access to my machine on Saturday. |
|
@MechCoder Yes, apologies again for not realizing you were also working on this. I look forward to your thoughts! |
…logic to match that of the other distributed matrices, adding conversions, and adding documentation.
9a813d3 to
b8acc1c
Compare
|
Test build #39808 has finished for PR 7761 at commit
|
|
Test build #39834 has finished for PR 7761 at commit
|
mengxr This adds the `BlockMatrix` to PySpark. I have the conversions to `IndexedRowMatrix` and `CoordinateMatrix` ready as well, so once PR #7554 is completed (which relies on PR #7746), this PR can be finished. Author: Mike Dusenberry <mwdusenb@us.ibm.com> Closes #7761 from dusenberrymw/SPARK-6486_Add_BlockMatrix_to_PySpark and squashes the following commits: 27195c2 [Mike Dusenberry] Adding one more check to _convert_to_matrix_block_tuple, and a few minor documentation changes. ae50883 [Mike Dusenberry] Minor update: BlockMatrix should inherit from DistributedMatrix. b8acc1c [Mike Dusenberry] Moving BlockMatrix to pyspark.mllib.linalg.distributed, updating the logic to match that of the other distributed matrices, adding conversions, and adding documentation. c014002 [Mike Dusenberry] Using properties for better documentation. 3bda6ab [Mike Dusenberry] Adding documentation. 8fb3095 [Mike Dusenberry] Small cleanup. e17af2e [Mike Dusenberry] Adding BlockMatrix to PySpark. (cherry picked from commit 34dcf10) Signed-off-by: Xiangrui Meng <meng@databricks.com>
|
LGTM. Merged into master and branch-1.5. Thanks! |
|
Great, thanks @mengxr! |
@mengxr This adds the
BlockMatrixto PySpark. I have the conversions toIndexedRowMatrixandCoordinateMatrixready as well, so once PR #7554 is completed (which relies on PR #7746), this PR can be finished.