Use spark's built-in SVD #8

d-v-b · 2018-02-17T22:22:29Z

Recently Spark added a built-in implementation of the SVD as a method on the RowMatrix class. I'm interested in running comparing the performance of this algorithm against the SVD implemented in thunder-factorization. If the Spark version is favorable, we should consider adding support for it in the factorization algorithms that use the SVD

The text was updated successfully, but these errors were encountered:

d-v-b · 2018-02-19T00:22:52Z

ping @freeman-lab, @jwittenbach
My initial tests indicate that the pyspark SVD is much, much faster than the implementation in SVD.py. What would I need to show / do to get a PR reviewed that leverages the spark SVD for thunder-factorization?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use spark's built-in SVD #8

Use spark's built-in SVD #8

d-v-b commented Feb 17, 2018 •

edited

Loading

d-v-b commented Feb 19, 2018 •

edited

Loading

Use spark's built-in SVD #8

Use spark's built-in SVD #8

Comments

d-v-b commented Feb 17, 2018 • edited Loading

d-v-b commented Feb 19, 2018 • edited Loading

d-v-b commented Feb 17, 2018 •

edited

Loading

d-v-b commented Feb 19, 2018 •

edited

Loading