10-605

Machine Learning with Large Datasets

Distributed SGD for Matrix Factorization on Spark

The main program is dsgd_mf.py To run the main program, use the following command:

spark-submit dsgd_mf.py (num_factors) (num_workers) (num_iterations) (beta_value) (lambda_value) (inputV_filepath) (outputW_filepath) (outputH_filepath)

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
README.md		README.md
dsgd_mf.py		dsgd_mf.py
homework7.pdf		homework7.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

10-605

About

Releases

Packages

Languages

shinysingh/10-605

Folders and files

Latest commit

History

Repository files navigation

10-605

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages