@jkbradley
Member

What changes were proposed in this pull request?

Currently, MLlib's StandardScaler scales columns using the corrected sample standard deviation (the square root of the unbiased variance, which divides by n - 1 rather than n). This matches what R's scale function does.

This PR documents this fact.
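
To make the distinction concrete, here is a minimal sketch in plain Python (not Spark code) comparing the corrected sample standard deviation with the population one. The helper names `corrected_std`, `population_std`, and `scale_column`, the sample data, and the choice to map a zero-variance column to 0.0 are all assumptions made for illustration; they are not taken from this PR or from MLlib's implementation.

```python
import math
import statistics


def corrected_std(values):
    """Square root of the unbiased variance: divides by n - 1."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two values for the corrected std")
    mean = sum(values) / n
    sum_sq = sum((x - mean) ** 2 for x in values)
    return math.sqrt(sum_sq / (n - 1))


def population_std(values):
    """Square root of the biased variance: divides by n."""
    n = len(values)
    mean = sum(values) / n
    sum_sq = sum((x - mean) ** 2 for x in values)
    return math.sqrt(sum_sq / n)


def scale_column(values):
    """Divide each value by the corrected std (hypothetical helper).

    Assumption of this sketch: a zero-variance column maps to 0.0.
    """
    std = corrected_std(values)
    if std == 0.0:
        return [0.0 for _ in values]
    return [x / std for x in values]


# Illustrative data only.
column = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
print(corrected_std(column))   # agrees with statistics.stdev(column)
print(population_std(column))  # agrees with statistics.pstdev(column)
print(scale_column(column))
```

For any column with at least two distinct values, the corrected figure is slightly larger than the population one, so values scaled by it come out slightly smaller in magnitude. The gap shrinks as the number of rows grows.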

How was this patch tested?

doc only

@jkbradley changed the title from "[SPARK-14478][ML][MLLIB] Doc that StandardScaler uses the corrected sample std" to "[SPARK-14478][ML][MLLIB][DOC] Doc that StandardScaler uses the corrected sample std" on Apr 20, 2016
@jkbradley
Member Author

CC: @yanboliang What do you think?

@SparkQA

SparkQA commented Apr 20, 2016

Test build #56326 has finished for PR 12519 at commit 9cb0457.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@yanboliang
Contributor

@jkbradley Appropriate documentation is enough for now; this PR looks good to me. Thanks!

@asfgit asfgit closed this in acc7e59 Apr 20, 2016
@jkbradley jkbradley deleted the scaler-variance-doc branch April 20, 2016 19:23