Skip to content

Conversation

@Lewuathe
Copy link
Contributor

@Lewuathe Lewuathe commented Apr 4, 2015

Add below methods in pyspark for MultivariateStatisticalSummary

  • normL1
  • normL2

@Lewuathe Lewuathe changed the title Implement missing methods for MultivariateStatisticalSummary [SPARK-6262][MLLIB]Implement missing methods for MultivariateStatisticalSummary Apr 4, 2015
@AmplabJenkins
Copy link

Test PASSed.
Refer to this link for build results (access rights to CI server needed):
https://amplab.cs.berkeley.edu/jenkins//job/SparkPullRequestBuilder/29708/
Test PASSed.

@asfgit asfgit closed this in acffc43 Apr 5, 2015
@mengxr
Copy link
Contributor

mengxr commented Apr 5, 2015

LGTM. Merged into master. @Lewuathe There is a minor issue with your test. It doesn't actually test normL1 and normL2 but just the size of the output. We can create a RDD like

rdd = sc.parallelize(xrange(10)).map(lambda x: Vectors.dense(x))
...
assert normL1 and normL2

Could you submit another PR and update the test? Thanks!

asfgit pushed a commit that referenced this pull request Apr 7, 2015
…for normL1...

... and normL2.
Add test cases to insufficient unit test for `normL1` and `normL2`.

Ref: #5359

Author: lewuathe <lewuathe@me.com>

Closes #5374 from Lewuathe/SPARK-6720 and squashes the following commits:

5541b24 [lewuathe] More accurate tests
dc5718c [lewuathe] [SPARK-6720] PySpark MultivariateStatisticalSummary unit test for normL1 and normL2
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants