Skip to content

Conversation

@mpjlu
Copy link

@mpjlu mpjlu commented Dec 29, 2016

What changes were proposed in this pull request?

Add FDR test case in ml/feature/ChiSqSelectorSuite.
Improve some comments in the code.
This is a follow-up pr for #15212.

How was this patch tested?

ut

@SparkQA
Copy link

SparkQA commented Dec 29, 2016

Test build #70721 has finished for PR 16434 at commit 17a7288.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

Copy link
Member

@jkbradley jkbradley left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PR! Responding to your comment in the other PR, I do think it's worth creating a new test set (or replacing the current one?) which lets us test FDR.

Chi-Squared feature selection, which selects categorical features to use for predicting a
categorical label.
Creates a ChiSquared feature selector.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This line is now duplicated info, so I'd remove it.

@jkbradley
Copy link
Member

Also, could you please change the PR description to be self-contained (rather than just referencing another PR)? The description becomes the commit message.

@SparkQA
Copy link

SparkQA commented Dec 30, 2016

Test build #70744 has finished for PR 16434 at commit 2d87dd3.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@mpjlu
Copy link
Author

mpjlu commented Jan 5, 2017

Hi @jkbradley , I have updated this PR per your comments. Thanks.

chisq.test(x2,y)
chisq.test(x3,y)
/*
* Contingency tables
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's kind of nice seeing these tables, but for validation, it's much easier for users to check using R. Could you please include a section for verifying the results with R, like there was before?

@jkbradley
Copy link
Member

Thanks @mpjlu ! The changes look good, except that I'd like to have a code snippet for verifying with R.

@SparkQA
Copy link

SparkQA commented Jan 6, 2017

Test build #70957 has finished for PR 16434 at commit 840ddfc.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@mpjlu
Copy link
Author

mpjlu commented Jan 6, 2017

Thanks @jkbradley @srowen , I have added a code snippet for verifying with R.
The statics and p-value data in the comment is generated by R.

@srowen
Copy link
Member

srowen commented Jan 10, 2017

Merged to master

@asfgit asfgit closed this in 32286ba Jan 10, 2017
uzadude pushed a commit to uzadude/spark that referenced this pull request Jan 27, 2017
## What changes were proposed in this pull request?
Add FDR test case in ml/feature/ChiSqSelectorSuite.
Improve some comments in the code.
This is a follow-up pr for apache#15212.

## How was this patch tested?
ut

Author: Peng, Meng <peng.meng@intel.com>

Closes apache#16434 from mpjlu/fdr_fwe_update.
cmonkey pushed a commit to cmonkey/spark that referenced this pull request Feb 15, 2017
## What changes were proposed in this pull request?
Add FDR test case in ml/feature/ChiSqSelectorSuite.
Improve some comments in the code.
This is a follow-up pr for apache#15212.

## How was this patch tested?
ut

Author: Peng, Meng <peng.meng@intel.com>

Closes apache#16434 from mpjlu/fdr_fwe_update.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants