Starting point code for MDR implementation #1

TuanNguyen27 · 2016-06-09T14:13:23Z

Assumption(s): Labels are only binary

Implemented:

fit(self, features, classes): simply build a dictionary that maps each
instance of the feature vector to a tuple. The tuple keeps count of how
many times a particular label value appears with that instance of
feature vector. Key: tuple of feature values - Value: tuple of label
frequency/label counts

transform(self, features): After the dictionary is completed, combine
each instance of feature vector above into one corresponding label that
has the frequency ratio greater than its standard default ratio.

score(self, features, classes): Compare the new combined feature vector
with its corresponding class labels, and count the times the two match.
Output the average accuracy by averaging the match count over the
length of the new feature vector / classes vector.

Implementation is tested in main() by training MDR on the training set
and getting accuracy_score on the test set.

Assumption(s): Labels are only binary Implemented: fit(self, features, classes): simply build a dictionary that maps each instance of the feature vector to a tuple. The tuple keeps count of how many times a particular label value appears with that instance of feature vector. Key: tuple of feature values - Value: tuple of label frequency/label counts transform(self, features): After the dictionary is completed, combine each instance of feature vector above into one corresponding label that has the frequency ratio greater than its standard default ratio. score(self, features, classes): Compare the new combined feature vector with its corresponding class labels, and count the times the two match. Output the average accuracy by averaging the match count over the length of the new feature vector / classes vector. Implementation is tested in main() by training MDR on the training set and getting accuracy_score on the test set.

rhiever · 2016-06-09T14:43:40Z

mdr/mdr.py

-            description
+        tie_break: type int (default: 0)
+            description: specify the default label in case there's a tie in a given set of feature values 
+        default_label: type int (default: 0)


Remove the words "type"

oops got cha!

Changed fdict to feature_map Removed ‘type’ in line 34 & 36

Fixed all bugs according to Randal Olson’s comments.

rhiever · 2016-06-10T17:19:31Z

mdr/mdr.py

@@ -18,29 +18,33 @@
 """

 import pandas as pd
-
+import numpy as np 
+from collections import defaultdict
 from __future__ import print_function


from __future__ import print_function must be the first import in the file.

second fix according to comments

rhiever reviewed Jun 9, 2016
View reviewed changes

TuanNguyen27 added 2 commits June 9, 2016 11:04

Fixing some syntactic errors

93ef3fc

Changed fdict to feature_map Removed ‘type’ in line 34 & 36

Fixed EpistasisLab#1

13c2019

Fixed all bugs according to Randal Olson’s comments.

rhiever reviewed Jun 10, 2016
View reviewed changes

fixed

7cea8b8

second fix according to comments

rhiever merged commit 094efd3 into EpistasisLab:master Jun 15, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Starting point code for MDR implementation #1

Starting point code for MDR implementation #1

TuanNguyen27 commented Jun 9, 2016

rhiever Jun 9, 2016

TuanNguyen27 Jun 9, 2016

rhiever Jun 10, 2016

Starting point code for MDR implementation #1

Starting point code for MDR implementation #1

Conversation

TuanNguyen27 commented Jun 9, 2016

rhiever Jun 9, 2016

Choose a reason for hiding this comment

TuanNguyen27 Jun 9, 2016

Choose a reason for hiding this comment

rhiever Jun 10, 2016

Choose a reason for hiding this comment