GitHub - BMPMS/Udacity-Enron-Data-Machine-Learning: Machine Learning

The aim of this project is to use SKLearn tools to predict whether an Enron employee is likely to have committed fraud or not.

The code for this project is split into several python files as follows:

poi_id.py - the main python file which iterates through the various project tasks
exploredata.py - various functions relating to null value and outlier analysis as updates and the creation of new features.
feature_selection.py - initial KBest feature selection code
algorithms.py - code for testing the five different chosen algorithms
tester.py - Udacity file used to test the algorithm results. Also includes Cross-Validation.

my_classifier, my_dataset and my_feature_list are files generated by the Udacity tester.py file for project assessment.

NB: I'm aware that the minmaxscaler has no effect on my final results because the chosen algorithm is a Decision Tree.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
README.md		README.md
algorithms.py		algorithms.py
explore_data.py		explore_data.py
feature_selection.py		feature_selection.py
my_classifier.pkl		my_classifier.pkl
my_dataset.pkl		my_dataset.pkl
my_feature_list.pkl		my_feature_list.pkl
poi_id.py		poi_id.py

Provide feedback