SpamFilters

#Naive-Bayes ##Version 3.0

(C) Brendan J. Herger Analytics Master's Candidate at University of San Francisco 13herger@gmail.com

Available under MIT License http://opensource.org/licenses/MIT

##Intro

Basically, when fed a list of documents and the categories these files belong to, this program will use a Naive-Bayes word frequency method to model the categories. This model can then be applied to identify documents of unknown categories.

This is a simple package for document classification using a Naive-Bayes scheme (fairly well documented, so I won't describe it here). The main package is fairly independent of I/O, but does require the training data, training labels, and test data to be formatted as described in the NaiveBayes documentation.

##Usage

An example script is provided in ExampleData.py, but the crux of this module are the following three lines:

# create model
nb = NaiveBayes.NaiveBayes()

# train
nb.fit(train_df, train_labels)

# predict
output = nb.predict(test_df)

##Thank you notes

This base project was created as a project submission for the Summer 2014 section of MSAN 593, taught at the University of San Francisco by Yannet Interian. I would like to thank her and the faculty for their support and guidance.

It has been updated as an assigment for the Module 2 2014 section of MSAN 621 Machine Learning, taught at the University of San Francisco by Cynthia 'Cindi' Thompson. I would like to thank her and the faculty for their support and guidance.

As a side note, I've written all code in this package from scratch (excluding imports, obviously). As such, I am able to make it available under the MIT License.# SpamFilters

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
__pycache__		__pycache__
data/review_polarity/txt_sentoken		data/review_polarity/txt_sentoken
naive-bayes-classifier		naive-bayes-classifier
ExampleData.py		ExampleData.py
NaiveBayes.pyc		NaiveBayes.pyc
README.md		README.md
__init__.py		__init__.py
__init__.pyc		__init__.pyc
bhUtilties.py		bhUtilties.py
bhUtilties.pyc		bhUtilties.pyc
naivebayes.py		naivebayes.py
practice.py		practice.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SpamFilters

About

Releases

Packages

Languages

jpetersen2/SpamFilters

Folders and files

Latest commit

History

Repository files navigation

SpamFilters

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages