Skip to content
/ prizma Public
forked from gitter-badger/prizma

An experiment environment for Text Classification

License

Notifications You must be signed in to change notification settings

knowlp/prizma

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

prizma

A Feature Extraction and Selection Tool for Categorizing Text Documents

  • Read directory structured and csv formatted datasets

  • Directory to CSV dataset conversion

  • Support for subcategories

  • Feature Extraction including n-grams terms

  • Best Terms selection based on TF-IDF, Mutual Information, Information Gain, and other metrics

  • Extracted features can be saved in WEKA ARFF format.

  • A more detailed documentation is comming soon...

About

An experiment environment for Text Classification

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Java 100.0%