- merge pull request from @rishabh-tripathi allowing text corpus objects to also be built with an array of strings
- couple minor code refinements
- tokenization changes to support German (courtesy of @LeFnord)
- user defined stop word list (also via @LeFnord)
- change stop word removal back (optimization)
- added stopwords list and included downcasing to improve performance
- Bug fix for text documents by Rio Akasaka
- Bug fix by Rio Akasaka, fixes issues with segfaults under Ruby 1.9.2
- top_words method now returns actual words if they exist in the vocabulary
- Completely broke backwards compatibility
- Reworked many classes to make functionality more reasonable
- Added ability to load documents from text files
- Bug fixes by Todd Foster
- First stable release