Information Retrieval Assignment 2
- Text conversion to lowercase.
- Tokenization using nltk.
- Removal of stop words using nltk.
- Special characters excluding alphanumeric are removed.
- All singly occurring characters are removed.
- Finally a set of all the words is created.
- Input Query is case insensitive.