DataExploration.py to explore the data and show most-used ingredients for each cuisine
CrossValidation.py to tune parameters with GridSearchCV
submission.py for the final submission
Use nltk.WordNetLemmatizer to make bag of words from 'ingredients'.Use Tfidfvectorizer to build tfidf matrix from bag of words.Use LinearSVC to build models, with C=1 for best results.
Final score is 0.78872