Predicting Fine-Grained Sentiments For Scraped Amazon Reviews Using SVM and FastText Models Trained On Stanford NLP Treebank :

Usage Guide:

File Descriptions:

'amazon_review.py' :
contains the code to scrape Amazon Reviews.
'stanford_sentiment_treebank_exploratory_data_analysis.py' :
contains the code to generate train ('sst_train.txt'), dev ('sst_dev.txt'), and test ('sst_test.txt') files and perform EDA.
'svm_train_and_predict.py' :
contains the code to train and predict using SVM model and store as a CSV ('svm_predicted_sentiments.csv').
'train_fasttext_sentiment_analysis.py' :
contains the code to train FastText model and store non-quantized('sst.bin') as well as quantized('sst_quantized.ftz') models.
'fasttext_predict_sentiment.py' :
contains the code to predict sentiments using FastText for Amazon Reviews and store as a CSV ('fastText_predicted_sentiments.csv').
'visualize_results.py' :
contains the code to Visualize Results.
'customer_reviews.csv' :
contains the Scraped Reviews.
'svm_predicted_sentiments.csv' :
contains the sentiments predicted using SVM.
'fastText_predicted_sentiments.csv' :
contains the sentiments predicted using FastText.

Run 'amazon_review.py' at Command Line using Scrapy Runspider:

Run this code from the command line to run 'amazon_review.py' and store results as 'customer_reviews.csv'

scrapy runspider amazon_review.py -o customer_reviews.csv

Training and Testing SVM Model:

Use the following code to train and predict using SVM Model.

python svm_train_and_predict.py

After running this file, there is a 'svm_predicted_sentiments.csv' file generated containing the predicted sentiments.

Training FastText:

Use the following code to train FastText Model. It takes around 3-5 minutes on CPU to complete training.

python train_fasttext_sentiment_analysis.py

After training, there will be a model saved as 'sst.bin' and a quantized model saved as 'sst_quantized.ftz'.

Testing Trained Model:

Use the following code to test the quantized FastText Model.

python fasttext_predict_sentiment.py

This code will output a 'fastText_predicted_sentiments.csv' file containing the predicted sentiments.

Results:

WordCloud For Amazon Reviews:

Fasttext Model Results:

Accuracy: 34.25531914893617%

Macro F1-score: 0.28648371805100803

Support Vector Machine(SVM) Model Results:

Accuracy: 40.0%

Macro F1-score: 0.3325577295787083

Thus, it is clearly evident that SVM outperforms FastText for the test dataset containing reviews scraped from Amazon!

Contact :

For any query/feedback, please contact:

Lakshay Mehra: mehralakshay2@gmail.com

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Predicting Fine-Grained Sentiments For Scraped Amazon Reviews Using SVM and FastText Models Trained On Stanford NLP Treebank :

Usage Guide:

File Descriptions:

'amazon_review.py' :

'stanford_sentiment_treebank_exploratory_data_analysis.py' :

'svm_train_and_predict.py' :

'train_fasttext_sentiment_analysis.py' :

'fasttext_predict_sentiment.py' :

'visualize_results.py' :

'customer_reviews.csv' :

'svm_predicted_sentiments.csv' :

'fastText_predicted_sentiments.csv' :

Run 'amazon_review.py' at Command Line using Scrapy Runspider:

Training and Testing SVM Model:

Training FastText:

Testing Trained Model:

Results:

WordCloud For Amazon Reviews:

Fasttext Model Results:

Accuracy: 34.25531914893617%

Macro F1-score: 0.28648371805100803

Support Vector Machine(SVM) Model Results:

Accuracy: 40.0%

Macro F1-score: 0.3325577295787083

Thus, it is clearly evident that SVM outperforms FastText for the test dataset containing reviews scraped from Amazon!

Contact :

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Figure_1.png		Figure_1.png
Figure_2.png		Figure_2.png
Figure_3.png		Figure_3.png
Figure_4.png		Figure_4.png
Figure_5.png		Figure_5.png
Figure_6.png		Figure_6.png
Figure_7.png		Figure_7.png
README.md		README.md
amazon_review.py		amazon_review.py
customer_reviews.csv		customer_reviews.csv
fastText_predicted_sentiments.csv		fastText_predicted_sentiments.csv
fasttext_predict_sentiment.py		fasttext_predict_sentiment.py
sst_test.txt		sst_test.txt
sst_train.txt		sst_train.txt
stanford_sentiment_treebank_exploratory_data_analysis.py		stanford_sentiment_treebank_exploratory_data_analysis.py
svm_predicted_sentiments.csv		svm_predicted_sentiments.csv
svm_train_and_predict.py		svm_train_and_predict.py
train_fasttext_sentiment_analysis.py		train_fasttext_sentiment_analysis.py
visualize_results.py		visualize_results.py

lakshaymehra/fine-grained-sentiment-analysis

Folders and files

Latest commit

History

Repository files navigation

Predicting Fine-Grained Sentiments For Scraped Amazon Reviews Using SVM and FastText Models Trained On Stanford NLP Treebank :

Usage Guide:

File Descriptions:

'amazon_review.py' :

'stanford_sentiment_treebank_exploratory_data_analysis.py' :

'svm_train_and_predict.py' :

'train_fasttext_sentiment_analysis.py' :

'fasttext_predict_sentiment.py' :

'visualize_results.py' :

'customer_reviews.csv' :

'svm_predicted_sentiments.csv' :

'fastText_predicted_sentiments.csv' :

Run 'amazon_review.py' at Command Line using Scrapy Runspider:

Training and Testing SVM Model:

Training FastText:

Testing Trained Model:

Results:

WordCloud For Amazon Reviews:

Fasttext Model Results:

Accuracy: 34.25531914893617%

Macro F1-score: 0.28648371805100803

Support Vector Machine(SVM) Model Results:

Accuracy: 40.0%

Macro F1-score: 0.3325577295787083

Thus, it is clearly evident that SVM outperforms FastText for the test dataset containing reviews scraped from Amazon!

Contact :

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages