Sentiment Analysis With NLP

Overview

This project performs sentiment analysis on 150 blog posts scraped from various websites. It uses Natural Language Processing (NLP) techniques and machine learning models to predict the sentiment of each post.

Introduction

Sentiment analysis, also known as opinion mining, determines the sentiment expressed in a piece of text. This project classifies each blog post as positive, negative, or neutral.

Data Collection

The dataset consists of 150 blog posts scraped from different websites. The posts cover a wide range of topics so that the data is not dominated by a single subject area. Web scraping tools such as BeautifulSoup and Scrapy were used to collect the posts.
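
As a rough illustration of the extraction step, the sketch below pulls paragraph text out of a page that has already been downloaded. It uses the standard library's `html.parser` as a stand-in for the scraping tools named above, so it is not the project's actual scraper. The names `ParagraphExtractor` and `extract_post_text` are hypothetical, as is the assumption that post text lives in `<p>` elements.

```python
from html.parser import HTMLParser


class ParagraphExtractor(HTMLParser):
    """Collects the text found inside <p> elements and skips everything else."""

    def __init__(self):
        super().__init__()
        self._depth = 0        # greater than 0 while inside a <p> element
        self.paragraphs = []

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._depth += 1
            self.paragraphs.append("")

    def handle_endtag(self, tag):
        if tag == "p" and self._depth:
            self._depth -= 1

    def handle_data(self, data):
        # Text outside <p> (navigation, scripts, footers) is ignored.
        if self._depth:
            self.paragraphs[-1] += data


def extract_post_text(html):
    """Return the paragraph text of an HTML page, one paragraph per line."""
    parser = ParagraphExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(p.strip() for p in parser.paragraphs if p.strip())


page = "<html><body><nav>Home</nav><p>First <b>bold</b> line.</p><p>Second.</p></body></html>"
print(extract_post_text(page))
```

Real blog pages vary a lot in layout, so a per-site selector (for example, a specific article container) would likely be needed in practice.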

Preprocessing

Preprocessing steps include:

Cleaning the text (removing HTML tags, punctuation, numbers, and special characters)
Tokenization
Stop words removal
Lemmatization

These steps ensure that the text data is in a suitable format for modeling.
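
The steps above can be sketched with the standard library alone. This is a minimal illustration, not the project's `preprocess.py`: the stop-word list is a tiny hypothetical sample, and `toy_lemmatize` is a crude suffix-stripping placeholder for a real lemmatizer (such as the one in NLTK), which would use a dictionary instead.

```python
import re
from html import unescape

# Tiny illustrative stop-word list (assumption); a real run would use a full list.
STOP_WORDS = {"a", "an", "the", "is", "are", "was", "were", "and", "or",
              "of", "to", "in", "it", "this"}


def clean_text(raw):
    """Strip HTML tags, punctuation, digits and special characters; lowercase."""
    text = unescape(raw)                       # decode entities such as &amp;
    text = re.sub(r"<[^>]+>", " ", text)       # drop HTML tags
    text = re.sub(r"[^A-Za-z\s]", " ", text)   # keep letters and whitespace only
    return re.sub(r"\s+", " ", text).strip().lower()


def toy_lemmatize(token):
    """Placeholder lemmatizer: strips a few common suffixes (not a real lemmatizer)."""
    if token.endswith("ss"):
        return token
    for suffix in ("ing", "ed", "s"):
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: len(token) - len(suffix)]
    return token


def preprocess(raw):
    """Clean, tokenize on whitespace, remove stop words, then lemmatize."""
    tokens = clean_text(raw).split()
    tokens = [t for t in tokens if t not in STOP_WORDS]
    return [toy_lemmatize(t) for t in tokens]


print(preprocess("<p>The reviews were glowing, 10/10!</p>"))  # ['review', 'glow']
```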

Modeling

Several machine learning models were applied to predict the sentiments:

Logistic Regression
Support Vector Machines (SVM)
Random Forest
Naive Bayes
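
To make one of these models concrete, here is a minimal multinomial Naive Bayes classifier written from scratch. It is a teaching sketch under simple assumptions (token lists as input, add-one smoothing, unseen words ignored), not the implementation used in this project. The class name `NaiveBayes` and the toy training data are hypothetical.

```python
import math
from collections import Counter, defaultdict


class NaiveBayes:
    """Multinomial Naive Bayes over token lists with add-one smoothing."""

    def fit(self, docs, labels):
        self.class_counts = Counter(labels)       # documents per class
        self.word_counts = defaultdict(Counter)   # token counts per class
        for tokens, label in zip(docs, labels):
            self.word_counts[label].update(tokens)
        self.vocab = {t for counts in self.word_counts.values() for t in counts}
        return self

    def predict(self, tokens):
        n_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n in self.class_counts.items():
            counts = self.word_counts[label]
            total = sum(counts.values()) + len(self.vocab)
            score = math.log(n / n_docs)          # log prior
            for t in tokens:
                if t in self.vocab:               # skip words never seen in training
                    score += math.log((counts[t] + 1) / total)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


model = NaiveBayes().fit(
    [["great", "love"], ["awful", "hate"], ["okay", "fine"]],
    ["positive", "negative", "neutral"],
)
print(model.predict(["love", "great"]))  # positive
```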

Additionally, the text was converted to numeric features using TF-IDF weighting and word embeddings to improve model performance.
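
For readers unfamiliar with TF-IDF, the sketch below computes the textbook form of the weight: term frequency within a document multiplied by `log(N / df)`, where `N` is the number of documents and `df` is the number of documents containing the term. The function name `tfidf` is hypothetical, and libraries such as scikit-learn use a smoothed variant of the formula, so the numbers here will not match library output exactly.

```python
import math
from collections import Counter


def tfidf(docs):
    """docs: list of token lists. Returns one {term: weight} dict per document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        vectors.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


vectors = tfidf([["great", "post"], ["bad", "post"]])
print(vectors[0])
```

A term that appears in every document, like "post" in this toy example, gets a weight of zero, while terms that separate documents get positive weights.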

Evaluation

The models were evaluated using accuracy, precision, recall, and F1-score. Cross-validation was performed to check that the results hold across different train/test splits.
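
The metrics and the fold splitting can be sketched as follows. This is an illustrative, dependency-free version, not the project's `evaluate.py`. The helper names `classification_report` and `k_fold_indices` are hypothetical, and the splitter uses plain contiguous folds, whereas a real evaluation would typically shuffle or stratify the data first.

```python
from collections import Counter


def classification_report(y_true, y_pred):
    """Return accuracy plus per-class precision, recall and F1."""
    labels = sorted(set(y_true) | set(y_pred))
    cells = Counter(zip(y_true, y_pred))   # confusion-matrix cells (true, predicted)
    report = {}
    for label in labels:
        tp = cells[(label, label)]
        fp = sum(cells[(other, label)] for other in labels if other != label)
        fn = sum(cells[(label, other)] for other in labels if other != label)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        report[label] = {"precision": precision, "recall": recall, "f1": f1}
    accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
    return accuracy, report


def k_fold_indices(n_samples, k):
    """Yield (train_idx, test_idx) pairs for k contiguous, near-equal folds."""
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in sizes:
        test = list(range(start, start + size))
        train = [i for i in range(n_samples) if not start <= i < start + size]
        yield train, test
        start += size


accuracy, report = classification_report(
    ["pos", "pos", "neg", "neu"],
    ["pos", "neg", "neg", "neu"],
)
print(accuracy, report["pos"])
```

With 150 posts and 5 folds, each fold would hold out 30 posts for testing and train on the remaining 120.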

Results

The best performing model achieved an accuracy of XX% (update with actual result) on the test set. Detailed results, including confusion matrices and performance metrics for each model, can be found in the results directory.

Usage

To run this project locally, follow these steps:

Clone the repository:

```
git clone https://github.com/yourusername/sentiment-analysis-blog-posts.git
```

Navigate to the project directory:
```
cd sentiment-analysis-blog-posts
```
Install the required dependencies:
```
pip install -r requirements.txt
```
Run the preprocessing script:
```
python preprocess.py
```
Train the models:
```
python train.py
```
Evaluate the models:
```
python evaluate.py
```

Contributors

This project was developed by Himanshu Mahajan.

License

This project is licensed under the MIT License - see the LICENSE file for details.
