In the dynamic landscape of digital communication, the ability to decipher sentiments expressed in textual content has become increasingly paramount. This project encapsulates a solution to this burgeoning need—a Chrome extension leveraging Natural Language Processing (NLP) for sentiment analysis of comments on tweets. This extension provides a concise summary and categorizing the overall sentiment and emotions expressed within the comments.
Two separate CNNs were trained in order to classify between 5 different emotions (sadness, joy, love, anger and fear) and between positive, negative and neutral sentiment.
Large Language Models (LLMs) were employed for two primary tasks:
- Summarization: The LLM, specifically the gpt-3.5-turbo by OpenAI, was utilized to provide concise summaries of user comments on a post in addition to the classification made by the previously discussed models.
- Retrieval-Augmented Generation (RAG) for question answering: Leveraging the capabilities of the gpt-3.5-turbo in conjunction with retrieval-augmented generation, our system is adept at answering user queries related to all posts analyzed by the extension. For the transformation and storing of comments into embeddings, we integrated Pinecone as our vector store and the text-embedding-ada-002 model.