- Website Interaction: The chatbot uses LangChain to interact with and extract information from various websites.
- Large Language Model Integration: Powered by GPT-4.
- Streamlit GUI: A clean and intuitive user interface built with Streamlit.
- Python-based: Entirely coded in Python.
- CSS: Used CSS to make the GUI more attractive and user-friendly.
- User Paste's any website link to the input box.
- All the website text is scrapped using BeautifulSoup.
- Text is split into small chunks so that it can be fed to the Embeddings Model.
- Embeddings model performs vectorization and stores it in vector data base.
- The user query is also embeded and top matching text present in the vector data base is found out through semantic search.
- The Top ranked text found from our webpage data and the user query both are feeded to the LLM model.
- LLM model produces the answer to the query and displays it to the user.
- A RAG chatbot is an AI-powered chatbot that uses Retrieval Augmented Generation (RAG) to provide more accurate and relevant responses than traditional chatbots.
- For the Demo we are using Wikipedia Page of "INDIA IN OLYMPICS 2024".
data:image/s3,"s3://crabby-images/40d4a/40d4ab3794236021516b8e037427f5f9967e20b9" alt="Screenshot 2024-09-02 at 4 19 33 PM"
- Paste the link of Wikipedia website in the input box.
data:image/s3,"s3://crabby-images/00541/00541afc89261e499217a9a5e7667b6e7292e8e9" alt="Screenshot 2024-09-02 at 4 19 43 PM"
- After the link is pasted, the bot automatically start.
data:image/s3,"s3://crabby-images/04912/04912b72af5e2b9c7ad5290563517515480addea" alt="Screenshot 2024-09-02 at 4 19 50 PM"
- Question 1
data:image/s3,"s3://crabby-images/d4346/d4346b63ee341ca2f0a5cc72ff53460e1a2a7917" alt="Screenshot 2024-09-02 at 4 20 22 PM"
- Question 2
data:image/s3,"s3://crabby-images/a9899/a98998420fb82c7a239f2d84d6fa7c679bd39a80" alt="Screenshot 2024-09-02 at 4 20 51 PM"
- Question 3. The bot also remembers the chat history and answers accordingly.
data:image/s3,"s3://crabby-images/8947e/8947e9a4355d12fc57bd2f42d829f1f604aa6d2c" alt="Screenshot 2024-09-02 at 4 22 16 PM"
- Question 4. It also summarizes the webpage information.
data:image/s3,"s3://crabby-images/2ed92/2ed92e41c248f83ca2472f4744e1a670df09661a" alt="Screenshot 2024-09-02 at 4 25 34 PM"