GitHub - PraveenKumar-Rajendran/LLM-CustomChatBot-with-RAG: Building Custom Chatbot with Retrieval Augmented Generation(RAG)

LLM Custom Chat Bot with RAG

Execution Instructions

Create the necessary Conda environment (Python 3.9 recommended):
```
conda create --name chatbot python=3.9
conda activate chatbot
```

Clone the repository:

git clone <repository_url>
cd <repository_directory>

Install the required packages:
```
pip install -r requirements.txt
```
Open the Jupyter notebook:
```
jupyter notebook project.ipynb
```
Replace the API key: Locate the line:
```
openai.api_key = "YOUR API KEY"
```
Replace "YOUR API KEY" with your actual OpenAI API key.
Execute the notebook cells: Run the cells sequentially to see the results. If you want to customize the project for your own dataset, custom data processing will be required. Adjust the cells as needed to fit your specific data and requirements.

Workflow

Data Source

character_descriptions.csv - This file contains character descriptions from theater, television, and film productions. Each row includes the name, description, medium, and setting of the character. All characters were invented by an OpenAI model.

Reasoning for Selection

The dataset chosen for this project is particularly suitable for several reasons. Firstly, the characters within this dataset are invented by an OpenAI model, ensuring that they are unique and unlikely to be pre-existing entities within the knowledge base of any large language model (LLM). Consequently, directly querying the LLM about these characters without additional context would be ineffective and inappropriate.

To address this, we will generate embeddings for the character descriptions, allowing us to retrieve relevant context based on the user’s query. This context will be incorporated into a custom prompt, enabling the LLM to provide more accurate and contextually relevant responses. By leveraging this method, we enhance the LLM's ability to handle inquiries about these specific, unique characters, ultimately improving the quality and relevance of the generated answers.

Data Wrangling

Full Descriptive Text

Combine the columns into a single descriptive paragraph for each character.

Format:

[Name] is a [Description]. This character appears in a [Medium] set in [Setting].

Example:

Emily is a young woman in her early 20s, an aspiring actress. This character appears in a play set in England.

Workflow

Inspecting Non-customized Results of Available Characters
- Understand how the LLM responds to queries about the characters without additional context.
Get the Embeddings for the Text Data
- Generate embeddings for each character description to facilitate context retrieval.
Get Relevant Text for Custom Prompt
- Retrieve the most relevant character descriptions based on the user's query using cosine similarity.
Custom Prompt Creation
- Combine the user's query with the retrieved context to form a custom prompt.
Custom Prompt Answering
- Use the custom prompt to query the LLM for more accurate and contextually relevant responses.
Custom Performance Demonstration
- Demonstrate the improved performance of the LLM with the custom chatbot using specific queries about the characters.

By following this workflow, we ensure that the LLM can provide detailed and accurate answers about the characters in the dataset, leveraging the unique context provided by the custom prompts.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data		data
.gitignore		.gitignore
README.md		README.md
character_descriptions_with_embeddings.csv		character_descriptions_with_embeddings.csv
project.ipynb		project.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM Custom Chat Bot with RAG

Execution Instructions

Workflow

Data Source

Reasoning for Selection

Data Wrangling

Full Descriptive Text

Workflow

About

Releases

Packages

Languages

PraveenKumar-Rajendran/LLM-CustomChatBot-with-RAG

Folders and files

Latest commit

History

Repository files navigation

LLM Custom Chat Bot with RAG

Execution Instructions

Workflow

Data Source

Reasoning for Selection

Data Wrangling

Full Descriptive Text

Workflow

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages