RAG-ready

RAG-ready runs your PDFs through preprocessing, chunking, vectorisation and storage in Pinecone, taking you from documents to a RAG-ready Pinecone index in one go. Whether you are building a search application, a chatbot or a recommendation system, RAG-ready handles the document ingestion step for you.

Requirements

Python 3.6+
.env file with the following keys:
- JINA_API_KEY
- PINECONE_API_KEY
- PINECONE_INDEX_NAME

Installation

Clone the repository:

git clone https://github.com/FazlOmar9/RAG-ready.git
cd RAG-ready

Create a virtual environment:
```
python -m venv .venv
```
Activate the virtual environment:
- On Windows:
```
.venv\Scripts\activate
```
- On macOS/Linux:
```
source .venv/bin/activate
```
Install the required packages:
```
pip install -r requirements.txt
```

Create a .env file in the root directory with the following content:

JINA_API_KEY=your_jina_api_key
PINECONE_API_KEY=your_pinecone_api_key
PINECONE_INDEX_NAME=your_pinecone_index_name
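
Before running the pipeline, it can help to check that all three keys are actually set. The snippet below is a minimal, standard-library-only sketch of such a check. It is not part of the repository: `parse_env` and `missing_keys` are hypothetical helper names, and the project itself may load the file differently (for example with a dotenv library).

```python
# Hypothetical pre-flight check for the .env file described above.
REQUIRED_KEYS = ("JINA_API_KEY", "PINECONE_API_KEY", "PINECONE_INDEX_NAME")


def parse_env(text):
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values, required=REQUIRED_KEYS):
    """Return the required keys that are absent or empty, in order."""
    return [key for key in required if not values.get(key)]


# Example with an in-memory string; in practice you would read the .env file.
sample = "JINA_API_KEY=your_jina_api_key\nPINECONE_API_KEY=\n"
print(missing_keys(parse_env(sample)))
```

If the printed list is non-empty, fill in those keys in `.env` before running `main.py`.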

Usage

Place your PDF file in the documents directory.
Run the main script with the path to your PDF file as an argument:
```
python main.py documents/your_pdf_file.pdf
```

This will extract text from the PDF, segment it, generate embeddings, and upload the vectors to Pinecone.
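
To make the segment, embed and upload steps more concrete, here is a rough sketch of what happens to the extracted text. It is an illustration under stated assumptions, not the code in this repository: `chunk_text`, `fake_embed` and `build_records` are hypothetical names, the fixed-size character chunking is a stand-in for whatever segmentation the project uses, and `fake_embed` returns a deterministic dummy vector instead of calling a real embedding API. The record layout (`id`, `values`, `metadata`) follows the dictionary form that Pinecone accepts for upserts.

```python
import hashlib


def chunk_text(text, max_chars=500, overlap=50):
    """Split text into fixed-size character windows that overlap slightly."""
    if overlap >= max_chars:
        raise ValueError("overlap must be smaller than max_chars")
    step = max_chars - overlap
    chunks = []
    for start in range(0, len(text), step):
        piece = text[start:start + max_chars]
        if piece.strip():
            chunks.append(piece)
        if start + max_chars >= len(text):
            break  # the last window already reached the end of the text
    return chunks


def fake_embed(chunk, dim=8):
    """Stand-in for a real embedding call: a deterministic dummy vector."""
    digest = hashlib.sha256(chunk.encode("utf-8")).digest()
    return [byte / 255.0 for byte in digest[:dim]]


def build_records(chunks, source, embed=fake_embed):
    """Pair each chunk with a vector, an id and metadata for upserting."""
    return [
        {
            "id": f"{source}-{i}",
            "values": embed(chunk),
            "metadata": {"source": source, "text": chunk},
        }
        for i, chunk in enumerate(chunks)
    ]


chunks = chunk_text("abcdefghij", max_chars=4, overlap=1)
records = build_records(chunks, "your_pdf_file.pdf")
print(chunks)
print(records[0]["id"], len(records[0]["values"]))
```

Storing the chunk text in the metadata is one common design choice: it lets a retrieval step return the original passage alongside each matching vector without a second lookup.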
