
This project provides an interactive research assistant for understanding and navigating documents. Users can query multiple PDFs and receive answers together with the relevant document sections. The interface lets you view PDF or PowerPoint files and jump to specific pages, with targeted previews of the retrieved passages.

Getting Started

Prerequisites

Ensure you have all required dependencies installed. You can do this by running:

```
pip install -r requirements.txt
```

Setup

Place the vector_store directory (for storing vectorized document representations) and the nlp_data directory (containing your PDFs) in the project's root directory.
Ensure all necessary files are available in these directories.
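
Before starting the services, it can help to confirm that both directories are where the application expects them. The following is a minimal sketch, not part of the project: the helper names `missing_dirs` and `count_pdfs` are hypothetical, and it assumes the PDFs sit directly inside nlp_data rather than in subfolders.

```python
from pathlib import Path

# Directory names taken from the Setup step above.
REQUIRED_DIRS = ("vector_store", "nlp_data")


def missing_dirs(root, names=REQUIRED_DIRS):
    """Return the required directory names that do not exist under root."""
    root = Path(root)
    return [name for name in names if not (root / name).is_dir()]


def count_pdfs(root, data_dir="nlp_data"):
    """Count files ending in .pdf (any case) directly inside data_dir.

    Assumption: PDFs are stored at the top level of the directory.
    """
    folder = Path(root) / data_dir
    if not folder.is_dir():
        return 0
    return sum(
        1 for p in folder.iterdir() if p.is_file() and p.suffix.lower() == ".pdf"
    )


if __name__ == "__main__":
    # Run from the project's root directory.
    missing = missing_dirs(".")
    if missing:
        print("Missing directories:", ", ".join(missing))
    else:
        print("Found", count_pdfs("."), "PDF file(s) in nlp_data")
```

Running it from the project root prints either the missing directories or the number of PDFs found.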

Running the Application

Start the Flask API:
```
python backend.py
```
This will launch the API on localhost:5003.
Run the Streamlit application:
```
streamlit run app.py
```
The interface will be available at localhost:8501 on your local machine.
Optionally, explore the demo.py notebook for additional functionality and demonstrations of the project's capabilities.
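
If the interface cannot reach the API, a quick check is whether anything is listening on the two ports mentioned above. This is a minimal sketch using only the standard library; the helper name `is_listening` is hypothetical, and it only tests that a TCP connection can be opened, not that the Flask API or Streamlit app responds correctly.

```python
import socket


def is_listening(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # Ports as stated in this README: 5003 for the API, 8501 for the UI.
    for name, port in (("Flask API", 5003), ("Streamlit app", 8501)):
        state = "up" if is_listening("localhost", port) else "not reachable"
        print(f"{name} on localhost:{port}: {state}")
```

Start backend.py first, then app.py, and run the check from a third terminal.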

RAG Evaluation

Create the evaluation dataset using the code provided in prepare_ragas_set.ipynb.
Run the code in run_ragas.py.
The evaluation results will be saved in mini_result.txt.


hbujakow/NLP2024Z_PDF_Assistant
