Note
... i wanted to explore the rag stuff, so this is the simplest implmentation of rag.
... it is my attempt project to understand how search engine works.
Important
... WhaleSearch is a RAG-powered search engine that combines Google search results with local LLM processing using Phi-3-mini to provide detailed answers with source citations. It supports both regular web searches and image searches, with results enhanced through vector embeddings and ChromaDB for efficient information retrieval.
- Python: Version 3.8 or higher is required.
- SerpAPI API Key: get API key from SerpAPI.
- Model: Download a model like
Phi-3-mini-4k-instruct-fp16.gguf
and place it in an accessible location.
- Clone the repository:
git clone https://github.com/your-repo/whalesearch.git cd whalesearch
- Download the llm weights
!wget https://huggingface.co/microsoft/Phi-3-mini-4k-instruct-gguf/resolve/main/Phi-3-mini-4k-instruct-fp16.gguf
- Install all required libraries
pip install -r requirements.txt
- Run the app through gradio
cd src python3 app.py
- Enter your serapi-key and explore your curiosity.
1732274576503487.mov
- Add support for more search APIs for broader data coverage.
- Enable multilingual query handling.
- Optimize performance for low-resource environments.
- also make it useful for complet local search
- implement llamaCPP from scratch to inference phi-3
- should solve math problem
- keyword search need be added
work in process...