A local LLM that can access specific, specialized knowledge it otherwise wouldn't have access to.
Data is derived from URLs or PDF files that you provide, embedded, and stored in a Postgres vector database for retrieval-augmented generation (RAG). Output can go to the command line or to a web UI via Streamlit or Gradio. Because the embeddings live in the database, the added knowledge is persistent and stays available across sessions.
This is an ongoing work-in-progress and proof-of-concept project. Not everything is set up yet; missing pieces will be added in the future. You might have to troubleshoot or search for answers to get it running on your specific setup. It is meant for experimentation and will change over time... hopefully for the better.
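For orientation, here is a minimal sketch of the intended flow. It assumes LangChain's community integrations (and their dependencies) are installed, a pgvector-enabled Postgres is reachable on localhost with the default postgres/postgres credentials, and the models below have been pulled into Ollama; the actual apps in this repo may be structured differently, and the URL is only a placeholder.

# Minimal RAG sketch: load a URL, embed it into Postgres/pgvector, then ask a question.
from langchain_community.document_loaders import WebBaseLoader
from langchain_community.embeddings import OllamaEmbeddings
from langchain_community.llms import Ollama
from langchain_community.vectorstores import PGVector
from langchain.text_splitter import RecursiveCharacterTextSplitter

# Load and chunk the source page.
docs = WebBaseLoader("https://example.com/article").load()
chunks = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=100).split_documents(docs)

# Embed the chunks with nomic-embed-text and persist them in the Postgres vector db.
store = PGVector.from_documents(
    documents=chunks,
    embedding=OllamaEmbeddings(model="nomic-embed-text"),
    collection_name="knowledge",
    connection_string="postgresql+psycopg2://postgres:postgres@localhost:5432/postgres",
)

# Retrieve the most relevant chunks and let the local model answer from them.
question = "What does the article say about X?"
context = "\n\n".join(d.page_content for d in store.similarity_search(question, k=4))
llm = Ollama(model="mistral-openorca")
print(llm.invoke(f"Answer using only this context:\n{context}\n\nQuestion: {question}"))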
-
Python
-
Install Python (Debian/Ubuntu example):
sudo apt-get install python3
-
Check version / verify installation:
python3 --version
-
-
Ollama
- Install Ollama
curl -fsSL https://ollama.com/install.sh | sh
- add a chat model (I used mistral-openorca, but you can use any model from the Ollama library, or from Hugging Face if you know how to import one manually)
ollama pull mistral-openorca
- add the 'nomic-embed-text' embedding model (it embeds the documents for the vector db and supports a large 8192-token context window)
ollama pull nomic-embed-text
- run the list command to confirm both models are available (a quick Python check follows below)
ollama list
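To confirm the models are reachable from code and not just the CLI, you can hit Ollama's local REST API, which listens on port 11434 by default. This is only a sanity check (it assumes the requests package is available), not part of the repo:

# Quick sanity check against the local Ollama server (default port 11434).
import requests

# Generate a short completion with the chat model.
gen = requests.post("http://localhost:11434/api/generate", json={
    "model": "mistral-openorca",
    "prompt": "Say hello in one sentence.",
    "stream": False,
})
print(gen.json()["response"])

# Produce an embedding with nomic-embed-text (this is what feeds the vector db).
emb = requests.post("http://localhost:11434/api/embeddings", json={
    "model": "nomic-embed-text",
    "prompt": "a short piece of text to embed",
})
print(len(emb.json()["embedding"]), "dimensions")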
-
Install Docker Engine or Docker Desktop (see https://docs.docker.com/get-docker/ for instructions):
-
clone this repo, then:
- install all from requirements.txt
pip install -r requirements.txt
- bring up the Postgres container with Docker Compose
docker compose up -d
- I used postgres/postgres for the username & password. Change these as needed in the compose yml file (a quick connection check follows after this list)
- run the Python apps:
- app1.py: command line only; edit the URLs and search questions directly in the code to update them
- app2.py: Gradio web UI version (see the sketch below)
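As mentioned in the compose step above, here is a quick connection check you can run once the container is up. It assumes psycopg2 is installed and the default postgres/postgres credentials on localhost:5432; adjust to match your compose yml file.

# Smoke test for the Postgres container started with docker compose.
import psycopg2

conn = psycopg2.connect(
    host="localhost", port=5432,
    user="postgres", password="postgres", dbname="postgres",
)
with conn, conn.cursor() as cur:
    cur.execute("SELECT version();")
    print(cur.fetchone()[0])
    # This assumes the image ships the pgvector extension (e.g. the pgvector/pgvector image);
    # remove the line if your image does not include it.
    cur.execute("CREATE EXTENSION IF NOT EXISTS vector;")
conn.close()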
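For readers unfamiliar with Gradio, the sketch below shows roughly the shape such a UI takes; answer_question is a hypothetical stand-in for the repo's retrieval-plus-LLM call, not its actual function name.

# Minimal Gradio wrapper around a question-answering function.
import gradio as gr

def answer_question(question: str) -> str:
    # Hypothetical placeholder: the real app retrieves context from the
    # Postgres vector db and sends it to the Ollama model here.
    return f"(answer to: {question})"

demo = gr.Interface(
    fn=answer_question,
    inputs=gr.Textbox(label="Question"),
    outputs=gr.Textbox(label="Answer"),
    title="Local RAG demo",
)
demo.launch()  # serves on http://127.0.0.1:7860 by default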
-
TODO
- Wire up the Postgres vector db so embeddings are actually stored and queried
- Implement the pypdf library so that PDFs can be used to update the LLM's knowledge base (see the sketch after this list)
- Implement the Streamlit UI version
- Dockerize the entire app to simplify installation
- Implement additional function calling to enhance usefulness
- Add an option to choose any model available in your Ollama install instead of hard-coding one
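For the pypdf item above, the extraction step could look something like this sketch (the file path is a placeholder; the extracted text would then go through the same chunk-embed-store path used for URLs):

# Possible shape of the planned PDF ingestion: extract text with pypdf.
from pypdf import PdfReader

reader = PdfReader("docs/example.pdf")  # placeholder path
text = "\n".join(page.extract_text() or "" for page in reader.pages)
print(f"Extracted {len(reader.pages)} pages, {len(text)} characters")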