Skip to content

Latest commit

 

History

History
24 lines (17 loc) · 1.28 KB

README.md

File metadata and controls

24 lines (17 loc) · 1.28 KB

What is this ?

A modified version of llama-retrieval-plugin based on Facebook's LLaMA which uses Weaviate and HuggingFace's Sentence Transformers to perform vectorized semantic search.

Required environment variables

export BEARER_TOKEN=$(openssl rand -hex 32)
export WEAVIATE_HOST=http://127.0.0.1
export WEAVIATE_PORT=8080
export WEAVIATE_INDEX=CustomDocument

Tips & Lessons

  • The vector database schema is fully customizable and can be adapted to your needs
  • Different embedding models yield different performances on semantic search (I chose the best performing model for semantic search according to HuggingFace but your mileage may vary)
  • The cloud deployment of Weaviate may work better than the local deploy (ran into some errors that were harder to debug than making the switch to the cloud version)

Contributors & License