-
Notifications
You must be signed in to change notification settings - Fork 7.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to run privateGPT in kubernetes with HA (2 replicas)? #1561
Comments
Seems no one came across this issue? or I'm the only person running it in 2 pods?
After some investigation, I found that there is a data file saving the doc on harddrive:
Anyone knows how to avoid this so it can share the states across 2 pods? Thanks very much. |
I'm able to make it run in 2 PODs by changing the document store and index store to use MongoDB: Not sure if this is the correct way of making it stateless. |
Discussed in #1558
Originally posted by minixxie January 30, 2024
Hello,
First thank you so much for providing this awesome project!
I'm able to run this in kubernetes, but when I try to scale out to 2 replicas (2 pods), I found that the documents ingested are not shared among 2 pods.
First, I found the data being persisted in "local_data/" folder, so I found the doc and spin up qdrant, and change the settings.yaml as follow:
I saw the log of the pod showing the check on qdrant was successful:
After I ingested the doc inside the 1st pod:
I'm able to get the doc from the list endpoint:
However, if I check the list endpoint in the 2nd pod, it's empty:
This means they are not sharing the data from the vector database? Is there any way to run it in HA mode, so all replicas share the same set of documents ingested?
docker image I'm using: 3x3cut0r/privategpt:0.2.0
OS: mac OS mac book pro (Apple M2)
runtime: colima:
The text was updated successfully, but these errors were encountered: