Add feature : add chromadb support as a vector database #60

powerli2002 · 2024-12-14T08:38:12Z

This PR is independent of ES and is based on the latest main branch.

This PR partially addresses the #57 (comment)

Abstract：
Support for chroma as a vector database has been implemented in ModelCache.

Notes:
The implementation of chromadb in this PR uses the chromadb.PersistentClient method to persist it locally. According to the official documentation, this is not a method suitable for production environments. If changed to HttpClient or AsyncHttpClient, the chroma run --path /db_path command needs to be run in advance, which might need to be mentioned in the README document.

Example for chromadb_config.ini

[chromadb]  
persist_directory=./chromadb

I have found some inconsistencies in multicache, such as:

For the vector database, the logic implemented with Redis is that different types of data are stored in different index, while this logic does not exist in Faiss.
Multiple method names are inconsistent with the corresponding vector database methods of the non-multimodal modelcache. For example, rebuilding the database has the multimodal method: def rebuild_idx(self, model):, and the non-multimodal method: def rebuild_col(self, model):.
In the multimodal Redis implementation logic, almost all methods use the parameter mm_type, but when calling these methods, there are cases where this parameter is not passed, such as in add, delete, etc. However, due to the need to store data, I still chose to store the data in different collections.
Additionally, some software packages may not have been included in the requirements.txt.

Given the lack of relevant documentation and my limited understanding of certain features of multicache, as well as the above confusions, my multimodal implementation is for reference only, and if there are any issues, feel free to contact me.

peng3307165 · 2024-12-18T02:31:34Z

Thank you for your continued work. We plan to send you a CodeFuse souvenir. You can also participate in activities related to CodeFuse open source in the future. Can we obtain your contact information? My email is: hongen.phe@antgroup.com

powerli2002 · 2024-12-19T08:27:37Z

Thank you for your continued work. We plan to send you a CodeFuse souvenir. You can also participate in activities related to CodeFuse open source in the future. Can we obtain your contact information? My email is: hongen.phe@antgroup.com

Thank you kindly. I have contacted you via my private Foxmail email. Please check your inbox at your convenience.

peng3307165 · 2024-12-22T01:38:43Z

Thank you for your contribution to the ModelCache project! we've accepted your code. We truly appreciate your efforts and collaboration. Best wishes!

powerli2002 added 2 commits December 14, 2024 16:35

Add feature : add chromadb support as a vector database

aead2c1

Add feature : add chromadb support as a vector database

f3c7657

peng3307165 merged commit 3dad91c into codefuse-ai:main Dec 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add feature : add chromadb support as a vector database #60

Add feature : add chromadb support as a vector database #60

Uh oh!

powerli2002 commented Dec 14, 2024 •

edited

Loading

Uh oh!

peng3307165 commented Dec 18, 2024

Uh oh!

powerli2002 commented Dec 19, 2024

Uh oh!

peng3307165 commented Dec 22, 2024

Uh oh!

Uh oh!

Add feature : add chromadb support as a vector database #60

Add feature : add chromadb support as a vector database #60

Uh oh!

Conversation

powerli2002 commented Dec 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

peng3307165 commented Dec 18, 2024

Uh oh!

powerli2002 commented Dec 19, 2024

Uh oh!

peng3307165 commented Dec 22, 2024

Uh oh!

Uh oh!

powerli2002 commented Dec 14, 2024 •

edited

Loading