Closed
Description
Is your feature request related to a problem? Please describe.
Embedding plus reranking is currently the state-of-the-art approach for improving the accuracy of RAG systems. LocalAI already supports an embedding API, so adding a reranker API would be a big step forward.
Describe the solution you'd like
There are many reranker models out there; some well-known ones are bce-reranker-base_v1, CohereRerank, and bge-reranker-v2-m3. I think the Jina reranker API would be a good format to implement: https://jina.ai/reranker/#apiform
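To make the proposed format concrete, here is a minimal sketch of what a Jina-style rerank request and response could look like from the client side. The field names (`model`, `query`, `documents`, `top_n`, `results`, `index`, `relevance_score`) reflect my understanding of the Jina format and should be checked against the linked page. The helper names and the sample response are hypothetical, and no LocalAI endpoint is assumed to exist yet.

```python
import json


def build_rerank_request(query, documents, top_n=None, model="bge-reranker-v2-m3"):
    """Build a Jina-style rerank request body (field names are assumptions)."""
    if not documents:
        raise ValueError("documents must not be empty")
    body = {"model": model, "query": query, "documents": list(documents)}
    if top_n is not None:
        # Never ask for more results than there are documents.
        body["top_n"] = min(top_n, len(documents))
    return body


def apply_rerank_response(documents, response):
    """Return (document, score) pairs ordered by descending relevance score."""
    ranked = sorted(
        response["results"], key=lambda r: r["relevance_score"], reverse=True
    )
    return [(documents[r["index"]], r["relevance_score"]) for r in ranked]


docs = [
    "A reranker scores query-document pairs to reorder retrieved results.",
    "LocalAI exposes an embedding API.",
    "Bananas are rich in potassium.",
]
request_body = build_rerank_request("What is a reranker?", docs, top_n=2)

# Hypothetical server response, hand-written for illustration only.
sample_response = {
    "results": [
        {"index": 2, "relevance_score": 0.12},
        {"index": 0, "relevance_score": 0.91},
    ]
}
ranked = apply_rerank_response(docs, sample_response)

print(json.dumps(request_body, indent=2))
for text, score in ranked:
    print(f"{score:.2f}  {text}")
```

In a RAG pipeline, the embedding search would produce the candidate `documents`, and the reranker response would decide which of them are passed to the LLM.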
Describe alternatives you've considered
n/a
Additional context
The benchmark regarding embedding+reranker for RAG: