feat: support huggingface/text-embeddings-inference for faster embedding inference #39

liwenshipro · 2024-05-24T04:54:27Z

Text Embeddings Inference (TEI) is a toolkit for deploying and serving open source text embeddings and sequence
classification models. TEI enables high-performance extraction for the most popular models, including FlagEmbedding,
Ember, GTE and E5. TEI implements many features such as:

No model graph compilation step
Metal support for local execution on Macs
Small docker images and fast boot times. Get ready for true serverless!
Token based dynamic batching
Optimized transformers code for inference using Flash Attention,
Candle
and cuBLASLt
Safetensors weight loading
Production ready (distributed tracing with Open Telemetry, Prometheus metrics)

This PR support TEI faster embedding inference with modelcache, the speedup is shown as follows:

…ing inference

peng3307165 · 2024-05-24T23:42:03Z

Thank you for participating in the ModelCache open-source project; we welcome your involvement, and the addition of huggingface/text-embeddings-inference is a good idea. We offer two suggestions regarding your submission:

1 Using TextEmbeddingsInference as a class name and text_embeddings_inference as a variable name for LazyImport is somewhat generic, users may confuse concepts. It is recommended that names with greater distinction, such as HuggingfaceTEI or Huggingface_TEI, be used to enhance recognizability

2 Given the use of URL requests, it is recommended to add an example to the examples/embedding directory. I have already added the relevant directory, and you can pull the latest main branch to obtain it.

peng3307165 · 2024-09-14T02:58:55Z

We have merged your commit into the main branch. Thank you for your contributions to the ModelCache project.
Best wishes！

feat: support huggingface/text-embeddings-inference for faster embedd…

a7472d3

…ing inference

liwenshipro added 4 commits May 25, 2024 09:07

Merge branch 'codefuse-ai:main' into main

c9f6a2f

fix: rename huggingface TEI class

01acdcd

add huggingface tei example

18d70df

fix: rename huggingface tei

7605a82

peng3307165 closed this Sep 14, 2024

peng3307165 reopened this Sep 14, 2024

peng3307165 merged commit 27f6b78 into codefuse-ai:main Sep 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support huggingface/text-embeddings-inference for faster embedding inference #39

feat: support huggingface/text-embeddings-inference for faster embedding inference #39

liwenshipro commented May 24, 2024

peng3307165 commented May 24, 2024

peng3307165 commented Sep 14, 2024

feat: support huggingface/text-embeddings-inference for faster embedding inference #39

feat: support huggingface/text-embeddings-inference for faster embedding inference #39

Conversation

liwenshipro commented May 24, 2024

peng3307165 commented May 24, 2024

peng3307165 commented Sep 14, 2024