[Feature] 可以支持embedding模型吗，类似于xinference的功能 #1927

jxfruit · 2024-07-05T04:03:30Z

Motivation

具体场景就是，想利用lmdeploy提供超快的推理能力，然后用一个私有化的知识库工具langchain-chatchat，这个需要embedding模型支持，所以看下大佬们有没有这方面的规划

Related resources

No response

Additional context

No response

lvhan028 · 2024-07-05T04:10:59Z

可否提供一个 embedding 模型的 list？我们先调研下看看

jxfruit · 2024-07-05T04:13:46Z

https://inference.readthedocs.io/zh-cn/latest/models/builtin/embedding/index.html

lvhan028 · 2024-07-05T05:00:41Z

@AllentDan Could you investigate this feature?

thiner · 2024-07-05T06:37:59Z

如果支持embedding模型，最好也能支持reranker模型。可以参考：mudler/LocalAI#2121
可用模型https://huggingface.co/BAAI/bge-reranker-v2-m3 进行测试

lvhan028 · 2024-07-05T06:54:33Z

我们需要调研下，看好不好支持，以及怎么支持。
在调研结论出来之前，不能给什么承诺。还请谅解。

lvhan028 · 2024-07-11T09:51:43Z

@jxfruit 你是想用 lmdeploy 加速 embeddings 模型的推理，是吧

AllentDan · 2024-07-11T10:07:48Z

I will check the implementations of fastchat and xinference.

jxfruit · 2024-07-12T08:16:24Z

@jxfruit 你是想用 lmdeploy 加速 embeddings 模型的推理，是吧

如果可以支持的话当然最好了，我目前最大诉求就是能支持推理就行，不用加速，以后做加速也可以

AllentDan · 2024-07-16T08:48:23Z

@jxfruit 用过 fastchat 的 embedding 服务吗？先确定一下，类 llama 模型的 embedding 是否符合你需求。目前我这边使用了几个支持 embedding 模型的开源框架，主要是 bert，T5 和 llama。llama 模型只有 fastchat 支持。

update:
vllm 支持了一个 Mistral 结构的 embedding 模型

jxfruit · 2024-07-17T06:25:26Z

@jxfruit 用过 fastchat 的 embedding 服务吗？先确定一下，类 llama 模型的 embedding 是否符合你需求。目前我这边使用了几个支持 embedding 模型的开源框架，主要是 bert，T5 和 llama。llama 模型只有 fastchat 支持

fastchat 没有用过，我们目前对具体的模型还没有诉求。但是看了一些，目前还是主要考虑xinference，建议参考下xinference这个项目呢，从Langchain-Chatchat项目里摘过来的一个本地部署框架的对比：

Tendo33 · 2024-07-18T09:13:43Z

附议，这样的话一所有部署任务一个框架就统一了.
目前比较统一的框架：https://github.com/xusenlinzy/api-for-open-llm

HughesZhang2021 · 2024-07-25T11:20:11Z

looking forward to supporting embedding model soon....

lvhan028 · 2024-07-25T12:36:36Z

Hi, folks,
感谢大家对 LMDeploy 的支持和认可。很遗憾，经过我们内部的分析和讨论后，决定暂不支持 embedding 模型。
在未来半年的工作中，我们会专注于 LLM 的推理优化，以及支持 InternLM 的内部研发。

lvhan028 assigned AllentDan Jul 5, 2024

lvhan028 unassigned AllentDan Jul 22, 2024

lvhan028 closed this as completed Jul 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] 可以支持embedding模型吗，类似于xinference的功能 #1927

[Feature] 可以支持embedding模型吗，类似于xinference的功能 #1927

jxfruit commented Jul 5, 2024

lvhan028 commented Jul 5, 2024

jxfruit commented Jul 5, 2024

lvhan028 commented Jul 5, 2024

thiner commented Jul 5, 2024 •

edited

Loading

lvhan028 commented Jul 5, 2024

lvhan028 commented Jul 11, 2024

AllentDan commented Jul 11, 2024

jxfruit commented Jul 12, 2024

AllentDan commented Jul 16, 2024 •

edited

Loading

jxfruit commented Jul 17, 2024

Tendo33 commented Jul 18, 2024 •

edited

Loading

HughesZhang2021 commented Jul 25, 2024

lvhan028 commented Jul 25, 2024

[Feature] 可以支持embedding模型吗，类似于xinference的功能 #1927

[Feature] 可以支持embedding模型吗，类似于xinference的功能 #1927

Comments

jxfruit commented Jul 5, 2024

Motivation

Related resources

Additional context

lvhan028 commented Jul 5, 2024

jxfruit commented Jul 5, 2024

lvhan028 commented Jul 5, 2024

thiner commented Jul 5, 2024 • edited Loading

lvhan028 commented Jul 5, 2024

lvhan028 commented Jul 11, 2024

AllentDan commented Jul 11, 2024

jxfruit commented Jul 12, 2024

AllentDan commented Jul 16, 2024 • edited Loading

jxfruit commented Jul 17, 2024

Tendo33 commented Jul 18, 2024 • edited Loading

HughesZhang2021 commented Jul 25, 2024

lvhan028 commented Jul 25, 2024

thiner commented Jul 5, 2024 •

edited

Loading

AllentDan commented Jul 16, 2024 •

edited

Loading

Tendo33 commented Jul 18, 2024 •

edited

Loading