OpenAI compatible server: tokenizer arg causes issues with pooling resources amongst models #7815
thealmightygrant started this conversation in Ideas
Replies: 0 comments
Hi y'all, I was testing out the OpenAI-compatible server and noticed that the tokenizer can already be specified directly in the vllm_backend via model.json. Could we make it optional rather than a required argument for the OpenAI server?
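For context, here is roughly what I mean by putting the tokenizer in the vllm_backend's model.json. This is a minimal sketch: the field names follow vLLM's engine args, and the model/tokenizer values below are just placeholders, not from my actual setup.

```json
{
  "model": "meta-llama/Meta-Llama-3-8B-Instruct",
  "tokenizer": "meta-llama/Meta-Llama-3-8B-Instruct",
  "disable_log_requests": true,
  "gpu_memory_utilization": 0.9
}
```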
My thought here is that this opens up serving multiple models from the same server, even if those models use different tokenizers.
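Right now, as I understand the README, the frontend is launched with a single --tokenizer flag that applies to the whole server, something like the command below (illustrative only; the script path, repository path, and model name are placeholders on my part):

```sh
# Launch the OpenAI-compatible frontend with one tokenizer for every served model.
python3 openai_frontend/main.py \
  --model-repository /path/to/model_repository \
  --tokenizer meta-llama/Meta-Llama-3-8B-Instruct
```

If the tokenizer instead came from each model's model.json, each model in the repository could carry its own tokenizer and the flag would no longer constrain the server to one.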