Replies: 1 comment 2 replies
-
Is this endpoint dedicated to Tabby? If that's the case, you might consider deploying Tabby to this machine directly.
-
Hey Tabby Team,
Love what you're doing with Tabby; it's a game-changer! Quick question: is there a way to route Tabby's inference requests to an external endpoint? I've got a setup that batches requests to remote GPUs for better performance.
It could lower latency and let users like me use bigger, custom models. Plus, it might be a cool feature for others too. Just a thought – totally get it if it's not doable right now.
Thanks for all the awesome work!
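For context, the batching setup I have in mind looks roughly like the sketch below: requests are collected for a short window and forwarded to the remote GPU backend in one batch. This is purely illustrative (the `RequestBatcher` name and the fake `send_batch` backend are mine, not part of Tabby):

```python
import queue
import threading
import time

class RequestBatcher:
    """Collects inference requests for a short window and forwards them
    to a backend in one batch. Hypothetical sketch, not Tabby's actual code."""

    def __init__(self, send_batch, max_batch=8, max_wait=0.05):
        self._send_batch = send_batch  # callable taking a list of prompts
        self._queue = queue.Queue()
        self._max_batch = max_batch
        self._max_wait = max_wait
        threading.Thread(target=self._run, daemon=True).start()

    def submit(self, prompt):
        """Enqueue a prompt; returns an Event and a dict the result lands in."""
        done = threading.Event()
        holder = {}
        self._queue.put((prompt, done, holder))
        return done, holder

    def _run(self):
        while True:
            batch = [self._queue.get()]  # block until the first request arrives
            deadline = time.monotonic() + self._max_wait
            # Gather more requests until the batch is full or the window closes.
            while len(batch) < self._max_batch:
                remaining = deadline - time.monotonic()
                if remaining <= 0:
                    break
                try:
                    batch.append(self._queue.get(timeout=remaining))
                except queue.Empty:
                    break
            results = self._send_batch([p for p, _, _ in batch])
            for (_, done, holder), result in zip(batch, results):
                holder["result"] = result
                done.set()

# Usage with a stand-in backend that "completes" each prompt locally.
batcher = RequestBatcher(lambda prompts: [p + "!" for p in prompts])
done, holder = batcher.submit("hello")
done.wait(timeout=1)
print(holder["result"])
```

The point is just that one process can front many clients and amortize the round trip to the remote GPUs, which is why an external-endpoint option in Tabby would help.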