-
Notifications
You must be signed in to change notification settings - Fork 520
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Feature request] Add simple HTTP API server like in llama.cpp with api like OpenAI #1
Comments
Great suggestion, if there's others who interested please +emoji above and we'll prioritize this :) |
Just for the update: |
Also with 💎Gemma in 🦙Llama.CPP you get CUDA, Neon and AMD GPUs support! |
adding a api like support would be great these models can be used on cpu for smaller tasks. |
I have a question: why using http but not websocket? As I known, the answer token is generated one word by one word. |
WebSocket is more suitable for instant messenger style UI but may not be ideal for other UI types. And I think it is better to integrate gemma.cpp as a module into the web backend framework than to implement the HTTP/WebSocket API directly. Here is my WebSocket online demo solution, and you can try it here or via this Kaggle notebook. In this solution gemma.cpp is a module of OpenResty which makes it easy to implement WebSocket or HTTP API. |
Could you please confirm if this issue is resolved for you with the above comment ? Please feel free to close the issue if it is resolved ? Thank you. |
ok app |
Yes it is
…On Wed, 16 Oct 2024, 19:14 Leszek, ***@***.***> wrote:
ok app
—
Reply to this email directly, view it on GitHub
<#1 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AFCLH3EJX5QCVDE3KMMDK53Z32NFVAVCNFSM6AAAAABQATJCOOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDIMJXGQZTSMRZG4>
.
You are receiving this because you are subscribed to this thread.Message
ID: ***@***.***>
|
Thank you
…On Thu, 17 Oct 2024, 20:08 Zeenat Randeree, ***@***.***> wrote:
Yes it is
On Wed, 16 Oct 2024, 19:14 Leszek, ***@***.***> wrote:
> ok app
>
> —
> Reply to this email directly, view it on GitHub
> <#1 (comment)>,
> or unsubscribe
> <https://github.com/notifications/unsubscribe-auth/AFCLH3EJX5QCVDE3KMMDK53Z32NFVAVCNFSM6AAAAABQATJCOOVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDIMJXGQZTSMRZG4>
> .
> You are receiving this because you are subscribed to this thread.Message
> ID: ***@***.***>
>
|
Closing this issue, please feel free reopen if this is still a valid request. Thank you! |
For more infos here
https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md
The text was updated successfully, but these errors were encountered: