Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Serve] MicroServing API refactor #3071

Merged
merged 1 commit into from
Dec 18, 2024

Conversation

MasterJH5574
Copy link
Member

This PR refactors the MicroServing REST API. With this PR, we now have all the microserving REST APIs under file
python/mlc_llm/serve/entrypoints/microserving_entrypoints.py. And relative protocol data structures are placed under python/mlc_llm/protocol/microserving_protocol.py. These REST APIs essentially wrap and redirect to the OpenAI v1/completions API.

Besides, this PR applies some API name renaming to be consistent with writeups.

This PR refactors the MicroServing REST API. With this PR, we now
have all the microserving REST APIs under file
`python/mlc_llm/serve/entrypoints/microserving_entrypoints.py`.
And relative protocol data structures are placed under
`python/mlc_llm/protocol/microserving_protocol.py`.
These REST APIs essentially wrap and redirect to the OpenAI
`v1/completions` API.

Besides, this PR applies some API name renaming to be consistent
with writeups.
@jinhongyii jinhongyii merged commit 8a1bfd6 into mlc-ai:main Dec 18, 2024
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants