-
-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[v0.2.3] Release Tracker #1856
Labels
release
Related to new version release
Comments
I will also add #1662 |
AWQ's bad prefill performance exacerbated the issue. The root issue was non-chunked prefill. |
@WoosukKwon for the prometheus metrics, I made a new PR, please take a look. |
Closed by #1903 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
ETA: Nov 30th - Dec 2nd.
Major changes
PRs to be merged before the release
echo
for chat API #1756API causes slowdown in batch request handling #1707(We have to solve AWQ perf first, which might be possible in time).Add support for prometheus metrics #1662(use the new one instead)The text was updated successfully, but these errors were encountered: