-
-
Notifications
You must be signed in to change notification settings - Fork 11.1k
[Benchmark] Fix regression in structured output benchmark #25500
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Benchmark] Fix regression in structured output benchmark #25500
Conversation
This was a mistake introduced by vllm-project#22772. Structured output requests were not actually working because the format spec was not placed in the proper new location in the request body. Signed-off-by: Russell Bryant <rbryant@redhat.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code Review
This pull request correctly fixes a regression in the structured output benchmark by updating how the request body is constructed for structured output requests. The schema is now properly nested under the structured_outputs key, aligning with the API. While this change is correct, I noticed that the code for stopping the profiler (around line 668) still uses the old, incorrect format for the extra_body. It would be good to fix this as well to ensure profiling works correctly.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks!
Signed-off-by: Russell Bryant <rbryant@redhat.com> Signed-off-by: yewentao256 <zhyanwentao@126.com>
…ct#25500) Signed-off-by: Russell Bryant <rbryant@redhat.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>
…ct#25500) Signed-off-by: Russell Bryant <rbryant@redhat.com>
…ct#25500) Signed-off-by: Russell Bryant <rbryant@redhat.com>
…ct#25500) Signed-off-by: Russell Bryant <rbryant@redhat.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>
This was a mistake introduced by #22772. Structured output requests
were not actually working because the format spec was not placed in
the proper new location in the request body.
Signed-off-by: Russell Bryant rbryant@redhat.com