Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature]: Supporting Guided Decoding via AsyncLLMEngine #8218

Closed
1 task done
DhruvaBansal00 opened this issue Sep 5, 2024 · 2 comments · Fixed by #8252
Closed
1 task done

[Feature]: Supporting Guided Decoding via AsyncLLMEngine #8218

DhruvaBansal00 opened this issue Sep 5, 2024 · 2 comments · Fixed by #8252

Comments

@DhruvaBansal00
Copy link

🚀 The feature, motivation and pitch

Today, Guided Decoding can only be accessed if we use the OpenAI compatible API interface, since that is the only way of passing in a GuidedDecodingRequest object into the generation query.

It should theoretically also be possible to support this via the AsyncLLMEngine.generate() interface. Anything blocking from supporting this today? I don't see it being possible to pass in a GuidedDecodingRequest into the generate method today.

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
@njhill
Copy link
Member

njhill commented Sep 5, 2024

@joerunde

@joerunde
Copy link
Contributor

joerunde commented Sep 6, 2024

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants