[Feature]: Supporting Guided Decoding via AsyncLLMEngine #8218

DhruvaBansal00 · 2024-09-05T23:34:13Z

🚀 The feature, motivation and pitch

Today, Guided Decoding can only be accessed if we use the OpenAI compatible API interface, since that is the only way of passing in a GuidedDecodingRequest object into the generation query.

It should theoretically also be possible to support this via the AsyncLLMEngine.generate() interface. Anything blocking from supporting this today? I don't see it being possible to pass in a GuidedDecodingRequest into the generate method today.

Alternatives

No response

Additional context

No response

Before submitting a new issue...

Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.

The text was updated successfully, but these errors were encountered:

njhill · 2024-09-05T23:48:31Z

@joerunde

joerunde · 2024-09-06T22:53:35Z

@DhruvaBansal00 mind taking a look at https://github.com/vllm-project/vllm/pull/8252/files#r1747793503?

DhruvaBansal00 added the feature request label Sep 5, 2024

joerunde mentioned this issue Sep 6, 2024

[Frontend][Core] Move guided decoding params into sampling params #8252

Merged

DarkLight1337 closed this as completed in #8252 Oct 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature]: Supporting Guided Decoding via AsyncLLMEngine #8218

[Feature]: Supporting Guided Decoding via AsyncLLMEngine #8218

DhruvaBansal00 commented Sep 5, 2024

njhill commented Sep 5, 2024

joerunde commented Sep 6, 2024

[Feature]: Supporting Guided Decoding via AsyncLLMEngine #8218

[Feature]: Supporting Guided Decoding via AsyncLLMEngine #8218

Comments

DhruvaBansal00 commented Sep 5, 2024

🚀 The feature, motivation and pitch

Alternatives

Additional context

Before submitting a new issue...

njhill commented Sep 5, 2024

joerunde commented Sep 6, 2024