Make `skip_special_tokens` a generation parameter. #893

cyanic-selkie · 2023-08-28T08:38:35Z

Hi,

I have a few models that return structured output by using special tokens as delimiters. Currently, vLLM always skips special tokens during decoding, so those delimiters are lost. Would it be possible to add `skip_special_tokens` as a generation parameter?

TGI partially supports this: it gives you the option to return individual tokens with their IDs and a boolean indicating whether each one is special.
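To make the request concrete, here is a minimal, self-contained sketch of the behaviour in question. It uses a toy vocabulary instead of a real tokenizer; the `Token` class, the `decode` function, and the `<sep>` / `</s>` tokens are hypothetical names for illustration only and are not vLLM or TGI code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    id: int
    text: str
    special: bool  # True for delimiter/control tokens


# Toy vocabulary (hypothetical): two special tokens and two ordinary ones.
VOCAB = {
    0: Token(0, "</s>", True),
    1: Token(1, "<sep>", True),
    2: Token(2, "name", False),
    3: Token(3, "Ada", False),
}


def decode(token_ids, skip_special_tokens=True):
    """Join token texts, optionally dropping tokens flagged as special."""
    tokens = [VOCAB[i] for i in token_ids]
    if skip_special_tokens:
        tokens = [t for t in tokens if not t.special]
    return "".join(t.text for t in tokens)


generated = [2, 1, 3, 0]
print(decode(generated))                             # nameAda
print(decode(generated, skip_special_tokens=False))  # name<sep>Ada</s>
```

With the flag left at its default, the delimiter between the two fields disappears, which is why structured output cannot be recovered from the decoded text alone.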


bingfengyiren · 2023-09-04T12:22:52Z

I had the same problem as you.

WoosukKwon · 2023-09-28T02:26:24Z

@cyanic-selkie and @bingfengyiren We've just merged #1186, which adds `skip_special_tokens` to `SamplingParams`. The feature will be included in the next release, which is coming very soon.
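For readers landing here later, a rough sketch of how the new parameter might be used for the delimiter use case from the original report. It assumes a vLLM version that includes #1186; the `split_fields` helper, the `generate_fields` wrapper, and the `<sep>` / `</s>` token strings are hypothetical and depend on your model's tokenizer.

```python
def split_fields(text, delimiter="<sep>", eos="</s>"):
    """Split generated text on a delimiter token, ignoring anything after EOS."""
    body = text.split(eos, 1)[0]
    return [field.strip() for field in body.split(delimiter)]


def generate_fields(prompt, model_name):
    """Generate with special tokens kept, then split the text into fields."""
    # Imported lazily so split_fields can be used without vLLM installed.
    from vllm import LLM, SamplingParams

    # skip_special_tokens=False keeps special tokens in the returned text.
    params = SamplingParams(max_tokens=64, skip_special_tokens=False)
    llm = LLM(model=model_name)
    outputs = llm.generate([prompt], params)
    return split_fields(outputs[0].outputs[0].text)
```

Calling `generate_fields` loads the model, so in practice you would create the `LLM` object once and reuse it across prompts.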

blahblahasdf mentioned this issue Sep 6, 2023

Configuration to allow output of special tokens #970

Closed

WoosukKwon closed this as completed Sep 28, 2023

irthomasthomas mentioned this issue Mar 22, 2024

SELF-RAG: Learning to Retrieve, Generate and Critique through Self-reflection irthomasthomas/undecidability#778

Open

