
feat(api_server): add options for users to control output #1613

Closed

Conversation

@aarnphm (Contributor) commented Nov 9, 2023

cc @WoosukKwon when you have bandwidth. This allows users to have more fine-tuned control over how the API server yields back the text. The default behaviour is still the same, but it will address #1612.

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>

Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
@aarnphm aarnphm changed the title feat(api_server): add options for uers to fine tune prompt output feat(api_server): add options for users to control output Nov 9, 2023
Signed-off-by: Aaron <29749331+aarnphm@users.noreply.github.com>
@Tostino (Contributor) commented Nov 14, 2023

@aarnphm I implemented somewhat similar functionality for the OpenAI API chat endpoint (here: #1493) for completing a model response. I used return_full_response as the parameter name. I did not add it to the generation endpoint there, though, so I figured we should have a consistent implementation of this type of feature throughout.
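(For illustration, a request against the chat completions endpoint with the flag from #1493 might look like the sketch below; the model name, messages, and port are placeholders, not part of that PR.)

import requests

# Hypothetical request showing the return_full_response flag proposed
# in #1493; everything except that flag is placeholder illustration.
payload = {
    "model": "mistralai/Mistral-7B-Instruct-v0.1",
    "messages": [
        {"role": "user", "content": "Write a haiku about GPUs."},
        # Partial assistant message for the model to continue.
        {"role": "assistant", "content": "Silicon rivers"},
    ],
    # When true, return the completed assistant message (the partial
    # content included) instead of only the newly generated text.
    "return_full_response": True,
}

resp = requests.post("http://localhost:8000/v1/chat/completions", json=payload)
print(resp.json()["choices"][0]["message"]["content"])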

@aarnphm (Contributor, Author) commented Nov 14, 2023

@Tostino I believe for OpenAI there is an echo feature, right?

@Tostino (Contributor) commented Nov 14, 2023

Correct, for their completions (legacy) API they support echo. Not for their chat completions API.

The current support looks like:

if request.echo:
    # We do not support echo since the vLLM engine does not
    # currently support getting the logprobs of prompt tokens.
    return create_error_response(HTTPStatus.BAD_REQUEST,
                                 "echo is not currently supported")

@simon-mo (Collaborator)

Currently, we are minimizing the functionality of the API server. The API server is not designed for production and is mostly for demo purposes.

If you can add echo to the OpenAI server that would be great. I believe we support prompt logprobs now.
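(A minimal sketch of how echo could be wired into the OpenAI-compatible server using the engine's prompt logprobs; the helper names and request handling here are assumptions for illustration, not vLLM's actual code.)

from vllm import SamplingParams

def build_sampling_params(request):
    # Only request prompt logprobs from the engine when echo needs them.
    prompt_logprobs = request.logprobs if request.echo else None
    return SamplingParams(
        max_tokens=request.max_tokens,
        temperature=request.temperature,
        logprobs=request.logprobs,
        prompt_logprobs=prompt_logprobs,
    )

def format_text(request, request_output):
    # With echo, return the prompt followed by the generated text,
    # matching the behaviour of OpenAI's legacy completions API.
    generated = request_output.outputs[0].text
    return request.prompt + generated if request.echo else generated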

@aarnphm (Contributor, Author) commented Nov 15, 2023

That's fair. I will close this PR then and leave it to the OpenAI server to implement echo support.

@aarnphm aarnphm closed this Nov 15, 2023
@aarnphm aarnphm deleted the feat/api-server-echo-prompt branch November 15, 2023 21:51