Add outlines vLLM OpenAI server #598
Conversation
Good job integrating vLLM's OpenAI module with Outlines!
Could you document usage and availability in vllm.md (or a new markdown file openai_endpoint.md)?
I'll do thorough smoke testing later this week.
outlines/serve/openai_server.py
Outdated
from http import HTTPStatus
from typing import AsyncGenerator, Dict, List, Optional, Tuple, Union

from aioprometheus import MetricsMiddleware
Please ensure any new dependencies are added to pyproject.toml.
Wouldn't they be installed when installing vLLM?
Yes, I should have pulled before comparing to my local openai_server.py...
best_of=request.best_of,
top_k=request.top_k,
ignore_eos=request.ignore_eos,
use_beam_search=request.use_beam_search,
Beam search functionality depends on #539
I'd expect this to extend OpenAI's existing response_format field to include the schema, rather than use an extra_body field. Thoughts?
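For clarity, here is a rough sketch of the two payload shapes being discussed; the field names and values below are illustrative assumptions, not the API this PR actually implements:

```python
# Hypothetical request payloads, for illustration only.

# (a) Extending OpenAI's existing response_format field to carry the schema:
request_with_response_format = {
    "model": "my-model",
    "prompt": "Generate a user profile.",
    "response_format": {
        "type": "json_schema",        # assumed extension, not a standard type here
        "schema": {"type": "object"},
    },
}

# (b) Passing the schema as an extra, non-standard top-level field,
#     which OpenAI clients would have to send via extra_body:
request_with_extra_field = {
    "model": "my-model",
    "prompt": "Generate a user profile.",
    "schema": {"type": "object"},
}
```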
Thank you for opening a PR, this is impressive and I will need a few days to review it. In the meantime please make sure that the checks in the CI are passing.
Wow, this looks amazing. So, if I understand correctly: after I deploy Outlines to my server, I can just add another line to my LiteLLM config pointing to Outlines, and it will work with my existing UI?
I tried smoke testing, but got …
@Soufiane-Ra could you solve the formatting issues?
return StreamingResponse(
    completion_stream_generator(), media_type="text/event-stream"
)
else:
@Soufiane-Ra Great work! We'd love to use this. It looks like there is another indentation error here. Could you fix it? Thx
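For reference, a minimal self-contained sketch of the branch structure this comment refers to, assuming the FastAPI setup used by vLLM's OpenAI server; the function and variable names are placeholders:

```python
from fastapi.responses import JSONResponse, StreamingResponse

async def create_completion(request, completion_stream_generator, result):
    if request.stream:
        # The streaming return belongs inside the `if` branch ...
        return StreamingResponse(
            completion_stream_generator(), media_type="text/event-stream"
        )
    else:
        # ... and the non-streaming response inside the matching `else`.
        return JSONResponse(content=result)
```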
Following this and would love to see any update.
I cannot agree. To be OpenAI-compatible we have to use extra_body. E.g. if you use the openai library for sending requests to … Just my two cents.

Just saw this PR in vLLM: vllm-project/vllm#3211
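To illustrate that point, a sketch of how the official openai Python client (v1.x) handles non-standard fields; the base URL and model name are placeholders:

```python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

# Passing a non-standard keyword argument directly fails, because create()
# only accepts the documented OpenAI parameters:
# client.completions.create(model="my-model", prompt="Hi", schema={...})
# -> TypeError: create() got an unexpected keyword argument 'schema'

# extra_body, on the other hand, is merged into the JSON payload sent to
# the server, so an OpenAI-compatible server can pick the schema out of it:
completion = client.completions.create(
    model="my-model",
    prompt="Hi",
    extra_body={"schema": {"type": "object"}},
)
```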
Thank you for contributing! In the meantime vLLM integrated Outlines, and you can now use structured generation directly from there.
To use with LangChain / OpenAI, add extra_body={"schema": yourSchema} or extra_body={"regex": yourRegex}. Example:
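A minimal sketch of such a request with the openai client, assuming the server is reachable at http://localhost:8000/v1; the model name and schema are placeholders:

```python
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

your_schema = {
    "type": "object",
    "properties": {"name": {"type": "string"}, "age": {"type": "integer"}},
    "required": ["name", "age"],
}

completion = client.chat.completions.create(
    model="mistralai/Mistral-7B-Instruct-v0.2",  # placeholder model name
    messages=[{"role": "user", "content": "Describe a fictional person."}],
    extra_body={"schema": your_schema},  # or extra_body={"regex": your_regex}
)
print(completion.choices[0].message.content)
```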