Update the Dockerfile of the LiteLLM Proxy server and some refactorings #628
Conversation
RUN python proxy_cli.py --config -f /app/secrets_template.toml
RUN python proxy_cli.py
We shouldn't `RUN python proxy_cli.py` here, as it will block the Docker build at this step and the build will never finish.
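Roughly speaking, the CLI ends up in a blocking server loop, which is why a build-time `RUN` of it never returns. A minimal illustrative sketch (not the actual proxy_cli.py; the app, host, and port are assumptions):

```python
# Illustrative sketch of why `RUN python proxy_cli.py` hangs a Docker build:
# the CLI eventually calls uvicorn.run(), which blocks until the server stops.
import uvicorn
from fastapi import FastAPI

app = FastAPI()

if __name__ == "__main__":
    # Fine as the container's runtime command (CMD/ENTRYPOINT),
    # but a RUN step executing this will never complete.
    uvicorn.run(app, host="0.0.0.0", port=8000)
```

Starting the server at container start-up (via `CMD`/`ENTRYPOINT`) avoids the problem.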
@@ -7,7 +7,6 @@
import operator

config_filename = "litellm.secrets.toml"
pkg_config_filename = "template.secrets.toml"
The `pkg_config_filename` variable is not used anywhere, so it can be removed.
except Exception as e:
    pass
Errors that occur while loading the configuration should not be swallowed; they should be raised so the process fails fast.
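A minimal sketch of the fail-fast behaviour I mean; the `load_config` helper and its argument are illustrative, not the actual proxy code:

```python
import tomli  # TOML parser already listed in the proxy requirements

def load_config(path: str) -> dict:
    # No try/except here: a missing file or invalid TOML should crash the
    # process immediately instead of being silently ignored.
    with open(path, "rb") as f:
        return tomli.load(f)
```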
@router.post("/v1/completions") | ||
@router.post("/completions") |
The logic of the two routes is nearly identical; one passes default parameters to `litellm_completion` and the other does not, which is confusing. I've combined them into a single function here (see the sketch below).
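For reference, a minimal sketch of the pattern (not the exact PR code): both paths are served by one handler via stacked route decorators, with an illustrative stub standing in for the real `litellm_completion`:

```python
from fastapi import APIRouter, Request

router = APIRouter()

def litellm_completion(data: dict) -> dict:
    # Placeholder for the shared completion logic in proxy_server.py.
    return data

@router.post("/v1/completions")
@router.post("/completions")
async def completion(request: Request):
    # One handler serves both paths, so defaults are applied in a single place.
    data = await request.json()
    return litellm_completion(data)
```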
@router.post("/v1/chat/completions") | ||
async def v1_chat_completion(request: Request): | ||
@router.post("/chat/completions") |
Same as above: the /v1/chat/completions and /chat/completions routes are combined into one function.
backoff
boto3
uvicorn
fastapi
tomli
appdirs
tomli-w
This requirements file looks like it's for the Docker image, and I want our Docker images to contain as many packages as possible so that users don't need to rebuild the image just to install new dependencies.
# add_function_to_prompt = true # e.g: Ollama doesn't support functions, so add it to the prompt instead
# drop_params = true # drop any params not supported by the provider (e.g. Ollama)
In TOML, `True` is the wrong way to write a boolean; it must be lowercase (`true`/`false`).
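For example, using the `tomli` parser already in the requirements (the key name is just taken from the snippet above):

```python
import tomli

print(tomli.loads("drop_params = true"))  # {'drop_params': True}

try:
    tomli.loads("drop_params = True")  # capitalised booleans are not valid TOML
except tomli.TOMLDecodeError as err:
    print(f"parse error: {err}")
```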
@coconut49 lgtm!
Your changes to requirements.txt are interesting - presumably this is to deal with the pip install we run only for proxy-specific packages. @coconut49 You mentioned not wanting your users to deal with this. Can you chat for 10 minutes this week? I'm trying to learn about prod/proxy use-cases.
Hi @krrishdholakia
For example, TypingMind, which I'm using, allows me to add as many LLMs as I want that conform to the OpenAI API format.
Description
The purpose of this PR is to fix a number of issues I encountered while trying to deploy the LiteLLM Proxy in Docker, and to do some refactoring of the code.
Tests
I tested the chat completion feature with OpenAI, Azure OpenAI, Bedrock, and OpenRouter in my environment, and all of them worked.