
[Feature]: local-only load model cost #1434

Closed

timlrx opened this issue Jan 13, 2024 · 11 comments
Labels
enhancement New feature or request

Comments

@timlrx

timlrx commented Jan 13, 2024

The Feature

Currently, model_cost is initialised by default by fetching the model price JSON file from the repository.

Lazy-loading model_cost would allow a user to override it and would eliminate the additional network request when it is not required. Sketch of a suggested implementation:

model_cost = None

def load_model_cost():
    global model_cost
    if model_cost is None:
        # default fallback behaviour: fetch the map from the repository
        model_cost = get_model_cost_map(url=model_cost_map_url)
For every function that uses litellm.model_cost, add a load_model_cost() call. Affected functions include cost_per_token, register_model, get_max_tokens, get_model_info, and trim_messages.
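As an illustration, one affected function might apply the guard like this (a minimal sketch; the function body and pricing keys are assumptions for illustration, not litellm's actual implementation):

def cost_per_token(model: str, prompt_tokens: int, completion_tokens: int):
    load_model_cost()  # populates model_cost on first use only
    pricing = model_cost.get(model, {})
    prompt_cost = prompt_tokens * pricing.get("input_cost_per_token", 0.0)
    completion_cost = completion_tokens * pricing.get("output_cost_per_token", 0.0)
    return prompt_cost, completion_cost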

Thanks for considering!

Motivation, pitch

  • I would like to load model_cost locally, but there does not currently seem to be a way to do so, since it is set when litellm is imported
  • Deferring the load might also benefit other users who do not use the cost-related functions

Twitter / LinkedIn details

No response

@timlrx timlrx added the enhancement New feature or request label Jan 13, 2024
@krrishdholakia
Contributor

krrishdholakia commented Jan 13, 2024

Hey @timlrx, we already support local model cost: https://docs.litellm.ai/docs/completion/token_usage#8-register_model

Is this what you were looking for?

--
Loading the map is how we add support for new models without requiring users to upgrade versions each time.

@timlrx
Author

timlrx commented Jan 13, 2024

Hi @krrishdholakia, not quite. Given the following code:

import litellm

litellm.register_model(model_cost={"gpt-4": {"max_tokens": 8192}})

Because litellm's init runs first, the user has to pull the model cost map from GitHub before registering a new model or overriding the model_cost object.

Hence, I am proposing to load the model cost map only when it is needed, keeping the current logic (pull from GitHub, so the user does not need to upgrade manually). This would also allow a user to override model_cost directly, so that no network request needs to be made, as sketched below.
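A minimal sketch of the intended usage under the proposed scheme (the pricing keys and figures are illustrative only):

import litellm

# Set a fully local map before any cost function runs; under the
# proposed lazy loading, later calls would find model_cost already
# set and skip the GitHub fetch entirely.
litellm.model_cost = {
    "gpt-4": {
        "max_tokens": 8192,
        "input_cost_per_token": 0.00003,   # illustrative figure
        "output_cost_per_token": 0.00006,  # illustrative figure
    }
}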

@krrishdholakia
Contributor

@timlrx seeing this late - sorry! Can we do a quick call on this? I want to understand this better.

Attaching my calendly if that helps - https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat

@krrishdholakia
Contributor

Hi @timlrx, is your issue now solved with https://docs.litellm.ai/docs/proxy/custom_pricing?

If not, can you help me understand what the problem you're currently facing is?

@timlrx
Author

timlrx commented Jan 25, 2024

Sorry for the slow reply, but it would be pretty easy for me to show over a call. Let's chat tomorrow (might be later today for you).

@Manouchehri
Collaborator

Related side-comment: eventually (in a few months) we will be using LiteLLM in an "offline"/isolated environment. (To be more specific, we will have access to api.openai.com and to our VMs, but nothing else; i.e., downloading more stuff at runtime from GitHub.com will not work.)

I think the way to test this would be to run a CI/CD test on LiteLLM with firewall rules applied at runtime.

@krrishdholakia krrishdholakia changed the title [Feature]: Lazy load model cost [Feature]: local-only load model cost Feb 2, 2024
@krrishdholakia
Contributor

updating title based on conversation

@krrishdholakia
Contributor

This is now supported - ec427ae

If you are behind a firewall and just want to use the local copy of the model cost map, you can do so like this:

export LITELLM_LOCAL_MODEL_COST_MAP="True"

Note: this means you will need to upgrade litellm to get updated pricing and newer models.
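The same flag can also be set from Python, assuming it is read when litellm is imported (a sketch under that assumption, not verified against every version):

import os

# Assumption: LITELLM_LOCAL_MODEL_COST_MAP is read at import time,
# so it must be set before litellm is imported.
os.environ["LITELLM_LOCAL_MODEL_COST_MAP"] = "True"

import litellm  # uses the bundled local model cost map; no network fetch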

@krrishdholakia
Contributor

@timlrx feel free to close the issue if this solves your problem

@timlrx
Author

timlrx commented Feb 2, 2024

@krrishdholakia, yes, the changes look good to me, thanks!

@Manouchehri
Collaborator

Thanks! I'm going to set it on my Cloud Run deployment as well, since the extra fetch probably isn't helping cold start times (even if the impact is small). =P
