Support local and custom models #246

leo4life2 · 2024-09-30T05:28:46Z

Issue #165

Done:

- Interface changes for adding a model by its HuggingFace model name
  - UI logic: put the HuggingFace model name in the "add model" text area (e.g. owner/modelName), click Add, select the model name in the dropdown, and start chatting.
- Loading & running a model from HuggingFace

WIP:

- Minor bug in streaming tokenizer decode
- Persisting the model name (will probably use localSession)
- Support for local models

AlexCheema · 2024-09-30T12:08:40Z

exo/api/chatgpt_api.py

+ shard = model_base_shards[chat_request.model].get(self.inference_engine_classname, None)
+ else:
+ # HF models
+ hf_model_url = f"https://huggingface.co/{chat_request.model}"


This needs to use the HF_ENDPOINT environment variable, which was just added.
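
A minimal sketch of what that could look like, assuming the endpoint is read straight from the environment and falls back to the public hub when unset. The helper name `hf_model_url` and the `DEFAULT_HF_ENDPOINT` constant are hypothetical and not part of this PR:

```python
import os

# Assumption: fall back to the public hub when HF_ENDPOINT is unset or empty.
DEFAULT_HF_ENDPOINT = "https://huggingface.co"


def hf_model_url(model_id: str) -> str:
    """Build the model URL from HF_ENDPOINT instead of a hard-coded host.

    `model_id` is the "owner/modelName" string taken from the chat request.
    """
    endpoint = os.environ.get("HF_ENDPOINT") or DEFAULT_HF_ENDPOINT
    # Strip a trailing slash so the joined URL never contains "//".
    return f"{endpoint.rstrip('/')}/{model_id}"
```

With something like this, the hard-coded line in the diff would become `hf_model_url = hf_model_url(chat_request.model)` (or an equivalent call under a different name), so mirrors configured via HF_ENDPOINT are respected.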

AlexCheema · 2024-09-30T12:10:28Z

This looks great.

What we'd really need here to get this merged is automatic sharding of the model. The PyTorch implementation does this automatically; you can see how they do it in #139.
That way, any language model can be supported without having to explicitly write a sharded model implementation.
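
To illustrate the idea only: the sketch below derives contiguous layer ranges from a model's layer count and a relative weight per node (for example, available memory). It is not taken from #139 or from this PR; `LayerRange` and `split_layers` are hypothetical names, and the proportional split is an assumed strategy.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LayerRange:
    """Hypothetical description of the layers one node would run."""
    node: int         # index into the `weights` list
    start_layer: int  # inclusive
    end_layer: int    # inclusive
    n_layers: int     # total layers in the model


def split_layers(n_layers: int, weights: list[float]) -> list[LayerRange]:
    """Split `n_layers` into contiguous ranges proportional to `weights`.

    Nodes whose share rounds down to zero layers are skipped.
    """
    if n_layers <= 0:
        raise ValueError("n_layers must be positive")
    if not weights or any(w <= 0 for w in weights):
        raise ValueError("weights must be a non-empty list of positive numbers")

    total = sum(weights)
    ranges: list[LayerRange] = []
    start = 0
    cumulative = 0.0
    for node, weight in enumerate(weights):
        cumulative += weight
        # The last node always takes whatever remains, so every layer is covered.
        is_last = node == len(weights) - 1
        end = n_layers if is_last else round(n_layers * cumulative / total)
        if end > start:
            ranges.append(LayerRange(node, start, end - 1, n_layers))
            start = end
    return ranges
```

Because the ranges depend only on the layer count, the same function would apply to any decoder-style model whose layers can be loaded by index, which is what removes the need for a per-model sharded implementation.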

ignore .vscode

73ffde0

leo4life2 closed this Sep 30, 2024

leo4life2 reopened this Sep 30, 2024

Interface update & use huggingface models

b19c7f4

leo4life2 force-pushed the main branch from 425d78d to b19c7f4 on September 30, 2024 05:29

AlexCheema reviewed Sep 30, 2024
