Using HostedGPT with a Local LLM like Gemma 2 9B #471
krschacht
announced in Announcements
-
We recently added the ability for HostedGPT to point to any API server, including local servers. You can now run the app completely free if you download a model to your computer.
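For anyone pointing it at a local server, a quick way to confirm the endpoint is reachable is to list the models it exposes. Here's a minimal sketch in Python, assuming the requests library and a server on port 1234 (LM Studio's default, used in the steps below):

```python
# List the models a local OpenAI-compatible server (e.g. LM Studio) exposes.
# Assumes the server is already running on its default port, 1234.
import requests

resp = requests.get("http://localhost:1234/v1/models")
resp.raise_for_status()
for model in resp.json()["data"]:
    print(model["id"])  # e.g. bartowski/gemma-2-9b-it-GGUF
```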
I've been trying out the new Gemma 2 models. My computer is not powerful enough to run the 27B model, so I'm using Gemma 2 9B. Here are the steps, in case anyone would like to try it.
I recommend LM Studio, but a lot of people use Llama.cpp / Llamafile, so those should work too.
1. Download LM Studio and, within it, download the model bartowski/gemma-2-9b-it-GGUF, then start its local server (it serves at http://localhost:1234/).
2. In HostedGPT, add a new API service: name it "LM Studio", set the URL to http://localhost:1234/, and choose the "openai" driver, since LM Studio exposes an OpenAI-compatible API.
3. Add a new language model: select the "LM Studio" API service and name the model "Gemma 2 9b".
4. Create a new assistant and select "Gemma 2 9b" as its model.
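If you want to sanity-check the server outside HostedGPT first, here's a minimal sketch using the openai Python client. The model string and the placeholder API key are assumptions: LM Studio accepts any key, and you should pass whichever model id your server reports.

```python
# Minimal smoke test against a local OpenAI-compatible endpoint.
# Assumptions: LM Studio's default port 1234, and the model id
# reported by /v1/models; the API key can be any non-empty string.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

response = client.chat.completions.create(
    model="bartowski/gemma-2-9b-it-GGUF",  # use the id your server reports
    messages=[{"role": "user", "content": "Say hello in one short sentence."}],
)
print(response.choices[0].message.content)
```

If this prints a reply, HostedGPT's "openai" driver should work against the same base URL.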
If you try this, I'm eager to hear how well it works — both the setup and the model itself.
-
It works great! If the same model is provided by multiple API services, the user could be confused about which one to choose from the list in the new assistant screen. Adding a filter for "API services" on that screen, or showing the API service in the list (e.g. "Gemma 2 9b - LM Studio"), could avoid this ambiguity.