LM Studio is a cross-platform (macOS, Windows, Linux) chat application for large language models that can load GGUF-format models (the llama.cpp format). Below is a brief walkthrough using macOS as an example. For other platforms and features, please experiment on your own and consult the official manual for details.
Official download link: LM Studio
- Create a `models` directory and organize the model files in a layout similar to Hugging Face's model hub. You can either copy the models into it or set up symbolic links. For example, the `hfl/llama-3-chinese-8b-instruct` model could be organized in the following directory structure, with the GGUF model (llama.cpp format) at the deepest level:
- hfl
- llama-3-chinese-8b-instruct
- ggml-model-q8_0.gguf
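The layout above can be created from the command line roughly as follows. This is only a sketch: the paths (`~/models` as the LM Studio models directory and `~/Downloads/ggml-model-q8_0.gguf` as the location of your downloaded GGUF file) are assumptions, so adjust them to your setup.

```shell
# Create the <vendor>/<model> directory layout that mirrors the Hugging Face hub
mkdir -p ~/models/hfl/llama-3-chinese-8b-instruct

# Symlink the GGUF file instead of copying it, to save disk space.
# The source path below is an example; point it at your actual file.
ln -s ~/Downloads/ggml-model-q8_0.gguf \
      ~/models/hfl/llama-3-chinese-8b-instruct/ggml-model-q8_0.gguf
```

Copying with `cp` works just as well if you prefer not to keep the original file around.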
- After opening the program, click the folder icon (the last tab) in the left sidebar.
- In the "Local models folder" section, click "change" and point it to the `models` directory created in the previous step.
- Once set up, all recognized models will be listed (if none appear, try restarting the program).
- Click "choose default preset" and select the Llama 3 instruction template.
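For reference, the instruction format that a Llama 3 preset applies looks roughly like the following (this reflects Meta's published Llama 3 chat template; `{system_prompt}` and `{user_message}` are placeholders, and the preset fills them in for you, so you normally never type these tokens yourself):

```
<|begin_of_text|><|start_header_id|>system<|end_header_id|>

{system_prompt}<|eot_id|><|start_header_id|>user<|end_header_id|>

{user_message}<|eot_id|><|start_header_id|>assistant<|end_header_id|>
```

Picking a preset whose template matches the model's training format matters: a mismatched template typically produces degraded or rambling output.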
- Click the conversation button (the third icon) in the left sidebar to enter the chat interface.
- At the top, select a model to load. In the right sidebar you can set the system prompt, enable GPU offloading, adjust the context window size, and so on; in the left sidebar you can start a new conversation.
These steps should get you started running large language models in LM Studio and chatting interactively with the model of your choice!