This repo (on the main branch) is already included in openllm by default.
If you want more up-to-date but untested models, add our nightly branch:
openllm repo add nightly https://github.com/bentoml/openllm@nightly

| Model | Version | Huggingface Link |
| --- | --- | --- |
| llama3 | 70b-instruct-awq-4bit-e968 | HF Link |
| llama3 | 70b-instruct-fp16-6aed | HF Link |
| llama3 | 8b-instruct-awq-4bit-f9de | HF Link |
| llama3 | 8b-instruct-fp16-f703 | HF Link |

| Model | Version | Huggingface Link |
| --- | --- | --- |
| phi3 | 3.8b-instruct-fp16-30b8 | HF Link |
| phi3 | 3.8b-instruct-ggml-q4-f5db | HF Link |

| Model | Version | Huggingface Link |
| --- | --- | --- |
| mistral | 7b-instruct-awq-4bit-0850 | HF Link |
| mistral | 7b-instruct-fp16-ac2b | HF Link |

| Model | Version | Huggingface Link |
| --- | --- | --- |
| qwen2 | 0.5b-instruct-fp16-fcc6 | HF Link |
| qwen2 | 1.5b-instruct-fp16-50d8 | HF Link |
| qwen2 | 57b-a14b-instruct-fp16-3f06 | HF Link |
| qwen2 | 72b-instruct-awq-4bit-15fd | HF Link |
| qwen2 | 72b-instruct-fp16-7b44 | HF Link |
| qwen2 | 7b-instruct-awq-4bit-ce1b | HF Link |
| qwen2 | 7b-instruct-fp16-844c | HF Link |

| Model | Version | Huggingface Link |
| --- | --- | --- |
| gemma | 2b-instruct-fp16-0856 | HF Link |
| gemma | 7b-instruct-awq-4bit-d11b | HF Link |
| gemma | 7b-instruct-fp16-3e1c | HF Link |

| Model | Version | Huggingface Link |
| --- | --- | --- |
| llama2 | 13b-chat-fp16-921b | HF Link |
| llama2 | 70b-chat-fp16-258c | HF Link |
| llama2 | 7b-chat-awq-4bit-8df2 | HF Link |
| llama2 | 7b-chat-fp16-2e3a | HF Link |

| Model | Version | Huggingface Link |
| --- | --- | --- |
| mixtral | 8x7b-instruct-v0.1-awq-4bit-2953 | HF Link |
| mixtral | 8x7b-instruct-v0.1-fp16-71c6 | HF Link |
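
The version strings listed above appear to follow a common shape: a parameter-size prefix, a descriptor for the variant and precision, and a four-character hexadecimal suffix. That shape is only inferred from the listed entries and is not a documented format. As a minimal sketch under that assumption, the hypothetical helper `parse_version` below (not part of openllm) splits a version string into those three parts:

```python
import re
from typing import NamedTuple


class VersionTag(NamedTuple):
    """Parts of a version string, as inferred from the tables (an assumption)."""
    size: str        # leading size token, e.g. "70b" or "8x7b"
    descriptor: str  # everything between size and suffix, e.g. "instruct-awq-4bit"
    suffix: str      # trailing four-character hex token, e.g. "e968"


# Assumed pattern: <size>-<descriptor>-<4 hex chars>. Hypothetical, not an official spec.
_TAG_RE = re.compile(r"^(?P<size>[^-]+)-(?P<descriptor>.+)-(?P<suffix>[0-9a-f]{4})$")


def parse_version(tag: str) -> VersionTag:
    """Split a version string into size, descriptor, and suffix.

    Raises ValueError if the string does not match the assumed pattern.
    """
    match = _TAG_RE.match(tag.strip())
    if match is None:
        raise ValueError(f"unrecognized version string: {tag!r}")
    return VersionTag(match["size"], match["descriptor"], match["suffix"])


if __name__ == "__main__":
    for tag in ("70b-instruct-awq-4bit-e968", "8x7b-instruct-v0.1-fp16-71c6"):
        print(parse_version(tag))
```

The descriptor is deliberately left unparsed, since the listed entries mix variant names (`instruct`, `chat`), release markers (`v0.1`), and precision or quantization labels (`fp16`, `awq-4bit`, `ggml-q4`) without a fixed order.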