Skip to content

zak23/zaks-openllm-models

 
 

Repository files navigation

Zaks repository of openllm

This repo (on main branch) is already included by openllm by default.

If you want more up-to-date untested models, please add our nightly branch.

openllm repo add nightly https://github.com/bentoml/openllm@nightly

Supported Models

Table of Contents


Llama-3

Model Version Huggingface Link
llama3 70b-instruct-awq-4bit-e968 HF Link
llama3 70b-instruct-fp16-6aed HF Link
llama3 8b-instruct-awq-4bit-f9de HF Link
llama3 8b-instruct-fp16-f703 HF Link

Phi-3

Model Version Huggingface Link
phi3 3.8b-instruct-fp16-30b8 HF Link
phi3 3.8b-instruct-ggml-q4-f5db HF Link

Mistral

Model Version Huggingface Link
mistral 7b-instruct-awq-4bit-0850 HF Link
mistral 7b-instruct-fp16-ac2b HF Link

Qwen-2

Model Version Huggingface Link
qwen2 0.5b-instruct-fp16-fcc6 HF Link
qwen2 1.5b-instruct-fp16-50d8 HF Link
qwen2 57b-a14b-instruct-fp16-3f06 HF Link
qwen2 72b-instruct-awq-4bit-15fd HF Link
qwen2 72b-instruct-fp16-7b44 HF Link
qwen2 7b-instruct-awq-4bit-ce1b HF Link
qwen2 7b-instruct-fp16-844c HF Link

Gemma

Model Version Huggingface Link
gemma 2b-instruct-fp16-0856 HF Link
gemma 7b-instruct-awq-4bit-d11b HF Link
gemma 7b-instruct-fp16-3e1c HF Link

Llama-2

Model Version Huggingface Link
llama2 13b-chat-fp16-921b HF Link
llama2 70b-chat-fp16-258c HF Link
llama2 7b-chat-awq-4bit-8df2 HF Link
llama2 7b-chat-fp16-2e3a HF Link

Mixtral

Model Version Huggingface Link
mixtral 8x7b-instruct-v0.1-awq-4bit-2953 HF Link
mixtral 8x7b-instruct-v0.1-fp16-71c6 HF Link

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 63.9%
  • Python 17.7%
  • Jinja 14.6%
  • TypeScript 2.6%
  • CSS 0.5%
  • Smarty 0.5%
  • Other 0.2%