
ENH: support more GPTQ and AWQ format for some models #1243

Merged · 11 commits merged into xorbitsai:main on Apr 5, 2024
Conversation

xiaodouzi666 (Contributor)

No description provided.

@XprobeBot added the "enhancement" (New feature or request) label · Apr 3, 2024
@XprobeBot added this to the v0.10.1 milestone · Apr 3, 2024
@xiaodouzi666 reopened this · Apr 3, 2024
Review comments on xinference/model/llm/llm_family.json (7 threads, resolved; 6 marked outdated).
qinxuye (Contributor) commented on Apr 3, 2024:

Missing llama-2-chat 70B AWQ, and 13B AWQ & GPTQ.
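For context, the kind of entry this PR adds to `llm_family.json` registers a quantized variant of an existing model family. A minimal sketch of such a spec follows; the field names are assumptions based on typical Xinference model-spec entries of that era, and the `model_id` is a hypothetical example, not taken from this diff:

```json
{
  "model_format": "awq",
  "model_size_in_billions": 70,
  "quantizations": ["Int4"],
  "model_id": "TheBloke/Llama-2-70B-Chat-AWQ",
  "model_revision": "main"
}
```

Adding GPTQ support for the same size would be a sibling entry with `"model_format": "gptq"` and a matching GPTQ repository id, which is why the review above flags the specific size/format combinations still absent from the file.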

@qinxuye changed the title from "ENH: Modify llm_family.json" to "ENH: support more GPTQ and AWQ format for some models" · Apr 3, 2024
qinxuye (Contributor) left a review comment:

LGTM

@qinxuye merged commit 3b922b6 into xorbitsai:main · Apr 5, 2024
10 of 12 checks passed
Labels
enhancement New feature or request
4 participants