
CI for add gptq and awq int4 support in intel platform #2494

Open · wants to merge 4 commits into main

Conversation

ErikKaum (Member) commented Sep 5, 2024

Run CI for PR #2444

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
sywangyi (Contributor) commented Sep 6, 2024

I checked the 3 failure cases and found they are related to the bug fix at https://github.com/huggingface/text-generation-inference/pull/2444/files#diff-d8aff332cf9104dd7460d2f53575239dc1f4bcdd374e575b8a504568bfc2e078R325, which causes "Narsil/starcoder-gptq" with 2 TP to stop using the exllama kernel. If you check this model with 1 TP, which does use the exllama kernel, the generation result is close to the current 2 TP result that no longer uses exllama.
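
For context on the kernel selection: TGI decides per layer whether the exllama GPTQ kernel can serve a quantized weight. The sketch below is a simplified, hypothetical illustration of that kind of guard (the names `GPTQParams` and `can_use_exllama` are made up, and the exact condition changed by #2444 lives in the linked diff); it shows how tensor-parallel sharding can rule exllama out while a single rank keeps it:

```python
# Hypothetical sketch of an exllama-eligibility check for GPTQ weights;
# GPTQParams and can_use_exllama are illustrative names, not TGI's actual API.
from dataclasses import dataclass


@dataclass
class GPTQParams:
    bits: int        # quantization bit width
    groupsize: int   # -1 means a single group spanning the input dimension
    desc_act: bool   # act-order: columns reordered by activation magnitude


def can_use_exllama(params: GPTQParams, world_size: int) -> bool:
    """Return True if the exllama kernel may serve this layer."""
    if params.bits != 4:
        # The exllama kernels only implement 4-bit GPTQ.
        return False
    if world_size > 1 and params.desc_act and params.groupsize != -1:
        # Row-sharding an act-order, grouped weight across TP ranks would
        # split quantization groups, so fall back to a slower kernel.
        return False
    return True


# One TP rank keeps exllama; two ranks may lose it, which is the
# behavior change discussed in the comment above.
params = GPTQParams(bits=4, groupsize=128, desc_act=True)
assert can_use_exllama(params, world_size=1)
assert not can_use_exllama(params, world_size=2)
```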

sywangyi (Contributor) commented Sep 6, 2024

@Narsil to comment.

Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
sywangyi (Contributor) commented
The failures seem unrelated to this PR:
ERROR integration-tests/models/test_flash_medusa.py::test_flash_medusa_simple - RuntimeError: Launcher crashed
ERROR integration-tests/models/test_flash_medusa.py::test_flash_medusa_all_params - RuntimeError: Launcher crashed
ERROR integration-tests/models/test_flash_medusa.py::test_flash_medusa_load - RuntimeError: Launcher crashed
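
For anyone wanting to confirm this locally, the failing tests can be re-run directly with pytest. A minimal sketch, assuming a text-generation-inference checkout with the integration-test dependencies installed and model access configured the way CI has it:

```python
# Hedged reproduction sketch: re-run the three failing tests with pytest.
# The node IDs are copied from the error log above.
import sys

import pytest

FAILING = [
    "integration-tests/models/test_flash_medusa.py::test_flash_medusa_simple",
    "integration-tests/models/test_flash_medusa.py::test_flash_medusa_all_params",
    "integration-tests/models/test_flash_medusa.py::test_flash_medusa_load",
]

# "RuntimeError: Launcher crashed" is raised while the test fixture starts
# text-generation-launcher, so -s keeps the launcher output visible.
sys.exit(pytest.main(["-s", "-vv", *FAILING]))
```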
