Add support for Mixtral models. #480

LaaZa · 2023-12-12T01:12:38Z

Adds support for mistralai/Mixtral-8x7B-v0.1

Quantization/inference tested also with hf-internal-testing/Mixtral-tiny

Requires #479 for working inference.

transformers>=4.36.0 + for transformers inference requires huggingface/transformers#27956

Fixes #476

This reverts commit f5ef1cd.

fxmarty · 2023-12-13T15:57:21Z

Thank you, can you add a test for it? Not necessarily for the quantization, but just for running the model.

For example in tests/test_q4.py

LaaZa · 2023-12-13T16:37:14Z

Added test using TheBlokeAI/Mixtral-tiny-GPTQ

fxmarty

Thanks a lot! Can you solve conflicts as well, sorry!

I'll give a try to the test and it's good to go. I'll do a release by Friday.

LaaZa · 2023-12-13T16:42:07Z

Can you add StableLMEpoch first?

fxmarty · 2023-12-13T17:02:11Z

Done thanks!

# Conflicts: # auto_gptq/modeling/__init__.py # auto_gptq/modeling/auto.py

LaaZa · 2023-12-13T17:27:59Z

Sorry, my merges broke. But I got it now.

LaaZa added 5 commits December 11, 2023 15:41

Support for Mixtral models.

eec6fbe

Test fix.

f5ef1cd

Revert "Test fix."

7939908

This reverts commit f5ef1cd.

Different module layout.

e322290

Swap around module layout.

6996f8a

LaaZa added 2 commits December 13, 2023 18:08

Merge branch 'main' into Mixtral

5c424f6

Add Mixtral generation test.

171b50e

fxmarty approved these changes Dec 13, 2023

View reviewed changes

Merge branch 'main' into Mixtral

6fcd181

# Conflicts: # auto_gptq/modeling/__init__.py # auto_gptq/modeling/auto.py

fxmarty merged commit 1ce453f into AutoGPTQ:main Dec 13, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for Mixtral models. #480

Add support for Mixtral models. #480

LaaZa commented Dec 12, 2023

fxmarty commented Dec 13, 2023 •

edited

Loading

LaaZa commented Dec 13, 2023

fxmarty left a comment

LaaZa commented Dec 13, 2023

fxmarty commented Dec 13, 2023

LaaZa commented Dec 13, 2023

Add support for Mixtral models. #480

Add support for Mixtral models. #480

Conversation

LaaZa commented Dec 12, 2023

fxmarty commented Dec 13, 2023 • edited Loading

LaaZa commented Dec 13, 2023

fxmarty left a comment

Choose a reason for hiding this comment

LaaZa commented Dec 13, 2023

fxmarty commented Dec 13, 2023

LaaZa commented Dec 13, 2023

fxmarty commented Dec 13, 2023 •

edited

Loading