Mixtral-8x7b-32kseqlen #2804

Closed
surak opened this issue Dec 11, 2023 · 4 comments

Comments

@surak
Collaborator

surak commented Dec 11, 2023

Hi,

I just managed to download the whole thing and upload it to the inference servers.

Is there support for running Mixtral on FastChat?

Relevant links:
huggingface/transformers#27942
huggingface/transformers@accccdd

@graz68a

graz68a commented Dec 11, 2023

Hope to see Mixtral-8x7b too!

@infwinston
Member

Latest vllm should support it already. Would be awesome if you guys help us test if vllm_worker.py works for Mixtral-8x7b
https://github.com/vllm-project/vllm
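
For anyone who wants to help test, here is a rough sketch of how the vLLM worker could be launched. It only builds and prints the command line, so nothing is started by accident. The model path `mistralai/Mixtral-8x7B-v0.1` and the GPU count are assumptions (the thread does not say which checkpoint or hardware was used), and `build_vllm_worker_cmd` is a hypothetical helper, not part of FastChat. A FastChat controller is assumed to be running already.

```python
import shlex
import sys


def build_vllm_worker_cmd(model_path, num_gpus=1):
    """Build the command that starts FastChat's vLLM worker for one model.

    model_path and num_gpus are placeholders; adjust them to your setup.
    """
    return [
        sys.executable,
        "-m",
        "fastchat.serve.vllm_worker",
        "--model-path",
        model_path,
        "--num-gpus",
        str(num_gpus),
    ]


if __name__ == "__main__":
    # Assumed checkpoint and GPU count, not taken from this thread.
    cmd = build_vllm_worker_cmd("mistralai/Mixtral-8x7B-v0.1", num_gpus=8)
    # Print instead of executing, so the command can be reviewed first.
    print(shlex.join(cmd))
```

To actually start the worker, pass the returned list to `subprocess.run(cmd, check=True)` once the controller is up.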

@egortolmachev

> Latest vllm should support it already. Would be awesome if you guys help us test if vllm_worker.py works for Mixtral-8x7b https://github.com/vllm-project/vllm

No, it doesn't)))

@surak
Collaborator Author

surak commented Dec 12, 2023

It works! We might need to update the dependency versions of quite a few packages, as I had to do a fair bit of juggling with "pip install --no-deps" in the middle of the night to get it running.
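
Since the exact versions are not listed here, a small sketch like the following could help record what a working environment actually has installed before pinning anything. The package list is a guess at the likely candidates (`fschat` is FastChat's name on PyPI), not something confirmed in this thread.

```python
from importlib import metadata


def installed_versions(packages):
    """Return {package name: installed version, or None if not installed}."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Assumed list of relevant packages; extend as needed.
    candidates = ["fschat", "vllm", "transformers", "torch"]
    for name, version in installed_versions(candidates).items():
        print(f"{name}: {version or 'not installed'}")
```

Comparing this output between a working and a broken environment should show which dependency pins need updating.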

surak closed this as completed Dec 12, 2023