
[Feature]: LoRA support for Pixtral #8802

Open
1 task done
Tracked by #4194
spring-anth opened this issue Sep 25, 2024 · 12 comments

Comments

@spring-anth

🚀 The feature, motivation and pitch

I have finetuned the linear layers of Pixtral on my own dataset and would like to host the LoRA adapters, as is possible for Mistral. It would be great if this were supported in the future.

Related issue: #8685, as the base model I used for finetuning is the HF version.
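For reference, here is roughly the usage I have in mind, sketched against vLLM's existing LoRA API. The adapter path, adapter name, and rank are placeholders, and whether PixtralForConditionalGeneration accepts enable_lora is exactly what this issue asks for:

```python
# Hypothetical sketch: hosting a Pixtral LoRA adapter the same way it already
# works for Mistral. All paths and names below are placeholders.
from vllm import LLM, SamplingParams
from vllm.lora.request import LoRARequest

llm = LLM(
    model="mistralai/Pixtral-12B-2409",
    tokenizer_mode="mistral",
    enable_lora=True,      # the capability requested in this issue
    max_lora_rank=16,      # assumed rank of the finetuned adapter
)

outputs = llm.generate(
    ["Describe the style of the training data."],
    SamplingParams(max_tokens=64),
    lora_request=LoRARequest("pixtral-lora", 1, "/path/to/lora_adapter"),
)
print(outputs[0].outputs[0].text)
```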

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.
@DarkLight1337
Member

LoRA support for VLMs in general is still WIP. cc @jeejeelee

@jeejeelee
Contributor

> LoRA support for VLMs in general is still WIP. cc @jeejeelee

Thanks for the ping, I should be able to complete the temporary solution for LoRA support in VL models this week.

@jeejeelee
Contributor

@spring-anth I have completed the integration of Pixtral support for LoRA, see: https://github.com/jeejeelee/vllm/tree/pixtral-support-lora. Could you please verify this locally? I don't have enough resources to train the LoRA model myself.

@spring-anth
Author

@jeejeelee Thank you! I checked out your branch and set it as my current vLLM implementation via

```
git clone https://github.com/jeejeelee/vllm.git
cd vllm
python python_only_dev.py
```

Unfortunately I get this ValueError:

```
[rank0]:     self.model = get_model(model_config=self.model_config,
[rank0]:                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/home/docker/.local/lib/python3.11/site-packages/vllm/model_executor/model_loader/__init__.py", line 19, in get_model
[rank0]:     return loader.load_model(model_config=model_config,
[rank0]:            ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/home/docker/.local/lib/python3.11/site-packages/vllm/model_executor/model_loader/loader.py", line 399, in load_model
[rank0]:     model = _initialize_model(model_config, self.load_config,
[rank0]:             ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/home/docker/.local/lib/python3.11/site-packages/vllm/model_executor/model_loader/loader.py", line 176, in _initialize_model
[rank0]:     return build_model(
[rank0]:            ^^^^^^^^^^^^
[rank0]:   File "/home/docker/.local/lib/python3.11/site-packages/vllm/model_executor/model_loader/loader.py", line 157, in build_model
[rank0]:     extra_kwargs = _get_model_initialization_kwargs(model_class, lora_config,
[rank0]:                    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/home/docker/.local/lib/python3.11/site-packages/vllm/model_executor/model_loader/loader.py", line 134, in _get_model_initialization_kwargs
[rank0]:     raise ValueError(
[rank0]: ValueError: Model PixtralForConditionalGeneration does not support LoRA, but LoRA is enabled. Support for this model may be added in the future. If this is important to you, please open an issue on github.
```

@jeejeelee
Contributor

@spring-anth Hi, which branch are you using? Is it pixtral-support-lora?

@spring-anth
Author

@jeejeelee You were right, I was on the wrong branch, silly mistake. Unfortunately I currently can't test whether your change works, as I trained Pixtral with the transformers-compatible version. Therefore I can only use the LoRA weights for Pixtral once the transformers version of Pixtral is supported in vLLM (which is work in progress). My current workaround is merging the weights and transforming the model back to the vLLM-compatible version.
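For anyone hitting the same limitation, this is roughly what the merge step of that workaround looks like with PEFT. The adapter and output paths are placeholders, the base model class is an assumption based on the HF-format checkpoint, and converting the merged model back to the Mistral-format layout is a separate step not shown here:

```python
# Minimal sketch of the workaround: fold the PEFT LoRA adapter into the
# HF-format Pixtral base model, then save the merged weights.
import torch
from peft import PeftModel
from transformers import LlavaForConditionalGeneration

base = LlavaForConditionalGeneration.from_pretrained(
    "mistral-community/pixtral-12b", torch_dtype=torch.bfloat16
)
model = PeftModel.from_pretrained(base, "/path/to/lora_adapter")   # placeholder path
merged = model.merge_and_unload()   # applies the LoRA deltas to the base weights
merged.save_pretrained("/path/to/pixtral-merged")                  # placeholder path
```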

@jeejeelee
Contributor

@spring-anth Currently, vLLM only supports LoRA adapters trained with PEFT.

@spring-anth
Author

@jeejeelee Yes, that's not what I meant. I did train with PEFT, but the training is based on the HF Transformers version of Pixtral (https://huggingface.co/mistral-community/pixtral-12b), which uses a different structure than the vLLM-supported version (https://huggingface.co/mistralai/Pixtral-12B-2409).

@tensimixt

tensimixt commented Oct 22, 2024

@jeejeelee does this work with #5036? If so, I can test this week whether inference with mistralai/Pixtral-12b-2409 with a LoRA adapter works. Or should I use another PR to test vLLM inference of this?

Or I can just use this branch: https://github.com/jeejeelee/vllm/tree/pixtral-support-lora

Will this work with python -m vllm.entrypoints.openai.api_server where the model is set to Pixtral and the LoRA modules to the Pixtral LoRA adapter?

Thank you!

@jeejeelee
Contributor

> @jeejeelee does this work with #5036? If so, I can test this week whether inference with mistralai/Pixtral-12b-2409 with a LoRA adapter works. Or should I use another PR to test vLLM inference of this?
>
> Or I can just use this branch: https://github.com/jeejeelee/vllm/tree/pixtral-support-lora
>
> Will this work with python -m vllm.entrypoints.openai.api_server where the model is set to Pixtral and the LoRA modules to the Pixtral LoRA adapter?
>
> Thank you!

I think it should work, see: https://docs.vllm.ai/en/latest/models/lora.html#serving-lora-adapters
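Following that doc, the flow would roughly be to register the adapter when launching the server and then address it by name from an OpenAI-compatible client. A hedged sketch (the adapter name and paths are placeholders, and Pixtral itself still needs the pixtral-support-lora branch above):

```python
# Hypothetical client-side sketch for a LoRA adapter served by vLLM's
# OpenAI-compatible server, assuming it was started roughly like:
#   python -m vllm.entrypoints.openai.api_server \
#       --model mistralai/Pixtral-12B-2409 --tokenizer-mode mistral \
#       --enable-lora --lora-modules pixtral-lora=/path/to/lora_adapter
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

response = client.chat.completions.create(
    model="pixtral-lora",  # the name the adapter was registered under via --lora-modules
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "What is shown in this image?"},
            {"type": "image_url", "image_url": {"url": "https://example.com/example.png"}},
        ],
    }],
    max_tokens=64,
)
print(response.choices[0].message.content)
```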

@tensimixt

tensimixt commented Oct 24, 2024

@jeejeelee When doing git checkout pixtral-support-lora and then pip install -e ., does it build correctly for you, or does it crash or take a very long time to build? When doing git checkout pr-5036, building vLLM takes only 10-15 minutes, but the new vLLM build is taking very long: it has been 45 minutes and it is still building. Is there a way to make the build faster? Thank you!

@jeejeelee
Contributor

It also takes me a long time, unless I compile on high-performance CPU servers.
