Please add lora support for higher ranks and alpha values #2847

Open
parikshitsaikia1619 opened this issue Feb 13, 2024 · 13 comments

@parikshitsaikia1619

ValueError: LoRA rank 64 is greater than max_lora_rank 16.

@SuperBruceJia

Mark

@Peter-Devine

Bump

@dspoka

dspoka commented Mar 16, 2024

It's not super well documented, but you just need to pass "--max-lora-rank 64" (or whatever rank you need) when serving, since the default is 16.

python -m vllm.entrypoints.openai.api_server \
    --max-lora-rank 64 \
    --model model_name \
    --enable-lora \
    --lora-modules lora-name=lora_path
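
For anyone wiring this up from a client, here is a minimal sketch of calling the server started above with the OpenAI Python client. The port, prompt, and "lora-name" below are just placeholders matching the command; the adapter name passed to --lora-modules is what goes in the model field.

from openai import OpenAI

# Point the client at the vLLM OpenAI-compatible server started above.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

# The adapter name from --lora-modules is used as the model name, so the
# request is routed through that LoRA adapter.
completion = client.completions.create(
    model="lora-name",
    prompt="Hello, my name is",
    max_tokens=32,
)
print(completion.choices[0].text)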

@spreadingmind

It's not super well documented, but you just need to pass "--max-lora-rank 64" (or whatever rank you need) when serving, since the default is 16.

python -m vllm.entrypoints.openai.api_server --max-lora-rank 64 --model model_name --enable-lora --lora-modules lora-name=lora_path

Thanks for the answer, it helped me as well. For those using the Python API, the equivalent would be:

import torch
from vllm import LLM

# args comes from the surrounding argparse setup (model name and dtype)
llm = LLM(
    model=args.model, tensor_parallel_size=torch.cuda.device_count(),
    dtype=args.dtype, trust_remote_code=True, enable_lora=True, max_lora_rank=64,
)
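
To then route a generation through a specific adapter with that engine, a LoRARequest is passed to generate(). A minimal sketch, where the adapter name, id, and path are placeholders:

from vllm import SamplingParams
from vllm.lora.request import LoRARequest

sampling_params = SamplingParams(temperature=0.0, max_tokens=32)

# LoRARequest takes a human-readable adapter name, a unique integer id,
# and the local path to the adapter weights.
outputs = llm.generate(
    ["Hello, my name is"],
    sampling_params,
    lora_request=LoRARequest("my-adapter", 1, "/path/to/lora_adapter"),
)
print(outputs[0].outputs[0].text)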

@Napuh

Napuh commented Apr 10, 2024

Both answers work for me, up to rank 64. Rank > 64 is not supported yet.

See #3934

@patrickrho

patrickrho commented Jun 9, 2024

Can we get LoRA rank > 64 supported and merged?

edit: I'm also curious whether supporting only up to rank 64 was by design; if so, please let me know.

@kevinjesse

Bump. I need adapters that are much, much larger to be supported. Thanks

@jiangjin1999

Is there something special about LoRA rank > 64? I wonder why only ranks <= 64 are supported.

@JohnUiterwyk

Same here, this is a blocker for me.

@Peter-Devine

Peter-Devine commented Aug 4, 2024

@JohnUiterwyk, has this not been fixed by the suggestions from @dspoka and @spreadingmind? Their suggestions worked for me.

@JohnUiterwyk

No, the maximum max_lora_rank is 64, so going higher than that throws an error. I have adapters with rank 128 and 256 for certain use cases, and I cannot serve them with vLLM because of the hardcoded limit on the value allowed for max_lora_rank.
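
For context, the error comes from a rank check in vLLM's LoRA config validation. The sketch below is illustrative only, not vLLM's actual source, and the allow-list values are an assumption; it just shows the kind of hardcoded cap being discussed:

SUPPORTED_LORA_RANKS = (8, 16, 32, 64)  # assumed allow-list, for illustration only

def validate_max_lora_rank(max_lora_rank: int) -> None:
    # Mimics the kind of check that rejects ranks above the hardcoded maximum.
    if max_lora_rank not in SUPPORTED_LORA_RANKS:
        raise ValueError(
            f"max_lora_rank ({max_lora_rank}) must be one of {SUPPORTED_LORA_RANKS}.")

validate_max_lora_rank(128)  # raises ValueError, matching the limit reported here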


github-actions bot commented Dec 2, 2024

This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!

github-actions bot added the stale label Dec 2, 2024
@AntreasAntonio

Any updates on this? Recent papers have shown that rank = 256 can be very beneficial, for example, and I suspect the trend toward higher ranks will continue in the near future.

github-actions bot added the unstale label and removed the stale label Dec 10, 2024