Skip to content

[RFC]: Join the MultiLora and MultiLora Dynammic Serving feature develop #396

@ZhengJun9

Description

@ZhengJun9

Motivation.

We would like to join the MultiLora and MultiLora Dynammic Serving feature develop.

Proposed Change.

We want to implement the following:
1、The MultiLora usage should be the same as vllm, see https://docs.vllm.ai/en/latest/features/lora.html
2、What's more, for production environments use, the dynamically serving LoRA Adapters should be persistence. That means when the docker is restarted in some case, the load/unload lora adapters should not roll back to the initial state.

Feedback Period.

we plan to finnish this work in two weeks

CC List.

Yikun
wangxiyuan

Any Other Things.

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    RFCRequest For Comments

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions