@jesse996 (Contributor) commented May 9, 2025

What this PR does / why we need it?

Following RFC #396 and #448, this pull request adds the relevant code to support LoRA in the v1 Engine.

Does this PR introduce any user-facing change?

The following OpenAI-compatible HTTP APIs will be supported:
/v1/load_lora_adapter
/v1/unload_lora_adapter
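
A minimal sketch of how a client might call these two endpoints, using only the Python standard library. The PR itself does not spell out the request bodies; the `lora_name` and `lora_path` fields below follow the upstream vLLM dynamic LoRA serving documentation as I understand it, and the base URL, adapter name, and adapter path are placeholder assumptions. Upstream vLLM also gates these endpoints behind the `VLLM_ALLOW_RUNTIME_LORA_UPDATING` environment variable, which is assumed to apply here as well.

```python
import json
import urllib.request

# Assumption: a vLLM OpenAI-compatible server is listening here.
BASE_URL = "http://localhost:8000"


def build_lora_request(endpoint, payload, base_url=BASE_URL):
    """Build (but do not send) a JSON POST request for a LoRA endpoint."""
    return urllib.request.Request(
        url=f"{base_url}{endpoint}",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def load_request(lora_name, lora_path):
    # Field names are taken from upstream vLLM docs, not from this PR.
    return build_lora_request(
        "/v1/load_lora_adapter",
        {"lora_name": lora_name, "lora_path": lora_path},
    )


def unload_request(lora_name):
    return build_lora_request(
        "/v1/unload_lora_adapter",
        {"lora_name": lora_name},
    )


if __name__ == "__main__":
    # Placeholder adapter name and path; replace with real values.
    for req in (
        load_request("sql_adapter", "/path/to/sql-lora-adapter"),
        unload_request("sql_adapter"),
    ):
        with urllib.request.urlopen(req) as resp:
            print(req.full_url, resp.status)
```

The request construction is kept separate from sending so the payloads can be inspected or logged without a running server.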

How was this patch tested?

git clone https://github.com/vllm-project/vllm.git
cd vllm/examples/offline_inference/ && python3 multilora_inference.py

@wangxiyuan (Collaborator)

Thanks for the PR. It would be better to add an e2e test for LoRA as well.

@wangxiyuan (Collaborator)

The test is not run by default. It should be added here as well: https://github.com/vllm-project/vllm-ascend/blob/main/.github/workflows/vllm_ascend_test.yaml#L114C1-L115C1

@jesse996 jesse996 closed this May 16, 2025
@jesse996 jesse996 reopened this May 16, 2025
Signed-off-by: jesse <szxfml@gmail.com>

@wangxiyuan (Collaborator) left a comment

LGTM. The CI test time increased from 30 min to 90 min. I suggest moving this to the daily test in the future.

@wangxiyuan (Collaborator)

Let's focus on #893 instead.

@wangxiyuan wangxiyuan closed this May 20, 2025