How "vllm-server" option is meant to work? #17079
Unanswered
About-to-break asked this question in Q&A
Replies: 1 comment
Hello, "vl_rec_backend" refers to the inference backend, which is essentially a method of performing inference. The "vl_rec_server_url" is the link to the started vLLM service. It can be configured to point to any vLLM service—as long as the model is integrated with vLLM and the service is launched, this link will be available. There is no scenario where an API key is missing. We recommend first familiarizing yourself with the specific process of deploying vLLM. To use a custom visual language model, simply launch a service based on that model and enter the service address into the vl_rec_server_url field. |
Sorry if this was asked before, or if I'm just being dumb.
When initializing the PaddleOCRVL class, there are options such as "vl_rec_backend": "vllm-server" and "vl_rec_server_url": "http://gpu_server.local:8000/v1". How is this supposed to be used if I cannot specify an API key for vLLM, or the model name? I want to use vLLM to run a custom compatible VL model and still have the benefits of Paddle's own ROI detectors. And how is this even supposed to work? Do all detected ROIs get recognized by the VL model one by one, with the results then assembled into the full output?
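Regarding the API key and model name: a vLLM service exposes an OpenAI-compatible API, so both can be checked directly against the running server before pointing PaddleOCRVL at it. A minimal sketch, assuming the openai client package and a server started without --api-key (in which case the conventional "EMPTY" placeholder key is enough); the URL is the one from the question:

```python
from openai import OpenAI  # the vLLM server speaks the OpenAI-compatible API

# A vLLM server started without --api-key does not check the key; the
# OpenAI client still requires a string, so "EMPTY" is the usual placeholder.
client = OpenAI(base_url="http://gpu_server.local:8000/v1", api_key="EMPTY")

# The model name is not lost either: the server reports what it is hosting.
for model in client.models.list().data:
    print(model.id)
```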