[Scheduler] validate max_num_batched_tokens and max_model_len in AscendSchedulerConfig #5726
Artifacts
Produced during runtime
Name | Size | Digest
---|---|---
vllm-ascend-ubuntu-24.04-arm-py3.11-wheel | 453 KB | sha256:2d0276d7beafab9a2c017fe33acd2966ee6c89f3b0efc6708bfa5452925017b8
vllm-ascend-ubuntu-24.04-py3.11-wheel | 462 KB | sha256:5da682d5fc7c62bd1502c0624d24332208347fb73f245829257b19facd06c2e0