Commit c1bfb7a

wangxiaoxin-sherie committed xx
1 parent bcda351 commit c1bfb7a

1 file changed: +3 -0 lines changed

vllm_ascend/worker/model_runner_v1.py

Lines changed: 3 additions & 0 deletions
@@ -2205,6 +2205,9 @@ def _build_attention_metadata(self, create_mixed_batch, num_reqs,
         self.seq_lens_np[:num_reqs] = seq_lens
         self.seq_lens_np[num_reqs:] = 0
 
+        self.query_start_loc[:num_reqs + 1] = torch.arange(num_reqs + 1)
+        self.query_start_loc_cpu[:num_reqs + 1] = torch.arange(num_reqs + 1)
+
         num_computed_tokens_cpu = (
             self.input_batch.num_computed_tokens_cpu_tensor[:num_reqs])
 
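
The added lines fill the first num_reqs + 1 entries of query_start_loc with 0, 1, ..., num_reqs. A minimal sketch of why that pattern arises, assuming query_start_loc holds the cumulative start offset of each request's query tokens and that this code path schedules exactly one token per request (neither assumption is stated in the commit itself). The helper build_query_start_loc is hypothetical and uses plain Python lists rather than the runner's tensors:

```python
from itertools import accumulate


def build_query_start_loc(query_lens):
    """Return cumulative start offsets for a batch of requests.

    Entry i is the index of the first query token of request i in the
    flattened token buffer; the final entry is the total token count.
    Hypothetical helper for illustration, not part of vllm_ascend.
    """
    return [0, *accumulate(query_lens)]


# Assumed case: every request contributes exactly one query token.
num_reqs = 3
uniform = build_query_start_loc([1] * num_reqs)
# The offsets then coincide with arange(num_reqs + 1).
assert uniform == list(range(num_reqs + 1))

# With uneven query lengths the offsets differ from a plain arange.
mixed = build_query_start_loc([4, 1, 2])
assert mixed != list(range(len(mixed)))
```

Under that reading, assigning an arange is a shortcut for the one-token-per-request case; batches with longer queries would need the cumulative sum instead.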
