Commit 47c43db

bugfix for mtp>1
Signed-off-by: zouyida2052 <zouyida2002@gmail.com>
1 parent 4b2a9c4 commit 47c43db

1 file changed: +1 -0 lines changed

vllm_ascend/attention/mla_v1.py

Lines changed: 1 addition & 0 deletions
@@ -201,6 +201,7 @@ def __init__(self,
         assert self.decode_threshold <= 16, f"decode_threshold exceeded \
         npu_fused_infer_attention_score TND layout's limit of 16, \
         got {self.decode_threshold}"
+
         self.reorder_batch_threshold = self.decode_threshold
 
         if self.chunked_prefill_enabled:
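
The check in this hunk can be tried in isolation. The following is a minimal sketch, not the code in mla_v1.py: the class name `DecodeThresholdConfig` and the constant `TND_LAYOUT_LIMIT` are hypothetical, introduced only for illustration. It also builds the message with adjacent string literals instead of backslash continuations, because a backslash continuation inside a string literal keeps the leading whitespace of the following line as part of the message.

```python
class DecodeThresholdConfig:
    """Hypothetical stand-in for the attributes touched in the hunk above."""

    # Limit quoted in the assertion message of the diff (assumed constant name).
    TND_LAYOUT_LIMIT = 16

    def __init__(self, decode_threshold: int) -> None:
        self.decode_threshold = decode_threshold

        # Same condition as in the diff; the message uses implicit string
        # concatenation so no indentation leaks into the text.
        assert self.decode_threshold <= self.TND_LAYOUT_LIMIT, (
            "decode_threshold exceeded "
            "npu_fused_infer_attention_score TND layout's limit of "
            f"{self.TND_LAYOUT_LIMIT}, got {self.decode_threshold}"
        )

        # Mirrors the assignment that follows the assertion in the diff.
        self.reorder_batch_threshold = self.decode_threshold


if __name__ == "__main__":
    ok = DecodeThresholdConfig(16)
    print(ok.reorder_batch_threshold)

    try:
        DecodeThresholdConfig(17)
    except AssertionError as err:
        print(err)
```

Constructing the object with a value of 16 or less succeeds and copies it to `reorder_batch_threshold`; a larger value raises `AssertionError` with a single-line message.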
