Skip to content

Commit 7f680d0

Browse files
committed
增加计算流等待预取流结束
1 parent 51254d0 commit 7f680d0

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm_ascend/ops/moe/moe_mlp.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -86,7 +86,7 @@ def quant_apply_mlp(hidden_states: torch.Tensor,
8686

8787
bias1, bias2 = None, None
8888
_output_dtype = w2_scale.dtype
89-
89+
wait_prefetch_done()
9090
is_mc2 = get_forward_context().moe_comm_method_name == "mc2commimpl"
9191
if w1_scale_bias is None and is_mc2:
9292
if w1_scale.dtype != torch.float32:

0 commit comments

Comments
 (0)