Skip to content

Commit 0b80c6a

Browse files
committed
update
Signed-off-by: ttanzhiqiang <389825161@qq.com>
1 parent f1fe025 commit 0b80c6a

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

examples/run_dp_attention_etp16.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -18,6 +18,6 @@ nohup python -m vllm.entrypoints.openai.api_server --model=/mnt/deepseek/DeepSee
1818
--max-num-batched-tokens 32768 \
1919
--block-size 128 \
2020
--no-enable-prefix-caching \
21-
--additional-config '{"torchair_graph_batch_sizes":[24],"expert_tensor_parallel_size":16,"use_cached_npu_graph":true,"ascend_scheduler_config":{},"enable_graph_mode":true}' \
21+
--additional-config '{"torchair_graph_config":{"enabled":true,"use_cached_graph":true,"graph_batch_sizes":[24]},"ascend_scheduler_config":{"enabled":true},"expert_tensor_parallel_size":16}' \
2222
--gpu-memory-utilization 0.96 &> run.log &
2323
disown

0 commit comments

Comments
 (0)