[Doc] Add Qwen3-Omni-30B-A3B-Thinking Tutorials #3991
base: main
Conversation
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to Contributing and Testing.
Code Review
This PR adds a new tutorial for running Qwen3-Omni-30B-A3B-Thinking on multiple NPUs. The documentation is comprehensive, but I've found a few critical issues in the provided code snippets that would prevent users from successfully running the examples. Specifically, there's an incorrect package name in a pip install command and an incorrect model path in the offline inference script. There is also a typo in the filename of the new document. Please address these issues to ensure the tutorial is accurate and easy to follow.
```bash
# If you already have transformers installed, please update transformer version >= 4.57.0.dev0
# pip install transformer -U
```
The package name in the commented install command is incorrect: the package is `transformers`, so this should be `pip install transformers -U` (the comment above has the same `transformer`/`transformers` typo).
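As a side note, here is a quick way to check whether the installed version meets that requirement; this is only a sketch, and the minimum `4.57.0.dev0` is taken from the tutorial's own comment above:

```python
# Sketch: compare the installed transformers version against the tutorial's
# stated minimum (4.57.0.dev0). Uses the "packaging" package for PEP 440
# version comparison; install it separately if it is not already present.
from importlib.metadata import version
from packaging.version import Version

installed = Version(version("transformers"))
required = Version("4.57.0.dev0")
status = "OK" if installed >= required else "too old, run: pip install -U transformers"
print(f"transformers {installed} (need >= {required}): {status}")
```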
```python
def main():
    MODEL_PATH = "/Qwen/Qwen3-Omni-30B-A3B-Thinking"
```
The MODEL_PATH is set to an absolute path "/Qwen/Qwen3-Omni-30B-A3B-Thinking". This is inconsistent with other tutorials and the online inference command in this same file, which use the model identifier directly. Using an absolute path might cause model loading to fail if the model is not present at that exact location in the container. It's better to use the model identifier and let vLLM handle the download and caching, especially since VLLM_USE_MODELSCOPE=True is set.
```diff
- MODEL_PATH = "/Qwen/Qwen3-Omni-30B-A3B-Thinking"
+ MODEL_PATH = "Qwen/Qwen3-Omni-30B-A3B-Thinking"
```
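For context, a minimal text-only sketch of offline inference with the bare identifier; the parallelism and sequence-length values below are assumptions for illustration, not the tutorial's actual settings:

```python
import os

from vllm import LLM, SamplingParams

# With VLLM_USE_MODELSCOPE=True, vLLM resolves the bare identifier through
# ModelScope and manages the download/cache; no absolute local path is needed.
os.environ["VLLM_USE_MODELSCOPE"] = "True"

MODEL_PATH = "Qwen/Qwen3-Omni-30B-A3B-Thinking"


def main():
    llm = LLM(
        model=MODEL_PATH,
        tensor_parallel_size=4,  # assumed multi-NPU split; adjust to your setup
        max_model_len=8192,      # assumed value for this sketch
        trust_remote_code=True,
    )
    params = SamplingParams(temperature=0.7, max_tokens=256)
    outputs = llm.generate(["Give a one-sentence summary of what this model can do."], params)
    for out in outputs:
        print(out.outputs[0].text)


if __name__ == "__main__":
    main()
```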
```
@@ -0,0 +1,192 @@
# Multi-NPU (Qwen3-Omni-30B-A3B-Thinking)
```
Force-pushed from 0aab0b3 to 2f48bac
Please follow the change in 5f08e07; we're working on a tutorial refactor now.
Force-pushed from 7b0e7ff to 6824773
Signed-off-by: Meihan-chen <jcccx.cmh@gmail.com>
Force-pushed from 6824773 to b12abbc
What this PR does / why we need it?
Add Qwen3-Omni-30B-A3B-Thinking Tutorials
Does this PR introduce any user-facing change?
No
How was this patch tested?