Add aisbench nightly test cases #3474

jiangyunfan1 · 2025-10-15T07:14:36Z

What this PR does / why we need it?

This PR adds the first aisbench case for nightly test, it lays a foundation for following performance and accuracy tests in nightly test.

Does this PR introduce any user-facing change?

No

How was this patch tested?

By running the test

vLLM version: v0.11.0rc3
vLLM main: https://github.com/vllm-project/vllm/commit/v0.11.0

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

Signed-off-by: wangli <wangli858794774@gmail.com>

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

gemini-code-assist

Code Review

This pull request adds new nightly tests using aisbench and a corresponding helper script. The overall approach is sound, but the new tools/aisbench.py script contains several critical and high-severity issues. These include potential UnboundLocalError exceptions due to incorrect variable scoping, a resource leak from an unmanaged subprocess, a risk of an infinite loop when monitoring the subprocess, and fragile logic that relies on hardcoded values or specific string formats. Addressing these issues is crucial for making the test runner robust and reliable.

gemini-code-assist · 2025-10-15T07:17:33Z

tools/aisbench.py

+        if self.task_type == "accuracy":
+            aisbench_cmd = [
+                'ais_bench', '--models', f'{self.request_conf}_custom',
+                '--datasets', f'{dataset_conf}', '--debug'
+            ]
+        if self.task_type == "performance":
+            aisbench_cmd = [
+                'ais_bench', '--models', f'{self.request_conf}_custom',
+                '--datasets', f'{dataset_conf}_custom', '--debug', '--mode', 
+                'perf'
+            ]
+            if self.num_prompts:
+                aisbench_cmd.extend(['--num-prompts', str(self.num_prompts)])


The aisbench_cmd variable is only defined within the if self.task_type == "accuracy": and if self.task_type == "performance": blocks. If self.task_type has any other value, aisbench_cmd will be unbound when used on line 56, causing an UnboundLocalError. It's better to use an if/elif/else structure to handle all cases, raising an error for unsupported task types.

if self.task_type == "accuracy": aisbench_cmd = [ 'ais_bench', '--models', f'{self.request_conf}_custom', '--datasets', f'{dataset_conf}', '--debug' ] elif self.task_type == "performance": aisbench_cmd = [ 'ais_bench', '--models', f'{self.request_conf}_custom', '--datasets', f'{dataset_conf}_custom', '--debug', '--mode', 'perf' ] if self.num_prompts: aisbench_cmd.extend(['--num-prompts', str(self.num_prompts)]) else: raise ValueError(f"Unsupported task_type: {self.task_type}")

tools/aisbench.py

gemini-code-assist · 2025-10-15T07:17:34Z

tools/aisbench.py

+        while True:
+            line = self.proc.stdout.readline().strip()
+            print(line)
+            if result_msg in line:
+                self.result_line = line
+                return
+            if "ERROR" in line:
+                raise RuntimeError(
+                    "Some errors happen to Aisbench task.") from None


The while True loop reading from self.proc.stdout can hang indefinitely if the subprocess closes its standard output without printing either the expected result_msg or an "ERROR" line. readline() will block, waiting for data that will never arrive. The loop should also check if the process has terminated. When an error does occur, including stderr in the exception message would be very helpful for debugging.

while True: line = self.proc.stdout.readline().strip() if not line and self.proc.poll() is not None: # Process ended without finding the result message stderr = self.proc.stderr.read() raise RuntimeError(f"Aisbench task finished unexpectedly. Stderr: {stderr}") print(line) if result_msg in line: self.result_line = line return if "ERROR" in line: stderr = self.proc.stderr.read() raise RuntimeError(f"Some errors happen to Aisbench task. Stderr: {stderr}")

gemini-code-assist · 2025-10-15T07:17:34Z

tools/aisbench.py

+        result_csv_file = os.path.join(result_dir, "gsm8kdataset.csv")
+        result_json_file = os.path.join(result_dir, "gsm8kdataset.json")


The result filenames gsm8kdataset.csv and gsm8kdataset.json are hardcoded. This restricts this method to only work with the gsm8k dataset. To make this runner more flexible and reusable for other datasets, the dataset name should be derived dynamically, for instance from self.dataset_conf.

dataset_name = self.dataset_conf.split('/')[0] result_csv_file = os.path.join(result_dir, f"{dataset_name}dataset.csv") result_json_file = os.path.join(result_dir, f"{dataset_name}dataset.json")

github-actions · 2025-10-15T07:20:31Z

👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:‌‌

A PR should do only one thing, smaller PRs enable faster reviews.
Every PR should include unit tests and end-to-end tests ‌to ensure it works and is not broken by other future PRs.
Write the commit message by fulfilling the PR description to help reviewer and future developers understand.

If CI fails, you can run linting and testing checks locally according Contributing and Testing.

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

jiangyunfan1 and others added 30 commits October 12, 2025 09:24

add a case

8edcacf

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

add a job

9b55223

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

fix some issues

798ca2e

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

fix some issues

1543c62

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

fix some issues

b4e6a7a

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

fix some issues

3eb5cfe

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

fix some issues

065ddf5

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

fix some issues

ac0c471

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

fix some issues

ce98fe8

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

fix some issues

23f767a

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

fix some issues

e87e849

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

fix some issues

a3b1970

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

add changes

6dca606

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

delete old files

4d10b7e

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

add a case of qwen3-32b-int8

cc3c4a1

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

rebase

9ef536b

Signed-off-by: wangli <wangli858794774@gmail.com>

fix

22fe4d7

Signed-off-by: wangli <wangli858794774@gmail.com>

add workflow

93a7749

Signed-off-by: wangli <wangli858794774@gmail.com>

fix path

3eb536a

Signed-off-by: wangli <wangli858794774@gmail.com>

fix

cc25561

Signed-off-by: wangli <wangli858794774@gmail.com>

just for test

6a4003b

Signed-off-by: wangli <wangli858794774@gmail.com>

fix

ba518fb

Signed-off-by: wangli <wangli858794774@gmail.com>

add workflow

799ebb3

Signed-off-by: wangli <wangli858794774@gmail.com>

fix

f852e30

Signed-off-by: wangli <wangli858794774@gmail.com>

rm vllm_use_v1 env

26762c8

Signed-off-by: wangli <wangli858794774@gmail.com>

add port

8dbb2ed

Signed-off-by: wangli <wangli858794774@gmail.com>

add trigger

24320bb

Signed-off-by: wangli <wangli858794774@gmail.com>

test

93d8dfc

Signed-off-by: wangli <wangli858794774@gmail.com>

test

45bfb9c

Signed-off-by: wangli <wangli858794774@gmail.com>

revert

fae574f

Signed-off-by: wangli <wangli858794774@gmail.com>

Potabk and others added 10 commits October 12, 2025 09:24

revert

5261d96

Signed-off-by: wangli <wangli858794774@gmail.com>

fix

4b6c11b

Signed-off-by: wangli <wangli858794774@gmail.com>

fix

ce6354f

Signed-off-by: wangli <wangli858794774@gmail.com>

fix

bbab015

Signed-off-by: wangli <wangli858794774@gmail.com>

fix

3bc091c

Signed-off-by: wangli <wangli858794774@gmail.com>

add test

346118b

Signed-off-by: wangli <wangli858794774@gmail.com>

fix lint

670e4a9

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

Merge branch 'vllm-project:main' into main

c9b5986

Merge branch 'vllm-project:main' into main

5058c08

add nightly test aisbench cases

4a23a11

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

gemini-code-assist bot reviewed Oct 15, 2025

View reviewed changes

github-actions bot added module:tests module:tools labels Oct 15, 2025

fix issues

e467d24

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

wangxiyuan added the ready read for review label Oct 15, 2025

jiangyunfan1 added 4 commits October 15, 2025 08:24

fix import

4a60fb7

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

fix import

aa474b1

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

ignore modelscope check

cfbf3c9

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

add aisbench workflow

3a76502

Signed-off-by: jiangyunfan1 <jiangyunfan1@h-partners.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add aisbench nightly test cases #3474

Add aisbench nightly test cases #3474

jiangyunfan1 commented Oct 15, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Oct 15, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot Oct 15, 2025

Uh oh!

gemini-code-assist bot Oct 15, 2025

Uh oh!

github-actions bot commented Oct 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

		result_csv_file = os.path.join(result_dir, "gsm8kdataset.csv")
		result_json_file = os.path.join(result_dir, "gsm8kdataset.json")

Add aisbench nightly test cases #3474

Are you sure you want to change the base?

Add aisbench nightly test cases #3474

Conversation

jiangyunfan1 commented Oct 15, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does / why we need it?

Does this PR introduce any user-facing change?

How was this patch tested?

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gemini-code-assist bot Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Oct 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

jiangyunfan1 commented Oct 15, 2025 •

edited by github-actions bot

Loading