Skip to content

Benchmarks / accuracy #7

Benchmarks / accuracy

Benchmarks / accuracy #7

Triggered via schedule July 10, 2025 18:35
Status Cancelled
Total duration 7h 33m 31s
Artifacts

accuracy_test.yaml

on: schedule
Matrix: accuracy_tests
create_pr
0s
create_pr
Fit to window
Zoom out
Zoom in

Annotations

4 errors
Qwen/Qwen3-30B-A3B accuracy V1
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Qwen/Qwen2.5-VL-7B-Instruct accuracy V1
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Qwen/Qwen3-8B-Base accuracy V1
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Benchmarks / accuracy
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists