Skip to content

Benchmarks / accuracy #12

Benchmarks / accuracy

Benchmarks / accuracy #12

Triggered via schedule September 1, 2025 03:41
Status Cancelled
Total duration 3h 12m 58s
Artifacts

accuracy_test.yaml

on: schedule
Matrix: accuracy_tests
create_pr
0s
create_pr
Fit to window
Zoom out
Zoom in

Annotations

5 errors
Qwen3-30B-A3B accuracy
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Qwen2.5-VL-7B-Instruct accuracy
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
DeepSeek-V2-Lite accuracy
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Qwen3-8B-Base accuracy
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Benchmarks / accuracy
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists