Skip to content

Benchmarks / accuracy #104

Benchmarks / accuracy

Benchmarks / accuracy #104

Triggered via schedule September 24, 2025 01:02
Status Cancelled
Total duration 5h 18m 16s
Artifacts

accuracy_test.yaml

on: schedule
Matrix: accuracy_tests
create_pr
0s
create_pr
Fit to window
Zoom out
Zoom in

Annotations

5 errors
Qwen3-8B-Base accuracy
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Qwen3-30B-A3B accuracy
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
DeepSeek-V2-Lite accuracy
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Qwen2.5-VL-7B-Instruct accuracy
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists
Benchmarks / accuracy
Canceling since a higher priority waiting request for Benchmarks / accuracy-refs/heads/main exists