Test CUDA Builds

[ET-VK][export] Update tensor representation sync logic to allow for flexibility in memory layouts #9518

Triggered via pull request February 21, 2026 14:01 by SS-JIA: synchronize #17564 (gh/SS-JIA/438/head)

Status Failure

Total duration 1h 14m 5s

Artifacts 14

cuda.yml

on: pull_request

Matrix: export-model-cuda-artifact

Matrix: test-cuda-builds

unittest-cuda / linux-job

Matrix: test-models-cuda

Matrix: test-cuda-pybind

Matrix: test-model-cuda-e2e

check-all-cuda-builds

Annotations

1 error

test-model-cuda-e2e (openai, whisper-small, quantized-int4-tile-packed) / linux-job

Process completed with exit code 1.

Artifacts

Produced during runtime

Name	Size	Digest
google-gemma-3-4b-it-cuda-non-quantized	7.22 GB	`sha256:7b8123410df837f3db14188a80293e36537bdd6652daad501f3bb79f6f78c10c`
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed	3.36 GB	`sha256:639bcc00f78652ac7d09ec5def9e44d6e2b94115d17ee686230b99231ab26e66`
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized	6.82 GB	`sha256:e27b839f5018278ce2650cfdff4c22d091c3bf180dd5aebb783781913b9aed75`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed	2.8 GB	`sha256:e6a2dd4ca44280435a920d47608cd65604769d59a36125b5ba693748fe67bda0`
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only	6.14 GB	`sha256:b9eacaae196a47bbf858b568bb2d328a5f51c25601b2d59b9b842a9cdb6636df`
nvidia-parakeet-tdt-cuda-non-quantized	952 MB	`sha256:7e7daec707b0875aea9debdd04f27d070552bbe827abc889bc4de15e5ef37a1d`
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed	443 MB	`sha256:74ca7b0784248e790e1e15443859d759f12db306314d1188605f04d051bc738c`
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only	430 MB	`sha256:5dd963b0ec8b9c29cafe9e2083a7aef00b2a40dd2d89ac77384346a7beeb93bb`
openai-whisper-large-v3-turbo-cuda-non-quantized	1.18 GB	`sha256:27146614aa93acf17a24259c6e05d950902a9b288a73d97a877fd9da90af9be5`
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed	491 MB	`sha256:40dd3e1bc2f9dff9c2ce0e69b3463a4ed9d8427844b39192ed7e50783507b470`
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only	485 MB	`sha256:22c0b2055d8737ca2626a08930a569b3399b3e509f41e3ffa822fa6aae71a27c`
openai-whisper-small-cuda-non-quantized	361 MB	`sha256:abca8e04679bda902348261a1112e616cf6e52a0889bf78f07064a7895f85491`
openai-whisper-small-cuda-quantized-int4-tile-packed	172 MB	`sha256:cd79629bbf978e5566f5dcef395f3c6d553c9a7bf79a79c2988aa619665c95aa`
openai-whisper-small-cuda-quantized-int4-weight-only	271 MB	`sha256:6c18d6f3697ce4582e3068ed093d56b5bde10acb9b77653346c694048dfa97b9`
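
The Digest column can be used to check that a downloaded artifact is intact. The following is a minimal sketch, assuming the listed digest is the SHA-256 of the downloaded artifact archive as a whole; the helper names and the local file name in the usage note are hypothetical and not part of the workflow.

```python
import hashlib
import hmac


def sha256_digest(path, chunk_size=1 << 20):
    """Return 'sha256:<hex>' for the file at `path`, read in chunks
    so multi-gigabyte artifacts do not need to fit in memory."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return "sha256:" + h.hexdigest()


def matches_listed_digest(path, listed):
    """Compare a local file against a digest copied from the table.
    Surrounding whitespace and backticks are stripped before comparing."""
    expected = listed.strip().strip("`").lower()
    return hmac.compare_digest(sha256_digest(path), expected)
```

For example, `matches_listed_digest("openai-whisper-small-cuda-non-quantized.zip", "sha256:abca8e04679bda902348261a1112e616cf6e52a0889bf78f07064a7895f85491")` would return `True` only if the local file (hypothetical name) hashes to the value listed for that artifact.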