[ET-VK][export] Update tensor representation sync logic to allow for flexibility in memory layouts #9518
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda
/
linux-job
24m 55s
Matrix: test-models-cuda
Annotations
1 error
|
test-model-cuda-e2e (openai, whisper-small, quantized-int4-tile-packed) / linux-job
Process completed with exit code 1.
|
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
google-gemma-3-4b-it-cuda-non-quantized
|
7.22 GB |
sha256:7b8123410df837f3db14188a80293e36537bdd6652daad501f3bb79f6f78c10c
|
|
|
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed
|
3.36 GB |
sha256:639bcc00f78652ac7d09ec5def9e44d6e2b94115d17ee686230b99231ab26e66
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized
|
6.82 GB |
sha256:e27b839f5018278ce2650cfdff4c22d091c3bf180dd5aebb783781913b9aed75
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed
|
2.8 GB |
sha256:e6a2dd4ca44280435a920d47608cd65604769d59a36125b5ba693748fe67bda0
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only
|
6.14 GB |
sha256:b9eacaae196a47bbf858b568bb2d328a5f51c25601b2d59b9b842a9cdb6636df
|
|
|
nvidia-parakeet-tdt-cuda-non-quantized
|
952 MB |
sha256:7e7daec707b0875aea9debdd04f27d070552bbe827abc889bc4de15e5ef37a1d
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed
|
443 MB |
sha256:74ca7b0784248e790e1e15443859d759f12db306314d1188605f04d051bc738c
|
|
|
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only
|
430 MB |
sha256:5dd963b0ec8b9c29cafe9e2083a7aef00b2a40dd2d89ac77384346a7beeb93bb
|
|
|
openai-whisper-large-v3-turbo-cuda-non-quantized
|
1.18 GB |
sha256:27146614aa93acf17a24259c6e05d950902a9b288a73d97a877fd9da90af9be5
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed
|
491 MB |
sha256:40dd3e1bc2f9dff9c2ce0e69b3463a4ed9d8427844b39192ed7e50783507b470
|
|
|
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only
|
485 MB |
sha256:22c0b2055d8737ca2626a08930a569b3399b3e509f41e3ffa822fa6aae71a27c
|
|
|
openai-whisper-small-cuda-non-quantized
|
361 MB |
sha256:abca8e04679bda902348261a1112e616cf6e52a0889bf78f07064a7895f85491
|
|
|
openai-whisper-small-cuda-quantized-int4-tile-packed
|
172 MB |
sha256:cd79629bbf978e5566f5dcef395f3c6d553c9a7bf79a79c2988aa619665c95aa
|
|
|
openai-whisper-small-cuda-quantized-int4-weight-only
|
271 MB |
sha256:6c18d6f3697ce4582e3068ed093d56b5bde10acb9b77653346c694048dfa97b9
|
|