Enable CUTLASS grouped GEMM for pretraining wgrad on GB200 and H100 (resubmit) #6047

Triggered via pull request September 22, 2025 21:37
Status Success
Total duration 2h 13m 39s
Artifacts 2
generate-matrix / generate: 6s
filter-matrix: 6s
Matrix: build

Annotations

1 warning
filter-matrix
The `python-version` input is not set. The version of Python currently in `PATH` will be used.

Artifacts

Produced during runtime
Name                                Size     Digest
pytorch_FBGEMM__3.10_cu126_aarch64  15.5 MB  sha256:57628fe8ff3d3955f7300474032514b5b7391803f9a0465a43833c6d0e353cd9
pytorch_FBGEMM__3.10_cu128_aarch64  46.5 MB  sha256:c8e84e17c8f7508103b28e0a5fa53fd708051400a7e2338bf3dec5d3fd7e54fc