Skip to content

Enable CUTLASS grouped GEMM for pretraining wgrad on GB200 and H100 #5960

Enable CUTLASS grouped GEMM for pretraining wgrad on GB200 and H100

Enable CUTLASS grouped GEMM for pretraining wgrad on GB200 and H100 #5960

Triggered via pull request September 17, 2025 03:09
Status Success
Total duration 2h 18m 4s
Artifacts 2
generate-matrix  /  generate
7s
generate-matrix / generate
filter-matrix
6s
filter-matrix
Matrix: build
Fit to window
Zoom out
Zoom in

Annotations

1 warning
filter-matrix
The `python-version` input is not set. The version of Python currently in `PATH` will be used.

Artifacts

Produced during runtime
Name Size Digest
pytorch_FBGEMM__3.10_cu126_aarch64
16.1 MB
sha256:889a6284767b0cd6218bbf75aa16700caa44b7d31be6d07314b5f8c850d04c25
pytorch_FBGEMM__3.10_cu128_aarch64
47.8 MB
sha256:8f2832524d248ef6887f5ef25937ddeb3afad279dd8377420a882540f73cf2da