Enable CUTLASS grouped GEMM for pretraining wgrad on GB200 and H100 #5563

Triggered via pull request September 17, 2025 16:49
Status Success
Total duration 18m 12s
Artifacts 4

fbgemm_gpu_benchmark_cpu.yml

on: pull_request
Matrix: build_artifact
Matrix: benchmark_artifact

Artifacts

Produced during runtime
Name                                       Size     Digest
fbgemm_gpu_nightly_cpu_arm_gcc_py3.13.whl  4.2 MB   sha256:8a15ee09084e9cfdc3c1e224695bdf67c68574cdad46081867d15f05662a8be8
fbgemm_gpu_nightly_cpu_x86_gcc_py3.13.whl  5.35 MB  sha256:6b22351194204b9e086d18a9f83c2fd4668b127649b8545d36e20e4fc2a5bc86
fbgemm_gpu_traces_arm_gcc_py3.13_cpu.zip   237 KB   sha256:365ce4691e4a355095a1c7f6994984b03295f1bd42c592e3b4466f7691aad955
fbgemm_gpu_traces_x86_gcc_py3.13_cpu.zip   237 KB   sha256:bd07cd5991e41fa4d5c46e2a7f22cd7eae2fdb3bb124d9b3736733a663bf6a5b