Enable CUTLASS grouped GEMM for pretraining wgrad on GB200 and H100 #5563
fbgemm_gpu_benchmark_cpu.yml
on: pull_request
Matrix: build_artifact
Matrix: benchmark_artifact
Artifacts
Produced during runtime
Name | Size | Digest | |
---|---|---|---|
fbgemm_gpu_nightly_cpu_arm_gcc_py3.13.whl
|
4.2 MB |
sha256:8a15ee09084e9cfdc3c1e224695bdf67c68574cdad46081867d15f05662a8be8
|
|
fbgemm_gpu_nightly_cpu_x86_gcc_py3.13.whl
|
5.35 MB |
sha256:6b22351194204b9e086d18a9f83c2fd4668b127649b8545d36e20e4fc2a5bc86
|
|
fbgemm_gpu_traces_arm_gcc_py3.13_cpu.zip
|
237 KB |
sha256:365ce4691e4a355095a1c7f6994984b03295f1bd42c592e3b4466f7691aad955
|
|
fbgemm_gpu_traces_x86_gcc_py3.13_cpu.zip
|
237 KB |
sha256:bd07cd5991e41fa4d5c46e2a7f22cd7eae2fdb3bb124d9b3736733a663bf6a5b
|
|