Skip to content

Enable CUTLASS grouped GEMM for pretraining wgrad on GB200 and H100 #5562

Enable CUTLASS grouped GEMM for pretraining wgrad on GB200 and H100

Enable CUTLASS grouped GEMM for pretraining wgrad on GB200 and H100 #5562

Triggered via pull request September 17, 2025 16:38
Status Cancelled
Total duration 11m 22s
Artifacts 2

fbgemm_gpu_benchmark_cpu.yml

on: pull_request
Matrix: build_artifact
Matrix: benchmark_artifact
Fit to window
Zoom out
Zoom in

Annotations

3 errors
FBGEMM_GPU-CPU Benchmark
Canceling since a higher priority waiting request for FBGEMM_GPU-CPU Benchmark-4886 exists
benchmark_artifact (x86, linux.4xlarge, 20, 3.13, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-CPU Benchmark-4886 exists
benchmark_artifact (arm, linux.arm64.m7g.4xlarge, 30, 3.13, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-CPU Benchmark-4886 exists

Artifacts

Produced during runtime
Name Size Digest
fbgemm_gpu_nightly_cpu_arm_gcc_py3.13.whl
4.2 MB
sha256:b5ab47646ac55a932621a291ef12cc06a0ad18e43eb35cf698db2f7547e73939
fbgemm_gpu_nightly_cpu_x86_gcc_py3.13.whl
5.35 MB
sha256:c426d1d66af29937eabe61cce6b7b1245d9bfd2c83d045ee8822624c3accf058