Enable CUTLASS grouped GEMM for pretraining wgrad on GB200 and H100 · pytorch/FBGEMM@7d30e87

Triggered via pull request September 17, 2025 16:38

synchronize #4886

Status Cancelled

Total duration 11m 22s

Artifacts 2

fbgemm_gpu_benchmark_cpu.yml

on: pull_request

Matrix: build_artifact

Matrix: benchmark_artifact

3 errors

Canceling since a higher priority waiting request for FBGEMM_GPU-CPU Benchmark-4886 exists

Canceling since a higher priority waiting request for FBGEMM_GPU-CPU Benchmark-4886 exists

Canceling since a higher priority waiting request for FBGEMM_GPU-CPU Benchmark-4886 exists

Produced during runtime

Name	Size	Digest
fbgemm_gpu_nightly_cpu_arm_gcc_py3.13.whl	4.2 MB	`sha256:b5ab47646ac55a932621a291ef12cc06a0ad18e43eb35cf698db2f7547e73939`
fbgemm_gpu_nightly_cpu_x86_gcc_py3.13.whl	5.35 MB	`sha256:c426d1d66af29937eabe61cce6b7b1245d9bfd2c83d045ee8822624c3accf058`
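
The Digest column above can be used to check a downloaded artifact. The following is a minimal sketch, not part of the workflow itself: the helper names `sha256_of` and `matches_digest` are hypothetical, and it assumes the digest is compared against the bytes of the file as downloaded (if the download is wrapped in an archive, the hash may apply to the archive rather than to the extracted wheel).

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 of a file, reading it in chunks."""
    h = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()


def matches_digest(path, digest):
    """Compare a file against a digest string of the form 'sha256:<hex>'."""
    algo, _, expected = digest.partition(":")
    if algo != "sha256" or not expected:
        raise ValueError(f"unsupported digest format: {digest!r}")
    return sha256_of(path) == expected.lower()
```

For example, `matches_digest(local_path, "sha256:c426d1d66af29937eabe61cce6b7b1245d9bfd2c83d045ee8822624c3accf058")` would check a local copy of the x86 artifact, where `local_path` is wherever the download was saved.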