@cthi cthi commented Sep 24, 2025

Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/1957

Split some clean-up/refactors from the core FP4 Torch API support to make the next diff more focused.

  • Removed `zero_start_index_M` as it is unused
  • Stopped passing `G` into the kernel directly, since it can be inferred
  • Renamed `ElementComputeEpilogue` to `ElementScale`
  • Added `namespace fbgemm_gpu` in `f4f4bf16_grouped_common.cuh`
  • Removed `num_x_scale_per_group` and `num_w_scale_per_group`, as both are unused
  • Removed unnecessary cutlass headers in `f4f4bf16_grouped.cu`
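
The `G` cleanup above follows a common pattern: when a count is already determined by data the callee receives, deriving it in one place avoids a redundant argument that can drift out of sync. Below is a minimal sketch of that idea, assuming the group count equals the length of a per-group size list. The `sketch` namespace, `GroupedProblem`, `num_groups`, and `total_rows` are hypothetical names for illustration and are not taken from the FBGEMM sources; how the actual kernel infers `G` is not stated in this PR description.

```cpp
#include <cstdint>
#include <vector>

namespace sketch {  // hypothetical namespace, NOT fbgemm_gpu

// Hypothetical description of a grouped GEMM problem.
// Assumption: one entry in m_sizes per group, so the group
// count never needs to be passed separately.
struct GroupedProblem {
  std::vector<int64_t> m_sizes;  // rows contributed by each group
  int64_t n;                     // shared output columns
  int64_t k;                     // shared reduction dimension
};

// Derive the group count from the data instead of taking it as an argument.
inline int64_t num_groups(const GroupedProblem& p) {
  return static_cast<int64_t>(p.m_sizes.size());
}

// Total number of rows across all groups.
inline int64_t total_rows(const GroupedProblem& p) {
  int64_t total = 0;
  for (int64_t m : p.m_sizes) {
    total += m;
  }
  return total;
}

}  // namespace sketch
```

With this shape, a caller builds a `GroupedProblem` and the launch code calls `sketch::num_groups(problem)` wherever it previously would have read an explicit `G`.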

Differential Revision: D83166227

Summary:
X-link: facebookresearch/FBGEMM#1956


as title

Differential Revision: D83083612

Summary:
X-link: facebookresearch/FBGEMM#1955


We will reuse this kernel as a base to add support for NV/MX FP4. As a first step, move it into its own file.

Differential Revision: D83151150

netlify bot commented Sep 24, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit 553b40e
🔍 Latest deploy log https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/68d4307293d0e60008ef1fa7
😎 Deploy Preview https://deploy-preview-4934--pytorch-fbgemm-docs.netlify.app

@meta-cla meta-cla bot added the cla signed label Sep 24, 2025
@facebook-github-bot

@cthi has exported this pull request. If you are a Meta employee, you can view the originating diff in D83166227.

@facebook-github-bot

This pull request has been merged in d9b6e01.
