Skip to content

Conversation

jiawenliu64
Copy link
Member

Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/1916

  • Make wgrad CUTLASS grouped gemm return float32 output when wgrad is provided, respecting e2e
  • Optimize general heuristic
  • Make tests cover wgrad accum with float32 output

Differential Revision: D82700455

Copy link

netlify bot commented Sep 18, 2025

Deploy Preview for pytorch-fbgemm-docs ready!

Name Link
🔨 Latest commit 9cd2d85
🔍 Latest deploy log https://app.netlify.com/projects/pytorch-fbgemm-docs/deploys/68d2ce446d3d8c00093f4e79
😎 Deploy Preview https://deploy-preview-4891--pytorch-fbgemm-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@meta-cla meta-cla bot added the cla signed label Sep 18, 2025
@facebook-github-bot
Copy link
Contributor

@jiawenliu64 has exported this pull request. If you are a Meta employee, you can view the originating diff in D82700455.

@facebook-github-bot
Copy link
Contributor

@jiawenliu64 has exported this pull request. If you are a Meta employee, you can view the originating diff in D82700455.

jiawenliu64 added a commit that referenced this pull request Sep 23, 2025
Summary:
Pull Request resolved: #4891

X-link: facebookresearch/FBGEMM#1916

- Make wgrad CUTLASS grouped gemm return float32 output when wgrad is provided, respecting e2e
- Optimize general heuristic
- Make tests cover wgrad accum with float32 output

Reviewed By: q10

Differential Revision: D82700455
@facebook-github-bot
Copy link
Contributor

@jiawenliu64 has exported this pull request. If you are a Meta employee, you can view the originating diff in D82700455.

jiawenliu64 added a commit that referenced this pull request Sep 23, 2025
Summary:
Pull Request resolved: #4891

X-link: facebookresearch/FBGEMM#1916

- Make wgrad CUTLASS grouped gemm return float32 output when wgrad is provided, respecting e2e
- Optimize general heuristic
- Make tests cover wgrad accum with float32 output

Reviewed By: q10

Differential Revision: D82700455
@facebook-github-bot
Copy link
Contributor

@jiawenliu64 has exported this pull request. If you are a Meta employee, you can view the originating diff in D82700455.

jiawenliu64 added a commit that referenced this pull request Sep 23, 2025
Summary:
Pull Request resolved: #4891

X-link: facebookresearch/FBGEMM#1916

- Make wgrad CUTLASS grouped gemm return float32 output when wgrad is provided, respecting e2e
- Optimize general heuristic
- Make tests cover wgrad accum with float32 output

Reviewed By: q10

Differential Revision: D82700455
@facebook-github-bot
Copy link
Contributor

@jiawenliu64 has exported this pull request. If you are a Meta employee, you can view the originating diff in D82700455.

jiawenliu64 added a commit that referenced this pull request Sep 23, 2025
Summary:
Pull Request resolved: #4891

X-link: facebookresearch/FBGEMM#1916

- Make wgrad CUTLASS grouped gemm return float32 output when wgrad is provided, respecting e2e
- Optimize general heuristic
- Make tests cover wgrad accum with float32 output

Reviewed By: q10

Differential Revision: D82700455
@facebook-github-bot
Copy link
Contributor

@jiawenliu64 has exported this pull request. If you are a Meta employee, you can view the originating diff in D82700455.

jiawenliu64 added a commit that referenced this pull request Sep 23, 2025
Summary:
Pull Request resolved: #4891

X-link: facebookresearch/FBGEMM#1916

- Make wgrad CUTLASS grouped gemm return float32 output when wgrad is provided, respecting e2e
- Optimize general heuristic
- Make tests cover wgrad accum with float32 output

Reviewed By: q10

Differential Revision: D82700455
@facebook-github-bot
Copy link
Contributor

@jiawenliu64 has exported this pull request. If you are a Meta employee, you can view the originating diff in D82700455.

jiawenliu64 added a commit that referenced this pull request Sep 23, 2025
Summary:
Pull Request resolved: #4891

X-link: facebookresearch/FBGEMM#1916

- Make wgrad CUTLASS grouped gemm return float32 output when wgrad is provided, respecting e2e
- Optimize general heuristic
- Make tests cover wgrad accum with float32 output

Reviewed By: q10

Differential Revision: D82700455
Summary:
Pull Request resolved: #4891

X-link: facebookresearch/FBGEMM#1916

- Make wgrad CUTLASS grouped gemm return float32 output when wgrad is provided, respecting e2e
- Optimize general heuristic
- Make tests cover wgrad accum with float32 output

Reviewed By: q10

Differential Revision: D82700455
@facebook-github-bot
Copy link
Contributor

@jiawenliu64 has exported this pull request. If you are a Meta employee, you can view the originating diff in D82700455.

@facebook-github-bot
Copy link
Contributor

This pull request has been merged in ddada9e.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants