Skip to content

Users/anouri/cms bf16 128x224x64 nt WIP#4427

Closed
alinouri-amd wants to merge 3 commits intohipblaslt_common_cms_phase2from
users/anouri/cms_bf16_128x224x64_nt
Closed

Users/anouri/cms bf16 128x224x64 nt WIP#4427
alinouri-amd wants to merge 3 commits intohipblaslt_common_cms_phase2from
users/anouri/cms_bf16_128x224x64_nt

Conversation

@alinouri-amd
Copy link
Contributor

@alinouri-amd alinouri-amd commented Feb 9, 2026

Motivation

  • CMS for 128x224x64 BF16 NT

Test Plan

Hipblaslt Performance

  • 12.5% speedup w.r.t non-CMS

Test Result

 - ProblemSizes:
 - Exact: [2048, 3584, 1, 8192]
 - Range: [[128], [224], [1], [1, 16, 64]]
 - Range: [[128], [224], [1], [32, 64, 256]]
  • Run Tensile - Pass
  • Run hipblaslt bench - Pass
  • Run hipblaslt test - Pass
  • pytest for this CMS

@alinouri-amd alinouri-amd requested a review from a team as a code owner February 9, 2026 18:50
@alinouri-amd alinouri-amd changed the title Users/anouri/cms bf16 128x224x64 nt Users/anouri/cms bf16 128x224x64 nt WIP Feb 9, 2026
@alinouri-amd
Copy link
Contributor Author

Note that this PR depends on 224x128x64 NT which currently has a race condition. So perhaps it is better to wait for the issue to be resolved first

@alinouri-amd alinouri-amd force-pushed the users/anouri/cms_bf16_128x224x64_nt branch from 9e9d047 to ac0d6a2 Compare February 11, 2026 18:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant