Skip to content

fix CMS for 224x128x64 NT bf16 #4434

Merged
emezh merged 1 commit intohipblaslt_common_cms_phase2from
users/emezh/fix_cms_bf16_224x128x64_nt
Feb 10, 2026
Merged

fix CMS for 224x128x64 NT bf16 #4434
emezh merged 1 commit intohipblaslt_common_cms_phase2from
users/emezh/fix_cms_bf16_224x128x64_nt

Conversation

@emezh
Copy link
Contributor

@emezh emezh commented Feb 9, 2026

Fixes issue #4422 by reducing the dscnt (follow-up from #4137)

No impact to performance in Tensile.

Tested with tensile-client

        - ProblemSizes:
          - Exact: [3584, 2048, 1, 8192]
          - Range: [[224], [128], [1], [1, 16, 64]]
          - Range: [[224], [128], [1], [32, 64, 256]]

hipblaslt-test also passes

[==========] 21891 tests from 12 test suites ran. (1441926 ms total)
[  PASSED  ] 21891 tests.

Submission Checklist

@emezh emezh requested a review from a team as a code owner February 9, 2026 20:05
@emezh emezh requested review from jfactory07 and removed request for a team February 9, 2026 20:05
@emezh emezh merged commit 9c5b36c into hipblaslt_common_cms_phase2 Feb 10, 2026
14 of 24 checks passed
@emezh emezh deleted the users/emezh/fix_cms_bf16_224x128x64_nt branch February 10, 2026 00:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants