Skip to content

Conversation

umangyadav
Copy link
Member

@umangyadav umangyadav commented Oct 15, 2025

Motivation

Jenkins PR CI tests are taking a lot longer right now mainly due to CEG & GEG tests. These tests happens on three different data types F32, FP16 and BF16 and also with Split-K Enabled.

Currently some of these tests are also using batch size of 64. For the testing purposes smaller batch size is sufficient.

This PR aims to lower runtime to speed up testing locally and also on Jenkins.

Technical Details

CEG/GEG tests are for three different dtypes. IMO it is sufficient to just test F16 in PR CI and keep F32 and BF16 in nightly.

Test Plan

Run both PR CI and Nightly and measure runtime for E2E tests and compare them with current runtimes. Make sure nightly doesn't timeout with these additional tests.

Test Result

  • PR CI
  • Nightly CI
  • Report back Runtime reduction in PR CI
  • Shared Library E2E test time on MI100 reduced from 30minutes to 18minutes.
  • Shared Library E2E test time on Navi4x reduced from 27minutes to 17minutes.
  • Is nightly timing out ?

… tests into nightly. Move all F16 split-k tests into nightly as well except keeping one for the PR CI.
@umangyadav umangyadav requested a review from causten as a code owner October 15, 2025 18:16
@umangyadav umangyadav requested review from Copilot, dhernandez0, justinrosner and pabloantoniom and removed request for causten and dhernandez0 October 15, 2025 18:17
@umangyadav umangyadav self-assigned this Oct 15, 2025
Copy link
Contributor

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

Moves heavier GEG/CEG E2E tests out of PR CI into nightly and lowers workloads in PR to reduce runtime.

  • Repoint F32/BF16 GEG/CEG suites to nightly by renaming directories/suite names from Pr* to non-Pr*.
  • Reduce PR test sizes (notably batch size) and trim permutations.
  • Add new F16 nightly suites and arch gating cfgs; update CMake to split PR vs nightly coverage.

Reviewed Changes

Copilot reviewed 18 out of 26 changed files in this pull request and generated 2 comments.

Show a summary per file
File Description
mlir/test/e2e/PrGemmElementwiseGemmF16SplitK.toml Reduced PR test set to a smaller config to speed up CI
mlir/test/e2e/PrConvElementwiseGemmF16SplitK.toml Reduced PR configs; switched to batchsize=1
mlir/test/e2e/PrConvElementwiseGemmF16.toml Simplified PR conv+gemm F16 suite to a small config
mlir/test/e2e/GemmElementwiseGemmF32SplitK.toml Nightly: rename directory and suite name to non-Pr; keep config
mlir/test/e2e/GemmElementwiseGemmF32.toml Nightly: rename directory and suite name to non-Pr
mlir/test/e2e/GemmElementwiseGemmF16SplitK.toml New nightly F16 split-K gemm+gemm suite
mlir/test/e2e/GemmElementwiseGemmF16SplitK.cfg New nightly gating for F16 split-K (requires mfma/wmma and atomic_add_f16)
mlir/test/e2e/GemmElementwiseGemmBF16SplitK.toml Nightly: rename directory and suite name to non-Pr
mlir/test/e2e/GemmElementwiseGemmBF16.toml Nightly: rename directory and suite name to non-Pr
mlir/test/e2e/GemmElementwiseGemmBF16.cfg New nightly gating for BF16 (requires mfma/wmma)
mlir/test/e2e/ConvElementwiseGemmF32SplitK.toml Nightly: rename directory; reduce workloads; suite name still “pr_*”
mlir/test/e2e/ConvElementwiseGemmF32.toml Nightly: rename directory/suite name; reduce workloads
mlir/test/e2e/ConvElementwiseGemmF16SplitK.toml New nightly F16 conv+gemm split-K suite
mlir/test/e2e/ConvElementwiseGemmF16SplitK.cfg New nightly gating for F16 conv split-K (mfma/wmma and atomic_add_f16)
mlir/test/e2e/ConvElementwiseGemmF16.toml New nightly F16 conv+gemm suite
mlir/test/e2e/ConvElementwiseGemmBF16SplitK.toml Nightly: rename directory/suite name; reduce workloads
mlir/test/e2e/ConvElementwiseGemmBF16.toml Nightly: rename directory/suite name; reduce workloads
mlir/test/e2e/CMakeLists.txt Split PR vs nightly test lists accordingly

Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

@umangyadav umangyadav requested a review from Copilot October 15, 2025 18:33
Copy link
Contributor

@Copilot Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants