-
Notifications
You must be signed in to change notification settings - Fork 49
Move some of the E2E tests for GEG & CEG into nightly runs #2036
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Conversation
… tests into nightly. Move all F16 split-k tests into nightly as well except keeping one for the PR CI.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull Request Overview
Moves heavier GEG/CEG E2E tests out of PR CI into nightly and lowers workloads in PR to reduce runtime.
- Repoint F32/BF16 GEG/CEG suites to nightly by renaming directories/suite names from Pr* to non-Pr*.
- Reduce PR test sizes (notably batch size) and trim permutations.
- Add new F16 nightly suites and arch gating cfgs; update CMake to split PR vs nightly coverage.
Reviewed Changes
Copilot reviewed 18 out of 26 changed files in this pull request and generated 2 comments.
Show a summary per file
File | Description |
---|---|
mlir/test/e2e/PrGemmElementwiseGemmF16SplitK.toml | Reduced PR test set to a smaller config to speed up CI |
mlir/test/e2e/PrConvElementwiseGemmF16SplitK.toml | Reduced PR configs; switched to batchsize=1 |
mlir/test/e2e/PrConvElementwiseGemmF16.toml | Simplified PR conv+gemm F16 suite to a small config |
mlir/test/e2e/GemmElementwiseGemmF32SplitK.toml | Nightly: rename directory and suite name to non-Pr; keep config |
mlir/test/e2e/GemmElementwiseGemmF32.toml | Nightly: rename directory and suite name to non-Pr |
mlir/test/e2e/GemmElementwiseGemmF16SplitK.toml | New nightly F16 split-K gemm+gemm suite |
mlir/test/e2e/GemmElementwiseGemmF16SplitK.cfg | New nightly gating for F16 split-K (requires mfma/wmma and atomic_add_f16) |
mlir/test/e2e/GemmElementwiseGemmBF16SplitK.toml | Nightly: rename directory and suite name to non-Pr |
mlir/test/e2e/GemmElementwiseGemmBF16.toml | Nightly: rename directory and suite name to non-Pr |
mlir/test/e2e/GemmElementwiseGemmBF16.cfg | New nightly gating for BF16 (requires mfma/wmma) |
mlir/test/e2e/ConvElementwiseGemmF32SplitK.toml | Nightly: rename directory; reduce workloads; suite name still “pr_*” |
mlir/test/e2e/ConvElementwiseGemmF32.toml | Nightly: rename directory/suite name; reduce workloads |
mlir/test/e2e/ConvElementwiseGemmF16SplitK.toml | New nightly F16 conv+gemm split-K suite |
mlir/test/e2e/ConvElementwiseGemmF16SplitK.cfg | New nightly gating for F16 conv split-K (mfma/wmma and atomic_add_f16) |
mlir/test/e2e/ConvElementwiseGemmF16.toml | New nightly F16 conv+gemm suite |
mlir/test/e2e/ConvElementwiseGemmBF16SplitK.toml | Nightly: rename directory/suite name; reduce workloads |
mlir/test/e2e/ConvElementwiseGemmBF16.toml | Nightly: rename directory/suite name; reduce workloads |
mlir/test/e2e/CMakeLists.txt | Split PR vs nightly test lists accordingly |
Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.
Motivation
Jenkins PR CI tests are taking a lot longer right now mainly due to CEG & GEG tests. These tests happens on three different data types F32, FP16 and BF16 and also with Split-K Enabled.
Currently some of these tests are also using batch size of 64. For the testing purposes smaller batch size is sufficient.
This PR aims to lower runtime to speed up testing locally and also on Jenkins.
Technical Details
CEG/GEG tests are for three different dtypes. IMO it is sufficient to just test F16 in PR CI and keep F32 and BF16 in nightly.
Test Plan
Run both PR CI and Nightly and measure runtime for E2E tests and compare them with current runtimes. Make sure nightly doesn't timeout with these additional tests.
Test Result