Skip to content

Use 'FIXED' path type in DedicatedClusterSnapshotRestoreIT #17996

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
Apr 18, 2025

Conversation

andrross
Copy link
Member

@andrross andrross commented Apr 17, 2025

If HASHED_PREFIX or HASHED_INFIX is used in this test then it frequently fails. @ashking94 Can you take a look here? I'm guessing there might be a bug with snapshot cleanup for the new remote store path types. This is really more of a short-term work-around to avoid the new path types for this particular test case, but it does seem to resolve the flakiness of the test.

Related Issues

Resolves #15806

Check List

  • Functionality includes testing.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

@andrross
Copy link
Member Author

For posterity here are three seeds that reliably fail without this change:

./gradlew ':server:internalClusterTest' --tests "org.opensearch.snapshots.DedicatedClusterSnapshotRestoreIT.testSnapshotWithStuckNode" -Dtests.seed=8529B1DD622216C1

./gradlew ':server:internalClusterTest' --tests "org.opensearch.snapshots.DedicatedClusterSnapshotRestoreIT.testSnapshotWithStuckNode" -Dtests.seed=AE568A72925374C5

./gradlew ':server:internalClusterTest' --tests "org.opensearch.snapshots.DedicatedClusterSnapshotRestoreIT.testSnapshotWithStuckNode" -Dtests.seed=97C1E518CA9406A4

@andrross
Copy link
Member Author

I ran this test in a loop for about an hour choosing a different seed every time and saw no failures. Without this change the same scenario will usually fail in less than a minute.

Copy link
Contributor

❌ Gradle check result for aebac93: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Signed-off-by: Andrew Ross <andrross@amazon.com>
@ashking94
Copy link
Member

Sure, @andrross. Let me take a look at this.

@andrross
Copy link
Member Author

@ashking94 FYI, I was later able to trace this down to a specific commit: #15806 (comment) I think @gbbafna is also looking at this. In the short term I think we can still merge this PR to reduce/remove the flakiness.

Copy link
Contributor

❌ Gradle check result for 850ecd7: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

@ashking94
Copy link
Member

@ashking94 FYI, I was later able to trace this down to a specific commit: #15806 (comment) I think @gbbafna is also looking at this. In the short term I think we can still merge this PR to reduce/remove the flakiness.

Makes sense, approved.

@ashking94
Copy link
Member

Restarted the PR build.

Copy link
Contributor

❌ Gradle check result for 850ecd7: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Copy link
Contributor

❌ Gradle check result for 850ecd7: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

Copy link
Contributor

✅ Gradle check result for 850ecd7: SUCCESS

Copy link

codecov bot commented Apr 18, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 72.45%. Comparing base (cbaddd3) to head (850ecd7).
Report is 25 commits behind head on main.

Additional details and impacted files
@@             Coverage Diff              @@
##               main   #17996      +/-   ##
============================================
- Coverage     72.51%   72.45%   -0.06%     
+ Complexity    67108    67052      -56     
============================================
  Files          5475     5478       +3     
  Lines        309916   310034     +118     
  Branches      45060    45066       +6     
============================================
- Hits         224725   224631      -94     
- Misses        66895    67031     +136     
- Partials      18296    18372      +76     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@andrross andrross merged commit 26beb0f into opensearch-project:main Apr 18, 2025
31 checks passed
@andrross andrross deleted the dcsr-it branch April 18, 2025 19:20
opensearch-trigger-bot bot pushed a commit that referenced this pull request Apr 22, 2025
Signed-off-by: Andrew Ross <andrross@amazon.com>
(cherry picked from commit 26beb0f)
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
andrross pushed a commit that referenced this pull request Apr 22, 2025
Signed-off-by: Andrew Ross <andrross@amazon.com>
(cherry picked from commit 26beb0f)
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
andrross pushed a commit that referenced this pull request Apr 22, 2025
…18032)

(cherry picked from commit 26beb0f)

Signed-off-by: Andrew Ross <andrross@amazon.com>
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
x-INFiN1TY-x pushed a commit to x-INFiN1TY-x/OpenSearch_Local that referenced this pull request Apr 24, 2025
…h-project#17996)

Signed-off-by: Andrew Ross <andrross@amazon.com>
Signed-off-by: Tanishq Ranjan <tqranjan@amazon.com>
prudhvigodithi pushed a commit to prudhvigodithi/OpenSearch that referenced this pull request May 6, 2025
…h-project#17996) (opensearch-project#18032)

(cherry picked from commit 26beb0f)

Signed-off-by: Andrew Ross <andrross@amazon.com>
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Signed-off-by: Prudhvi Godithi <pgodithi@amazon.com>
Harsh-87 pushed a commit to Harsh-87/OpenSearch that referenced this pull request May 7, 2025
…h-project#17996)

Signed-off-by: Andrew Ross <andrross@amazon.com>
Signed-off-by: Harsh Kothari <techarsh@amazon.com>
Harsh-87 pushed a commit to Harsh-87/OpenSearch that referenced this pull request May 7, 2025
…h-project#17996)

Signed-off-by: Andrew Ross <andrross@amazon.com>
Signed-off-by: Harsh Kothari <techarsh@amazon.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
autocut backport 3.0 flaky-test Random test failure that succeeds on second run skip-changelog >test-failure Test failure from CI, local build, etc.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[AUTOCUT] Gradle Check Flaky Test Report for DedicatedClusterSnapshotRestoreIT
2 participants