Flaky MetricAggregatorTests #18331

ajleong623 · 2025-05-19T05:55:08Z

Description

This change keeps MetricAggregatorTests from failing on the range query. In the test, a range query was applied to a keyword field, and the difference between the expected aggregation and the actual aggregation was not zero. On the field "keyword_field", the boolean query had two should clauses: a range from 18 to 0 and an exact match on 37.

In the expected aggregation, only the documents that matched 37 were added together, but the actual aggregation also added documents that somehow satisfied the range from 18 to 0. I think the cause is that "keyword_field" holds string values. In the star tree filters, the range values are interpreted as mapped ordinals (https://github.com/opensearch-project/OpenSearch/blob/main/server/src/main/java/org/opensearch/search/startree/filter/RangeMatchDimFilter.java), whereas the expected query, which does not use star tree filters, appears to compare the strings by their byte (BytesRef) values (https://lucene.apache.org/core/6_6_0/core/org/apache/lucene/search/TermRangeQuery.html). For now, I think we should avoid using keyword strings in range queries.
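
To illustrate why string bounds can behave unexpectedly in a range query, here is a minimal Java sketch. It is not taken from the OpenSearch or Lucene code, and it does not reproduce the star tree ordinal mapping. The class name KeywordRangeSketch, its methods, and the sample values "2" and "100" with bounds "0" and "18" are assumptions chosen for illustration. It only contrasts a numeric range check with a range check on UTF-8 bytes, which is the kind of ordering a byte-based term comparison uses.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical illustration only; not part of OpenSearch or Lucene.
public class KeywordRangeSketch {

    // Inclusive range check on numeric values.
    static boolean inNumericRange(long value, long lower, long upper) {
        return value >= lower && value <= upper;
    }

    // Inclusive range check on the UTF-8 bytes of the strings,
    // compared lexicographically as unsigned bytes.
    static boolean inByteRange(String value, String lower, String upper) {
        byte[] v = value.getBytes(StandardCharsets.UTF_8);
        byte[] lo = lower.getBytes(StandardCharsets.UTF_8);
        byte[] hi = upper.getBytes(StandardCharsets.UTF_8);
        return Arrays.compareUnsigned(v, lo) >= 0 && Arrays.compareUnsigned(v, hi) <= 0;
    }

    public static void main(String[] args) {
        // Assumed sample values; the same inputs give different answers
        // depending on which ordering the range check uses.
        for (String s : new String[] { "2", "100" }) {
            boolean numeric = inNumericRange(Long.parseLong(s), 0, 18);
            boolean bytes = inByteRange(s, "0", "18");
            System.out.println(s + ": numeric=" + numeric + ", bytes=" + bytes);
        }
    }
}
```

With these illustrative inputs, "2" falls inside the numeric range but outside the byte-order range, and "100" is the reverse. Any two code paths that disagree on the ordering of keyword values can therefore select different documents for the same bounds.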

Related Issues

Resolves #18110

Check List

Functionality includes testing.
API changes companion pull request created, if applicable.
Public documentation issue/PR created, if applicable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Anthony Leong <aj.leong623@gmail.com>

github-actions · 2025-05-19T06:11:37Z

❌ Gradle check result for 33d077b: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

github-actions · 2025-05-19T08:04:11Z

❌ Gradle check result for 501fbcb: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

github-actions · 2025-05-20T01:28:38Z

❕ Gradle check result for 501fbcb: UNSTABLE

Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

codecov · 2025-05-20T01:29:07Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 72.47%. Comparing base (19aa0f8) to head (eb7f003).
Report is 17 commits behind head on main.

Additional details and impacted files

@@             Coverage Diff              @@
##               main   #18331      +/-   ##
============================================
- Coverage     72.53%   72.47%   -0.06%     
+ Complexity    67389    67352      -37     
============================================
  Files          5488     5488              
  Lines        311069   311069              
  Branches      45217    45217              
============================================
- Hits         225622   225443     -179     
- Misses        67083    67266     +183     
+ Partials      18364    18360       -4

sandeshkr419

Hi @ajleong623

Appreciate your time to debug this case.
I have minor comments.

Also, along with this change, can you replace the places where raw query builders are created with the RangeQueryBuilder/TermQueryBuilder constructors so that they use the getTermQueryBuilder() and getRangeQueryBuilder() methods directly? I know this is not part of your changes, but a little code cleanup can certainly help.

Also, please run the test N times (100 to 500; this can be configured in IntelliJ) to confirm that we are actually fixing the flakiness.

Thanks!

server/src/test/java/org/opensearch/search/aggregations/startree/MetricAggregatorTests.java

ajleong623 · 2025-05-20T20:39:56Z

@sandeshkr419 Thank you so much for the suggestions and review. I believe I made the requested changes and also ran the test 100 times. Let me know if there are any other suggestions.

sandeshkr419

Just added a few minor nit-picks. The rest looks good.

server/src/test/java/org/opensearch/search/aggregations/startree/MetricAggregatorTests.java

github-actions · 2025-05-20T22:29:02Z

❕ Gradle check result for eb7f003: UNSTABLE

Please review all flaky tests that succeeded after retry and create an issue if one does not already exist to track the flaky failure.

ajleong623 · 2025-05-21T16:41:09Z

The new changes are ready.

flaky test repo

33d077b

Signed-off-by: Anthony Leong <aj.leong623@gmail.com>

ajleong623 marked this pull request as ready for review May 19, 2025 05:55

ajleong623 requested review from anasalkouz, andrross, ashking94, Bukhtawar, CEHENKLE, cwperks, dbwiddis, gbbafna, jed326 and kotwanikunal as code owners May 19, 2025 05:55

github-actions bot added the >test-failure label (Test failure from CI, local build, etc.) May 19, 2025

ajleong623 requested a review from mch2 as a code owner May 19, 2025 05:55

github-actions bot added the autocut and disabled-test (Issues that are used by an AwaitsFix annotation to temporarily disable a broken test) labels May 19, 2025

ajleong623 requested a review from msfroh as a code owner May 19, 2025 05:55

github-actions bot added the flaky-test label (Random test failure that succeeds on second run) May 19, 2025

ajleong623 requested a review from owaiskazi19 as a code owner May 19, 2025 05:55

github-actions bot added the Search:Aggregations label May 19, 2025

ajleong623 requested review from reta, Rishikesh1159, sachinpkale, saratvemulapalli, shwetathareja, sohami, VachaShah and a team as code owners May 19, 2025 05:55

spotless

501fbcb

Signed-off-by: Anthony Leong <aj.leong623@gmail.com>

ajleong623 closed this May 19, 2025

ajleong623 reopened this May 20, 2025

ajleong623 marked this pull request as draft May 20, 2025 00:18

ajleong623 marked this pull request as ready for review May 20, 2025 01:38

sandeshkr419 reviewed May 20, 2025

server/src/test/java/org/opensearch/search/aggregations/startree/MetricAggregatorTests.java (outdated, resolved)

server/src/test/java/org/opensearch/search/aggregations/startree/MetricAggregatorTests.java (outdated, resolved)

opensearch-ci-bot mentioned this pull request May 20, 2025

[AUTOCUT] Gradle Check Flaky Test Report for SearchRestCancellationIT #14311

Open

cleaned up code with methods, changed how keyword terms are handles in range queries

d8d7a58

Signed-off-by: Anthony Leong <aj.leong623@gmail.com>

sandeshkr419 approved these changes May 20, 2025

sandeshkr419 added the skip-changelog label May 20, 2025

ajleong623 added 3 commits May 20, 2025 14:12

small changes

148772b

Signed-off-by: Anthony Leong <aj.leong623@gmail.com>

nonconsequential change

e73ce8c

Signed-off-by: Anthony Leong <aj.leong623@gmail.com>

revert recent change

eb7f003

Signed-off-by: Anthony Leong <aj.leong623@gmail.com>

opensearch-ci-bot mentioned this pull request May 21, 2025

[AUTOCUT] Gradle Check Flaky Test Report for AzureBlobStoreRepositoryTests #14291

Open

sandeshkr419 approved these changes May 21, 2025

andrross approved these changes May 22, 2025

andrross merged commit 243ba6a into opensearch-project:main May 22, 2025
30 checks passed

opensearch-ci-bot mentioned this pull request May 23, 2025

[AUTOCUT] Gradle Check Flaky Test Report for IndexingIT #14302

Open

tandonks pushed a commit to tandonks/OpenSearch that referenced this pull request Jun 1, 2025

Fix MetricAggregatorTests.testStarTreeDocValues (opensearch-project#18331)

c7bd4d9

Signed-off-by: Anthony Leong <aj.leong623@gmail.com>
