@aggarwalmayank

Description

Describe what this change achieves.
In OpenSearch Serverless (AOSS) time series collections, document creation and update operations fail when a document ID is specified. This change removes the doc ID from writes, since it is not required for the skipping index to work.

Related Issues

List any issues this PR will resolve, e.g. Resolves [...].
#1012

Check List

  • [NA] Updated documentation (docs/ppl-lang/README.md)
  • [NA] Implemented unit tests
  • [NA] Implemented tests for combination with other commands
  • [NA] New added source code should include a copyright header
  • [Y] Commits are signed per the DCO using --signoff
  • Add backport 0.x label if it is a stable change which won't break existing feature

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Mayank Aggarwal <aggmayan@amazon.com>
Collaborator

@dai-chen dai-chen left a comment

Please add a new IT for the skipping index without a doc ID that simulates duplicate documents in the index and verifies that the behavior is still correct.

df.getOrElse(spark.read.table(quotedTableName(tableName)))
  .groupBy(input_file_name().as(FILE_PATH_COLUMN))
  .agg(namedAggFuncs.head, namedAggFuncs.tail: _*)
  .withColumn(ID_COLUMN, sha1(col(FILE_PATH_COLUMN)))
Collaborator

Is this a PoC PR? For production, I think we should make this configurable instead of always removing it.
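
To make the configurable option concrete, here is a minimal sketch in plain Scala (no Spark dependency). It assumes a hypothetical boolean flag `writeDocId`; the names `IndexRow`, `sha1Hex`, and `buildRows` are illustrative only and do not come from this PR or the Flint codebase. `sha1Hex` stands in for `sha1(col(FILE_PATH_COLUMN))`.

```scala
import java.nio.charset.StandardCharsets
import java.security.MessageDigest

object SkippingIndexDocIdSketch {
  // Hypothetical shape: one aggregated skipping-index row per source file.
  final case class IndexRow(filePath: String, docId: Option[String])

  // Hex-encoded SHA-1 of the UTF-8 bytes of the file path.
  // Stand-in for sha1(col(FILE_PATH_COLUMN)) in the snippet above.
  def sha1Hex(value: String): String =
    MessageDigest
      .getInstance("SHA-1")
      .digest(value.getBytes(StandardCharsets.UTF_8))
      .map(b => "%02x".format(b))
      .mkString

  // writeDocId is an assumed setting, not an existing option.
  // When false, the ID is omitted so the target store can assign its own.
  def buildRows(filePaths: Seq[String], writeDocId: Boolean): Seq[IndexRow] =
    filePaths.distinct.map { path =>
      IndexRow(path, if (writeDocId) Some(sha1Hex(path)) else None)
    }
}
```

With the flag off, nothing deduplicates on write, so a refresh that reprocesses a file could append a second row for it. That is the duplicate case the requested IT would need to cover.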

@dai-chen dai-chen added the enhancement and Core:Skipping labels Aug 13, 2025
@opensearch-trigger-bot

This PR is stalled because it has been open for 30 days with no activity.

@RyanL1997
Contributor

Hi @aggarwalmayank, do we still need this change?

@opensearch-trigger-bot

This PR is stalled because it has been open for 30 days with no activity.

@RyanL1997
Contributor

Closing this one for now. Feel free to reopen if needed, @aggarwalmayank.

@RyanL1997 RyanL1997 closed this Oct 28, 2025