Adding _source and schema merging to index_mappings (#1101) (#1155)
* Fix antlr4 parser issues (#1094)
* Fix antlr4 parser issues
* Case insensitive lexer
* revert useless change
* remove tokens file
---------
* adding _source to index_mappings
* syntax fix
* Apply scalafmt
* Added index_mappings as an option in index.md, applied scalafmtAll
* improve readability
* Removed index_mappings from FlintMetadata.scala, Modified index.md
* removed indexMappingsSourceEnabled from FlintMetadata.scala
* removed indexMappingsSourceEnabled from FlintMetadata.scala
* removed indexMappingsSourceEnabled from FlintMetadata.scala
* Removed indexMappingsSourceEnabled from FlintMetadata.scala and removed unnecessary code
* Added some test cases to test serialize() and fixed some formatting issues
* Added some test cases for FlintOpenSearchIndexMetadataServiceSuite.scala
* Added schema merging to index_mappings, added some test cases
* updated test cases
* Minor format fix
* minor fixes
* added nested schema merging logic, moved mergeSchema to serialize, updated test cases, fixed some minor issues
* updated some comments
* fixed some formatting issues based on the comments
* fixed syntax issue
* syntax issue
* syntax issue
* fixed the FlintSparkSkippingIndexITSuite
* fixing schema merging limitation
* less scala/java conversion
* style fix
* fix unnecessary casting
---------
(cherry picked from commit 76d35e2)
Signed-off-by: Lantao Jin <ltjin@amazon.com>
Signed-off-by: Kai Huang <ahkcs@amazon.com>
Signed-off-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Lantao Jin <ltjin@amazon.com>
docs/index.md
@@ -394,6 +394,7 @@ User can provide the following options in `WITH` clause of create statement:
+`watermark_delay`: a string as time expression for how late data can come and still be processed, e.g. 1 minute, 10 seconds. This is required by auto and incremental refresh on materialized view if it has aggregation in the query.
+`output_mode`: a mode string that describes how data will be written to streaming sink. If unspecified, default append mode will be applied.
+`index_settings`: a JSON string as index settings for OpenSearch index that will be created. Please follow the format in OpenSearch documentation. If unspecified, default OpenSearch index settings will be applied.
+`index_mappings`: A JSON string specifying additional OpenSearch index mappings, such as metadata fields (e.g., _source) or mapping parameters (e.g., enabled, index, etc.). This allows customizing certain parts of the index mappings. The base mappings are automatically generated; if unspecified, only the defaults will be applied. Refer to [OpenSearch metadata fields](https://docs.opensearch.org/docs/latest/field-types/metadata-fields/source/) and [mapping parameters](https://docs.opensearch.org/docs/latest/field-types/mapping-parameters/index/) for supported options.
+`id_expression`: an expression string that generates an ID column to guarantee idempotency when index refresh job restart or any retry attempt during an index refresh. If an empty string is provided, no ID column will be generated.
+`extra_options`: a JSON string as extra options that can be passed to Spark streaming source and sink API directly. Use qualified source table name (because there could be multiple) and "sink", e.g. '{"sink": "{key: val}", "table1": {key: val}}'
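
To illustrate the `index_mappings` option, here is a minimal Python sketch of how a user-supplied mappings JSON string could be merged into automatically generated base mappings. It is not the Flint implementation (which lives in the Scala code touched by this commit). The function name `merge_mappings`, the sample fields, and the rule that user-supplied values win on conflict are all assumptions made for illustration.

```python
import json


def merge_mappings(base: dict, extra: dict) -> dict:
    """Recursively merge `extra` into a copy of `base`.

    Nested dicts are merged key by key. For any other value, the
    user-supplied entry replaces the generated one (assumed precedence).
    """
    merged = dict(base)  # shallow copy so the caller's dict is not mutated
    for key, value in extra.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = merge_mappings(merged[key], value)
        else:
            merged[key] = value
    return merged


# Hypothetical base mappings generated from the index schema.
base_mappings = {
    "properties": {
        "name": {"type": "keyword"},
        "age": {"type": "integer"},
    }
}

# A value a user might pass as the `index_mappings` option (a JSON string).
index_mappings_option = (
    '{"_source": {"enabled": false},'
    ' "properties": {"name": {"index": false}}}'
)

merged = merge_mappings(base_mappings, json.loads(index_mappings_option))
print(json.dumps(merged, indent=2))
```

In this sketch the `_source` metadata field is added at the top level, while the `index` parameter is merged into the existing `name` field without dropping its generated `type`. In a create statement, the same JSON string would be supplied as the `index_mappings` value in the `WITH` clause.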