add UUID as new entry value in the add_entries processor #6653
Open
Zhangxunmt wants to merge 1 commit into opensearch-project:main from
Signed-off-by: Xun Zhang <xunzh@amazon.com>
Description
Added a new entry in the add_entries processor that generates a unique ID for each record. This is needed for cases where the original source data does not contain a unique identifier. The unique ID is essential for running asynchronous batch inference jobs, as it is used to match and merge the inference results back with the source data.
UUID.randomUUID() returns a version 4 (random) UUID generated with a cryptographically strong pseudo-random number generator (Java's SecureRandom). With 122 random bits per ID, the probability of a collision is negligible for practical workloads, which is why random UUIDs are widely used in distributed systems for this kind of problem.
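The idea can be illustrated with a minimal sketch. This is not the processor code from this PR: the class `UuidEntrySketch`, its methods `withUuid` and `merge`, and the field names `id` and `inference` are hypothetical and chosen only for illustration. The only real API used is `java.util.UUID`. The sketch tags each record with a random UUID and later uses that ID to attach an inference result to the matching source record.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.UUID;

// Hypothetical sketch; not the add_entries implementation from this PR.
final class UuidEntrySketch {

    // Returns a copy of the record with a random (version 4) UUID stored under `key`.
    static Map<String, Object> withUuid(Map<String, Object> record, String key) {
        Map<String, Object> copy = new HashMap<>(record);
        copy.put(key, UUID.randomUUID().toString());
        return copy;
    }

    // Attaches each inference result to the source record whose ID (under `key`) matches.
    // Records with no matching result get a null "inference" value.
    static List<Map<String, Object>> merge(List<Map<String, Object>> sources,
                                           Map<String, Object> resultsById,
                                           String key) {
        List<Map<String, Object>> merged = new ArrayList<>();
        for (Map<String, Object> source : sources) {
            Map<String, Object> out = new HashMap<>(source);
            out.put("inference", resultsById.get(source.get(key)));
            merged.add(out);
        }
        return merged;
    }

    public static void main(String[] args) {
        // Source records that carry no unique identifier of their own.
        List<Map<String, Object>> sources = new ArrayList<>();
        for (String text : new String[] {"first document", "second document"}) {
            Map<String, Object> record = new HashMap<>();
            record.put("text", text);
            sources.add(withUuid(record, "id"));
        }

        // Simulated asynchronous batch results, keyed by the generated ID.
        Map<String, Object> resultsById = new HashMap<>();
        for (Map<String, Object> source : sources) {
            resultsById.put((String) source.get("id"), "result for: " + source.get("text"));
        }

        for (Map<String, Object> row : merge(sources, resultsById, "id")) {
            System.out.println(row);
        }
    }
}
```

Because the ID is generated once and stored on the record, the batch results can arrive in any order and still be matched back to their source.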
Usage
Issues Resolved
Resolves #[Issue number to be closed when this PR is merged]
Check List
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.