Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix(execute): use random dataset ids instead of ones hashed from the name #3434

Closed
wants to merge 1 commit into from

Conversation

jsternberg
Copy link
Contributor

The executor would create the dataset id by using the plan node name.
While plan node names are usually unique, they aren't necessarily
guaranteed to be unique. This would cause multiple datasets to have the
same id and could cause a conflict.

This further protects against that situation by changing uuid generation
for dataset ids to use the v4 algorithm which uses a random number
generator.

Fixes #3053.

Done checklist

  • docs/SPEC.md updated
  • Test cases written

…name

The executor would create the dataset id by using the plan node name.
While plan node names are usually unique, they aren't necessarily
guaranteed to be unique. This would cause multiple datasets to have the
same id and could cause a conflict.

This further protects against that situation by changing uuid generation
for dataset ids to use the v4 algorithm which uses a random number
generator.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

union(): Inconsistent results between runs
2 participants