fix(execute): use random dataset ids instead of ones hashed from the name #3434

jsternberg · 2021-01-13T21:51:10Z

The executor created each dataset id by hashing the plan node name.
Plan node names are usually unique, but uniqueness is not guaranteed.
When two nodes shared a name, their datasets received the same id,
which could cause a conflict.

This further protects against that situation by changing uuid generation
for dataset ids to use the v4 algorithm, which draws the id from a random
number generator instead of deriving it from the name.
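
To illustrate the difference, here is a minimal sketch using only the Go standard library. It is not the code from this change: `DatasetID`, `idFromName`, and `randomID` are hypothetical names, and the name-based variant assumes a UUID v5 style hash (SHA-1 over a namespace plus the name), which may differ from what the executor actually used.

```go
package main

import (
	"crypto/rand"
	"crypto/sha1"
	"fmt"
)

// DatasetID is a hypothetical stand-in for the executor's dataset id type.
type DatasetID [16]byte

// String renders the id in the usual 8-4-4-4-12 hex layout.
func (id DatasetID) String() string {
	return fmt.Sprintf("%x-%x-%x-%x-%x", id[0:4], id[4:6], id[6:8], id[8:10], id[10:16])
}

// setVersionAndVariant stamps the RFC 4122 version and variant bits.
func setVersionAndVariant(id *DatasetID, version byte) {
	id[6] = (id[6] & 0x0f) | (version << 4)
	id[8] = (id[8] & 0x3f) | 0x80
}

// idFromName derives an id from a name (assumed v5 style, SHA-1 based).
// The same namespace and name always produce the same id.
func idFromName(namespace DatasetID, name string) DatasetID {
	h := sha1.New()
	h.Write(namespace[:])
	h.Write([]byte(name))
	var id DatasetID
	copy(id[:], h.Sum(nil)) // keeps the first 16 bytes of the digest
	setVersionAndVariant(&id, 5)
	return id
}

// randomID draws a v4 id from the system's cryptographic random source.
func randomID() (DatasetID, error) {
	var id DatasetID
	if _, err := rand.Read(id[:]); err != nil {
		return id, err
	}
	setVersionAndVariant(&id, 4)
	return id, nil
}

func main() {
	var ns DatasetID // all-zero namespace, for illustration only

	// Two plan nodes that happen to share a name get the same id.
	a := idFromName(ns, "map1")
	b := idFromName(ns, "map1")
	fmt.Println("name-based ids equal:", a == b)

	// Random ids do not depend on the name at all.
	x, err := randomID()
	if err != nil {
		panic(err)
	}
	y, err := randomID()
	if err != nil {
		panic(err)
	}
	fmt.Println("random ids equal:", x == y)
}
```

With name-based ids, a duplicate node name is enough to produce a duplicate dataset id. With v4 ids, a collision would require two independent 122-bit random values to match.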

Fixes #3053.

Done checklist

docs/SPEC.md updated
Test cases written


jsternberg mentioned this pull request Jul 25, 2022

Address code confusion in the lang package regarding node names #5021

Closed

gavincabbage closed this Jan 30, 2023

jacobmarble deleted the fix/random-dataset-id branch January 4, 2024 16:48
