Add ready event to Tensor and TensorList. #5673

mzient · 2024-10-10T13:54:13Z

Category:

New feature (non-breaking change which adds functionality)
Refactoring (Redesign of existing code that doesn't affect functionality)

Description:

Preliminary work for (cleaner) DLPack support.
DLPack needs to synchronize an stream so that the tensor is ready for use in that stream.
We can't use stream-to-stream synchronization because of prefetching. This PR adds ready_event to Tensor and TensorList to address that.

move SharedEventLease to core
add more complete shared_ptr interface to SharedEventLease
add tests for SharedEventLease
add (set_)ready_event to Tensor and TensorList
minor refactoring in TensorList
remove OperatorIO::event in favor of TensorList's ready_event in exec2

Additional information:

Affected modules and functionalities:

Executor2
Tensor
TensorList

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

Implements new requirements
Affects existing requirements
N/A

REQ IDs: N/A

JIRA TASK: N/A

- move SharedEventLease to core - add more complete shared_ptr interface to SharedEventLease - add tests for SharedEventLease - add (set_)ready_event to Tensor and TensorList - minor refactoring in TensorList - remove OperatorIO::event in favor of TensorList's ready_event in exec2 Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

dali-automaton · 2024-10-10T13:57:36Z

CI MESSAGE: [19219483]: BUILD STARTED

dali-automaton · 2024-10-10T14:34:53Z

CI MESSAGE: [19219483]: BUILD FAILED

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

dali-automaton · 2024-10-10T14:48:11Z

CI MESSAGE: [19220893]: BUILD STARTED

dali-automaton · 2024-10-10T21:26:37Z

CI MESSAGE: [19220893]: BUILD PASSED

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

dali-automaton · 2024-10-11T10:01:27Z

CI MESSAGE: [19248455]: BUILD STARTED

dali-automaton · 2024-10-12T05:59:40Z

CI MESSAGE: [19248455]: BUILD PASSED

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

dali-automaton · 2024-10-14T11:17:48Z

CI MESSAGE: [19327083]: BUILD STARTED

mzient · 2024-10-14T11:18:20Z

dali/pipeline/executor/executor2/exec_node_task.cc

-      if (!ptr->shares_data()) {
-        if (AccessOrder consumer_order = OutputConsumerStream(o))
-          ptr->set_order(consumer_order, false);
+      if (ptr->is_pinned()) {


This fixes a bug - previously we would set the consumer order for non-pinned host buffers - this PR adds an assert that verifies that such buffers are always in host order.

szkarpinski · 2024-10-14T11:59:44Z

include/dali/core/cuda_shared_event.h

+  // Hack: use shared_ptr<void> to store a CUDA event - shared_ptr doesn't care whether the pointer
+  // it manages is a real pointer or something else as long as:
+  // - null value is equivalent to nullptr
+  // - the provided deleter can free the object.


Just to make sure: is this guaranteed by the standard, or is it an implementation detail of shared_ptr?

Let's put it differently: we've already been using it this way for 6 years - we use shared_ptr to manage device memory, which is also non-dereferenceble on host, so it's effectively an opaque handle.

dali-automaton · 2024-10-14T20:33:28Z

CI MESSAGE: [19327083]: BUILD PASSED

mzient added 2 commits October 10, 2024 15:49

Propagate ready_event in As(Reshaped)Tensor.

b9bbea5

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

Fix clang-only build.

013c3d5

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

Rename SharedEventLease to CUDASharedEvent and make it more universal.

05d29ad

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

dali-automaton assigned szkarpinski and banasraf Oct 11, 2024

Don't set the order of non-pinned CPU buffers.

8feeb98

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

mzient commented Oct 14, 2024

View reviewed changes

szkarpinski reviewed Oct 14, 2024

View reviewed changes

szkarpinski approved these changes Oct 14, 2024

View reviewed changes

banasraf approved these changes Oct 14, 2024

View reviewed changes

mzient merged commit 3db39b1 into NVIDIA:main Oct 14, 2024
5 of 6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ready event to Tensor and TensorList. #5673

Add ready event to Tensor and TensorList. #5673

mzient commented Oct 10, 2024

dali-automaton commented Oct 10, 2024

dali-automaton commented Oct 10, 2024

dali-automaton commented Oct 10, 2024

dali-automaton commented Oct 10, 2024

dali-automaton commented Oct 11, 2024

dali-automaton commented Oct 12, 2024

dali-automaton commented Oct 14, 2024

mzient Oct 14, 2024

szkarpinski Oct 14, 2024

mzient Oct 14, 2024 •

edited

Loading

dali-automaton commented Oct 14, 2024

Add ready event to Tensor and TensorList. #5673

Add ready event to Tensor and TensorList. #5673

Conversation

mzient commented Oct 10, 2024

Category:

Description:

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

Tests:

Checklist

Documentation

DALI team only

Requirements

dali-automaton commented Oct 10, 2024

dali-automaton commented Oct 10, 2024

dali-automaton commented Oct 10, 2024

dali-automaton commented Oct 10, 2024

dali-automaton commented Oct 11, 2024

dali-automaton commented Oct 12, 2024

dali-automaton commented Oct 14, 2024

mzient Oct 14, 2024

Choose a reason for hiding this comment

szkarpinski Oct 14, 2024

Choose a reason for hiding this comment

mzient Oct 14, 2024 • edited Loading

Choose a reason for hiding this comment

dali-automaton commented Oct 14, 2024

mzient Oct 14, 2024 •

edited

Loading