Remove MakeContiguous before CPU inputs of GPU ops. #5590

mzient · 2024-08-01T20:28:15Z

Category:

Refactoring (Redesign of existing code that doesn't affect functionality)

Description:

Remove MakeContiguous before CPU inputs of GPU ops.
Remove redundant (and dangerous) checks in workspace_policy.

Before the unification of TensorList, a MakeContiguous operator had to be inserted between CPU and other backends because TensorVector needed to be converted to TensorList. This is no longer the case, so the operation is superfluous.
The checks in workspace_policy, which used a no-op function, now kicked in and created an invalid workspace. They are no longer necessary and have been removed.
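
For readers unfamiliar with the term, the following is a rough NumPy illustration of what a "make contiguous" copy does in general: it packs separately allocated samples into one flat buffer and exposes per-sample views into it. This is only a conceptual sketch, not DALI's MakeContiguous operator. The function name `make_contiguous` and the assumption of a non-empty batch with a single dtype are hypothetical.

```python
import numpy as np


def make_contiguous(samples):
    """Copy separately allocated samples into one flat buffer.

    Hypothetical helper for illustration only; assumes a non-empty
    list of arrays that all share one dtype.
    """
    total = sum(s.size for s in samples)
    buf = np.empty(total, dtype=samples[0].dtype)
    views = []
    offset = 0
    for s in samples:
        n = s.size
        buf[offset:offset + n] = s.ravel()  # the extra copy
        # View into the shared buffer with the sample's original shape.
        views.append(buf[offset:offset + n].reshape(s.shape))
        offset += n
    return buf, views


if __name__ == "__main__":
    batch = [np.ones((2, 3), dtype=np.float32), np.zeros((4,), dtype=np.float32)]
    buf, views = make_contiguous(batch)
    print(buf.size, [v.shape for v in views])
```

The copy inside the loop is the cost that becomes unnecessary once the consumer can accept the batch in its original layout.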

Additional information:

Affected modules and functionalities:

Key points relevant for the review:

test_pipeline.py - all of it

Checklist

Documentation

DALI team only

Requirements

Implements new requirements
Affects existing requirements
N/A

REQ IDs: N/A

JIRA TASK: DALI-4030

mzient · 2024-08-02T08:50:55Z

dali/pipeline/pipeline_test.cc

-TEST_F(PipelineTestOnce, TestTriggerToContiguous) {
-  RunTestTrigger("cpu");
-}
-


This test checked for the presence of the feature we've just removed.

dali-automaton · 2024-08-02T08:59:24Z

CI MESSAGE: [17143006]: BUILD STARTED

dali-automaton · 2024-08-02T09:36:08Z

CI MESSAGE: [17143849]: BUILD STARTED

mzient · 2024-08-02T10:56:36Z

dali/pipeline/pipeline.cc

-    if (device == "gpu" && separated_execution_)
-      SetupCPUInput(it, input_idx, &spec);


I introduced a delay in Executor RunGPU (in different places) and tested it with the following code:

@params((2, 5), (5, 2))
def test_separated_queues(cpu_queue_depth, gpu_queue_depth):
    data = [np.array((10 * (i + 1)), dtype=np.float32) for i in range(10)]

    @pipeline_def(batch_size=1, num_threads=1, device_id=0)
    def pipe():
        inp = fn.external_source(data, batch=False, cycle=True) + 0
        img = types.Constant(np.zeros(shape=(1, 1, 3), dtype=np.uint8)).gpu()
        return fn.resize(img, resize_x=inp, resize_y=1)  # pass argument input to resize_x

    p = pipe(prefetch_queue_depth={"cpu_size": cpu_queue_depth, "gpu_size": gpu_queue_depth})
    p.build()
    for i in range(10):
        o, = p.run()
        assert o[0].shape()[1] == data[i]

The results were correct, so this no longer seems necessary (I wonder if it ever was).

I'm not adding this test to our regular tests, because it's of little value without the delays (which we certainly don't want to add).
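
The idea behind the experiment above can be shown with a toy model in plain Python: a fast "CPU" stage feeds a deliberately delayed "GPU" stage through bounded queues of different depths, and the check is that every output still pairs with the input of the same iteration. This is a simplified sketch and not DALI's executor. `run_two_stage`, the queue layout, and the `gpu_delay` parameter are all hypothetical stand-ins for the delay injected into RunGPU.

```python
import queue
import threading
import time


def run_two_stage(values, cpu_depth, gpu_depth, gpu_delay=0.001):
    """Toy two-stage pipeline with separate bounded queues (illustration only)."""
    cpu_to_gpu = queue.Queue(maxsize=cpu_depth)
    gpu_out = queue.Queue(maxsize=gpu_depth)

    def cpu_stage():
        for i, v in enumerate(values):
            cpu_to_gpu.put((i, v))  # blocks while the CPU queue is full
        cpu_to_gpu.put(None)        # sentinel: no more work

    def gpu_stage():
        while True:
            item = cpu_to_gpu.get()
            if item is None:
                gpu_out.put(None)   # forward the sentinel
                return
            time.sleep(gpu_delay)   # artificial delay in the slow stage
            gpu_out.put(item)       # blocks while the GPU queue is full

    threads = [threading.Thread(target=cpu_stage),
               threading.Thread(target=gpu_stage)]
    for t in threads:
        t.start()

    results = []
    while True:
        item = gpu_out.get()
        if item is None:
            break
        results.append(item)
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    data = [10 * (i + 1) for i in range(10)]
    for cpu_depth, gpu_depth in [(2, 5), (5, 2)]:
        out = run_two_stage(data, cpu_depth, gpu_depth)
        assert out == list(enumerate(data))
```

As with the real test, the model only says something when the slow stage is actually delayed. Without the delay the stages rarely run far enough apart for a mismatch to show up.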

dali-automaton · 2024-08-02T12:51:31Z

CI MESSAGE: [17143849]: BUILD PASSED

mzient added 2 commits August 1, 2024 22:22

Remove MakeContiguous before CPU inputs of GPU ops. Remove redundant …

f657d6b

…and erroneous checks in workspace_policy. Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

Remove outdated tests.

8c6e9e1

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

Bugfix.

1524201

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

mzient force-pushed the simplify_input_setup branch from 8e3385f to 1524201 Compare August 2, 2024 08:55

Restore test.

b59ae31

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

Remove more input setup.

7b85316

Signed-off-by: Michał Zientkiewicz <mzient@gmail.com>

dali-automaton assigned szalpal and mdabek-nvidia Aug 2, 2024

klecki assigned klecki and unassigned szalpal Aug 2, 2024

klecki approved these changes Aug 2, 2024

mdabek-nvidia approved these changes Aug 2, 2024

mzient merged commit 0e2a2fe into NVIDIA:main Aug 2, 2024
6 checks passed
