Support inferring batch size from tensor argument inputs #4617
Conversation
@@ -245,30 +245,6 @@ class DLTensorPythonFunctionImpl : public Operator<Backend> {
   bool synchronize_stream_;
   bool batch_processing;
   std::vector<TensorLayout> output_layouts_;
 
- private:
-  int GetCurrBatchSize(Workspace &ws) {
Getting the batch size can be done with InferBatchSizeFromInput. The extra checks were unnecessary: the EnforceUniformInput/OutputBatchSize at the operator level already does that. And as for the check on the requested batch size (ws.GetRequestedBatchSize(i) == ws.GetRequestedBatchSize(0)), the value is set uniformly by batch_sizes_.resize(NumOutput(), batch_size), so it was true anyway.
Force-pushed from 40726d2 to 916bc82
@@ -72,7 +72,7 @@ class TransformBaseOp : public SequenceOperator<Backend, true> {
 
   bool SetupImpl(std::vector<OutputDesc> &output_descs, const Workspace &ws) override {
     has_input_ = ws.NumInput() > 0;
-    auto curr_batch_size = has_input_ ? ws.GetInputBatchSize(0) : ws.GetRequestedBatchSize(0);
+    auto curr_batch_size = InferBatchSizeFromInput(ws);
This defeats the purpose of RequestedBatchSize. Instead, move InferBatchSize to the executor and invoke it prior to SetBatchSize. It will be a workaround, but at least we'll have one place where we can change/improve batch size inference instead of spreading it across operators. It also allows for some smarter solutions or a non-local batch source.
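A minimal sketch of how such an executor-level hook could look (the classes and names below are simplified stand-ins for illustration, not DALI's actual executor or Workspace API): the executor infers the batch size once, before the operator's setup runs, and publishes it as the requested batch size, so operators can keep relying on GetRequestedBatchSize.

```python
# Hypothetical, simplified sketch -- illustrative names, not DALI's real classes.

class ToyWorkspace:
    """Stand-in for an operator workspace with positional tensor inputs."""
    def __init__(self, input_batch_sizes, stage_batch_size):
        self.input_batch_sizes = list(input_batch_sizes)
        self.requested_batch_size = stage_batch_size

def infer_batch_size_from_input(ws):
    # Take the batch size from the first tensor input, if there is one.
    return ws.input_batch_sizes[0] if ws.input_batch_sizes else None

def executor_set_batch_size(ws, stage_batch_size):
    # One central place: infer from the inputs, fall back to the stage queue size,
    # and publish the result before the operator's SetupImpl is invoked.
    inferred = infer_batch_size_from_input(ws)
    ws.requested_batch_size = inferred if inferred is not None else stage_batch_size

ws = ToyWorkspace(input_batch_sizes=[7], stage_batch_size=8)
executor_set_batch_size(ws, stage_batch_size=8)
assert ws.requested_batch_size == 7  # operators simply read the requested batch size
```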
Move the batch size calculation to the executor and keep RequestedBatchSize.
Signed-off-by: Kamil Tokarski <ktokarski@nvidia.com>
Force-pushed from 916bc82 to 7403586
def test_named_tensor_arguments(op):

ops2params = {
    fn.permute_batch: _tensor_arg_permute_batch_params,
permute_batch does have an input - although by the original design the output batch size was taken from indices. I don't think there's any special case for this op - or any other means to infer the batch size - which would mean that it now takes the batch size just from the input, not from the argument input.
I put it here on purpose, as it calls GetRequestedBatchSize manually, as an extra check that GetRequestedBatchSize returns the "split" batch size.
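For reference, here is a small, self-contained pipeline illustrating the permute_batch behaviour discussed above (it assumes a working DALI install; the shapes, values, and exact arguments are illustrative and may need adjusting for a given DALI version). The indices argument selects which input sample lands at each output position, which is why it was the natural source of the output batch size.

```python
# Illustrative sketch -- assumes nvidia.dali is installed; values are arbitrary.
import numpy as np
from nvidia.dali import fn, pipeline_def

@pipeline_def(batch_size=4, num_threads=1, device_id=None)
def permute_pipe():
    # A batch of 4 samples, where sample i is filled with the value i.
    data = fn.external_source(
        source=lambda: [np.full((2,), i, dtype=np.int32) for i in range(4)],
        batch=True)
    # Output sample i is input sample indices[i]; here the batch is reversed.
    return fn.permute_batch(data, indices=[3, 2, 1, 0])

pipe = permute_pipe()
pipe.build()
(out,) = pipe.run()
print([np.array(out[i]) for i in range(len(out))])  # samples 3, 2, 1, 0
```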
!build
CI MESSAGE: [7159582]: BUILD STARTED
CI MESSAGE: [7159582]: BUILD FAILED
CI MESSAGE: [7159582]: BUILD PASSED
* Set requested batch size based on the op tensor arguments if available
* Include arg inputs in the executor's check for all empty batch

Signed-off-by: Kamil Tokarski <ktokarski@nvidia.com>
Signed-off-by: Kamil Tokarski <ktokarski@nvidia.com>
Category:
New feature (non-breaking change which adds functionality)
Description:
This PR updates the way the executor sets the requested batch size. If there are any tensor inputs available, the executor assumes the operator expects uniform batch sizes across inputs and outputs, and sets the requested batch size from any tensor input it can find: either a positional input or a named (argument) input. Otherwise, if none are available, the stage queue batch size is used.
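In other words, the lookup order is roughly: positional tensor inputs first, then named tensor (argument) inputs, then the stage queue batch size. A purely conceptual sketch of that priority (hypothetical helper, not the actual executor code):

```python
# Conceptual sketch of the inference order described above; not DALI code.
def infer_requested_batch_size(positional_inputs, argument_inputs, stage_batch_size):
    """positional_inputs: list of batches; argument_inputs: dict of name -> batch."""
    for batch in positional_inputs:          # 1. any positional tensor input
        return len(batch)
    for batch in argument_inputs.values():   # 2. any named (argument) tensor input
        return len(batch)
    return stage_batch_size                  # 3. fall back to the stage queue size

assert infer_requested_batch_size([["a", "b"]], {}, 8) == 2
assert infer_requested_batch_size([], {"indices": [0, 1, 2]}, 8) == 3
assert infer_requested_batch_size([], {}, 8) == 8
```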
Additional information:
The common pattern introduced with the end-to-end variable batch size was to look at the regular inputs or, if there were none, resort to GetRequestedBatchSize. This PR moves that behaviour to the executor and includes named arguments in the search for the batch size before resorting to the stage batch size. @klecki updated the autograph layer to reflect that by appropriate "hoisting" of the splits here: #4618

Affected modules and functionalities:
The regular usages of the operators should not see any difference.
Key points relevant for the review:
Tests:
Checklist
Documentation
DALI team only
Requirements
REQ IDs: N/A
JIRA TASK: N/A