
Add inflate operator #4366

Merged
merged 38 commits into from
Nov 9, 2022

Conversation

stiepan
Member

@stiepan stiepan commented Oct 17, 2022

Category:

New feature (non-breaking change which adds functionality)

Description:

Adds the inflate operator, which decompresses a batch of compressed samples (and batches of sequences of compressed samples). For now, only LZ4 compression is supported.

The decompression is done with the nvCOMP library. Note that nvCOMP is not open source; the nvCOMP SDK license is included in the Acknowledgment: #4368.

If the samples contain multiple compressed chunks, either the chunk_sizes or the chunk_offsets parameter must be provided. If both are provided, the order of the chunks in the input and output can differ.
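To make the chunk_sizes / chunk_offsets semantics concrete, here is a minimal pure-Python sketch (not the DALI/nvCOMP implementation; zlib stands in for LZ4, and `inflate_sample` is a hypothetical helper) showing how sizes can be inferred from offsets and vice versa when only one of the two is given:

```python
# Conceptual sketch of the chunk_sizes / chunk_offsets semantics.
# zlib is a stdlib stand-in for LZ4; inflate_sample is hypothetical.
import zlib

def inflate_sample(sample, chunk_offsets=None, chunk_sizes=None):
    """Decompress a sample made of concatenated compressed chunks."""
    if chunk_offsets is None and chunk_sizes is None:
        return [zlib.decompress(sample)]  # single-chunk sample
    if chunk_offsets is None:
        # Only sizes given: chunks are assumed to lie back to back.
        chunk_offsets, pos = [], 0
        for size in chunk_sizes:
            chunk_offsets.append(pos)
            pos += size
    if chunk_sizes is None:
        # Only offsets given: each chunk spans up to the next offset
        # (or the end of the sample).
        ends = list(chunk_offsets[1:]) + [len(sample)]
        chunk_sizes = [end - off for off, end in zip(chunk_offsets, ends)]
    return [zlib.decompress(sample[off:off + size])
            for off, size in zip(chunk_offsets, chunk_sizes)]

# Build a sample from two separately compressed chunks.
chunks = [b"frame-0" * 10, b"frame-1" * 10]
compressed = [zlib.compress(c) for c in chunks]
sample = b"".join(compressed)
sizes = [len(c) for c in compressed]

assert inflate_sample(sample, chunk_sizes=sizes) == chunks
assert inflate_sample(sample, chunk_offsets=[0, sizes[0]]) == chunks
```

When both lists are provided, no inference is needed, which is why chunks may then appear in a different order than they occur in the input.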

Additional information:

For now, nvCOMP (and thus the operator) is included only in x86 builds, due to an incompatible glibcxx in nvCOMP's aarch64 build.

Affected modules and functionalities:

  • The new operator is added
  • Python tests are added
  • CI dockerfiles and wheel building scripts are affected

Key points relevant for the review:

Tests:

  • Existing tests apply
  • New tests added
    • Python tests
    • GTests
    • Benchmark
    • Other
  • N/A

Checklist

Documentation

  • Existing documentation applies
  • Documentation updated
    • Docstring
    • Doxygen
    • RST
    • Jupyter
    • Other
  • N/A

DALI team only

Requirements

  • Implements new requirements
  • Affects existing requirements
  • N/A

REQ IDs: DECOMP.02-08.

JIRA TASK: DALI-3049

@stiepan stiepan force-pushed the lz4_inflate branch 3 times, most recently from 986c5bb to c14eb76 on October 19, 2022 21:02
@dali-automaton
Collaborator

CI MESSAGE: [6233278]: BUILD STARTED

@dali-automaton
CI MESSAGE: [6233972]: BUILD STARTED

@dali-automaton
CI MESSAGE: [6234125]: BUILD STARTED

@dali-automaton
CI MESSAGE: [6234125]: BUILD FAILED

@dali-automaton
CI MESSAGE: [6240852]: BUILD STARTED

@dali-automaton
CI MESSAGE: [6240852]: BUILD FAILED

@dali-automaton
CI MESSAGE: [6242387]: BUILD STARTED

@dali-automaton
CI MESSAGE: [6242387]: BUILD FAILED

@dali-automaton
CI MESSAGE: [6243748]: BUILD STARTED

@dali-automaton
CI MESSAGE: [6243748]: BUILD FAILED

@dali-automaton
CI MESSAGE: [6251418]: BUILD STARTED

@dali-automaton
CI MESSAGE: [6251418]: BUILD FAILED

@stiepan stiepan mentioned this pull request Oct 24, 2022
18 tasks
@stiepan stiepan changed the title Lz4 inflate Add inflate operator Oct 24, 2022
@stiepan stiepan marked this pull request as ready for review October 24, 2022 18:38
.NumOutput(1)
.AddArg(inflate::shapeArgName, "The shape of the output (inflated) chunk.", DALI_INT_VEC, true)
.AddOptionalTypeArg(inflate::dTypeArgName, "The output (inflated) data type.", DALI_NO_TYPE)
.AddOptionalArg(inflate::offsetArgName, R"code("A list of offsets within the input sample
Contributor

It's probably a matter of taste, but I don't like these constants - they make the code only infinitesimally "harder" but much harder to read.

@dali-automaton

CI MESSAGE: [6362583]: BUILD STARTED

Comment on lines 32 to 35
If the sample is comprised of multiple chunks, the ``chunks_offsets`` or ``chunks_sizes``
must be specified. In that case, the ``shape`` must describe the shape of a single inflated
(output) chunk. The number of the chunks will automatically be added as the leftmost extent
to the output tensors.
Contributor

Maybe it would be useful to extend this API in the future with an alternative where you provide the number of chunks instead of the offsets. In some cases, I imagine that would be easier to know.
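As a sketch of that suggested extension (purely hypothetical, not part of this PR): given a chunk count and an assumption of equally sized chunks, offsets could be derived like this:

```python
# Hypothetical helper for the suggested API extension: derive chunk
# offsets from a chunk count, assuming equally sized compressed chunks.
def offsets_from_count(sample_size, num_chunks):
    if sample_size % num_chunks != 0:
        raise ValueError("sample size must be divisible by the chunk count")
    chunk_size = sample_size // num_chunks
    return [i * chunk_size for i in range(num_chunks)]

print(offsets_from_count(12, 3))  # [0, 4, 8]
```

Note the equal-size assumption: compressed chunk sizes generally vary, so this would only apply to fixed-size compression framing.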

@has_operator("experimental.inflate")
def test_sample_inflate():
    seed = 42
    for batch_size in [1, 8, 64, 256, 348]:
Contributor

Are these batch sizes in any way significant to the code?

Member Author

Not really, to be honest. 1 is a special case, 8 is small, 348 is big, and the remaining two are medium. I thought there might be issues that show up specifically for small or big batch sizes. But maybe there are too many cases.

Comment on lines 347 to 387
yield _test_validation, pipeline_2d_shape, "The shape argument must be a scalar or a 1D tensor"
yield _test_validation, pipeline_non_elementary_dtype, \
"The inflate output type must have floating point or integral type"
yield _test_validation, pipeline_input_float, "Got tensor of type `float` instead"
yield _test_validation, pipeline_input_scalar, "Got input with 0 dimensions instead"
yield _test_validation, pipeline_input_algorithm, \
"Unknown inflate algorithm was specified for `algorithm` argument"
yield _test_validation, pipeline_too_big_chunk, "Input chunk size cannot exceed the sample size"
yield _test_validation, pipeline_too_big_chunks, \
"The sum of chunk sizes for sample of idx 0 exceeds the total size of the sample."
yield _test_validation, pipeline_empty_chunk, "Got chunk size 0 for sample of idx 0"
yield _test_validation, pipeline_neg_chunk, "Got chunk size -1 for sample of idx 0"
yield _test_validation, pipeline_too_big_offsets, \
"Got chunk offset 5 while the sample size is 5 for sample of idx 0"
yield _test_validation, pipeline_too_zero_size_inferred, \
"The inferred size of a chunk would be non-positive for sample of idx 0"
yield _test_validation, pipeline_sizes_offsets_mismatched, \
"for sample of idx 0 there are 2 offsets and 3 sizes"
yield _test_validation, pipeline_negative_offset, \
"Input chunks offsets must be non-negative"
yield _test_validation, pipeline_chunk_exceeding_sample, \
"Input chunk cannot exceed the sample size"
Contributor

If possible, please do not use tests yielding because it is impossible to properly discover them and run them in parallel in nose2. If you need to have parameterized tests you can look into: https://docs.nose2.io/en/latest/params.html
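For illustration, the yield-style cases above could be rewritten as a discoverable parameterized test. nose2 provides `nose2.tools.params` for this; the sketch below uses the stdlib `unittest.subTest` equivalent, with a hypothetical `run_validation` standing in for building a pipeline that must fail:

```python
# Sketch: replacing yield-based tests with discoverable parameterized
# tests. run_validation is a hypothetical stand-in for building and
# running a pipeline that is expected to raise a validation error.
import unittest

CASES = [
    ("too big chunk", "Input chunk size cannot exceed the sample size"),
    ("empty chunk", "Got chunk size 0 for sample of idx 0"),
]

def run_validation(name):
    # Stand-in: a real test would build the pipeline and run it.
    raise ValueError(dict(CASES)[name])

class TestValidation(unittest.TestCase):
    def test_validation_messages(self):
        for name, message in CASES:
            with self.subTest(case=name):
                with self.assertRaisesRegex(ValueError, message):
                    run_validation(name)
```

Unlike yielded tests, these are collected by standard discovery and can be sharded or run in parallel by the runner.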

@dali-automaton

CI MESSAGE: [6362583]: BUILD PASSED


Each input sample can either be a single compressed chunk or consist of multiple
compressed chunks that have the same shape and type when inflated, so that they can
be merged into a single tensor where the leftmost extent of the tensor corresponds
Contributor
@mzient mzient Nov 3, 2022

Suggested change
be merged into a single tensor where the leftmost extent of the tensor corresponds
be merged into a single tensor where the outermost extent of the tensor corresponds

...but much more importantly - what does it even mean?
Are the chunks visible in the output shape? Shouldn't chunking be just an internal detail?

Member Author
@stiepan stiepan Nov 3, 2022

The real use case it targets is 3D sequences of single-channel HW images. Each image is compressed separately and the results are concatenated. Using chunks_offsets or chunks_sizes you can do the reverse: decompress the chunks into a sequence. So compressed chunks translate to "frames" in the tensor.
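That use case can be sketched end-to-end in plain Python (zlib standing in for LZ4; not the DALI implementation): each frame is compressed separately, the results are concatenated, and the offsets let you invert this into a sequence whose chunk count becomes the outermost extent.

```python
# Sketch of the frames use case: separately compressed HW frames,
# concatenated into one sample, then "inflated" back into a sequence.
# zlib is a stdlib stand-in for LZ4/nvCOMP.
import zlib

H, W = 2, 3
frames = [bytes([(f * 10 + i) % 256 for i in range(H * W)])
          for f in range(4)]

compressed = [zlib.compress(f) for f in frames]
sample = b"".join(compressed)
offsets, pos = [], 0
for c in compressed:
    offsets.append(pos)
    pos += len(c)

# Decompress each chunk; the chunk count is the outermost extent.
ends = offsets[1:] + [len(sample)]
sequence = [zlib.decompress(sample[o:e]) for o, e in zip(offsets, ends)]
assert sequence == frames
assert (len(sequence), H, W) == (4, 2, 3)
```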

Contributor

outermost still applies - we never use leftmost in this context (the word appears 6 times in our code base and refers to position within image/array, not to dimensions). There are 67 occurrences of outermost.

Member Author

I resolved that discussion by misclick and couldn't find it anywhere afterwards. :p
Anyway, renamed it.

Signed-off-by: Kamil Tokarski <ktokarski@nvidia.com>
@stiepan
Member Author

stiepan commented Nov 8, 2022

Rebasing on the CI fixes

@dali-automaton
CI MESSAGE: [6432468]: BUILD STARTED

@dali-automaton
CI MESSAGE: [6432469]: BUILD FAILED

@dali-automaton
CI MESSAGE: [6432477]: BUILD STARTED

@dali-automaton
CI MESSAGE: [6432477]: BUILD PASSED

@dali-automaton
CI MESSAGE: [6432468]: BUILD FAILED

@dali-automaton
CI MESSAGE: [6432468]: BUILD PASSED

@stiepan stiepan merged commit bd6bfe3 into NVIDIA:main Nov 9, 2022
@JanuszL JanuszL mentioned this pull request Jan 11, 2023