ARROW-550: [Format] Draft experimental Tensor flatbuffer message type #435

Closed
wants to merge 3 commits

Conversation

wesm (Member) commented Mar 23, 2017

Tensor-like data occurs very frequently in scientific computing and machine learning applications that are mostly implemented in C and C++. Arrow's C++ memory management and shared memory utilities can help serve these use cases for zero-copy data transfer to other tensor-like data structures (like NumPy ndarrays, or the tensor objects used in machine learning libraries like TensorFlow or Torch).

The Tensor data structure is loosely modeled after NumPy's ndarray object and TensorFlow's tensor protocol buffers type (https://github.com/tensorflow/tensorflow/blob/754048a0453a04a761e112ae5d99c149eb9910dd/tensorflow/core/framework/tensor.proto).

cc @pcmoritz @robertnishihara @SylvainCorlay @JohanMabille

Change-Id: I29a980e132d31711a49ddd3a68824dfeca262d50
/// The size of a memory increment necessary to advance 1 cell along a given
/// axis. If the strides member is null or has 0 length, then the strides
/// will be computed from the shape according to row major order
strides: [long];
wesm (Member Author)

It probably makes sense to disallow negative strides. NumPy allows this, but it makes serialization very difficult


👍 The use of signed types for strides in the Python buffer protocol is really annoying.

wesm (Member Author)

I think using int64_t is useful for maximum compatibility, but we should at least have a comment saying that negative strides aren't supported. I guess if someone tries to write memory with negative strides using arrow::ipc, it will need to be transformed to be all-positive (which is more expensive).
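As a sketch of how that transformation could look, the snippet below copies a 1-D view with a negative byte stride into a fresh contiguous buffer, so that only non-negative strides ever reach the serializer. The helper name and signature are hypothetical, not part of this patch.

#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helper: materialize a 1-D view with an arbitrary (possibly
// negative) byte stride into a contiguous buffer. `data` points at the first
// logical element; for a negative stride the remaining elements live at
// decreasing addresses.
std::vector<uint8_t> MaterializeContiguous(const uint8_t* data, int64_t length,
                                           int64_t byte_stride, int64_t item_size) {
  std::vector<uint8_t> out(static_cast<size_t>(length * item_size));
  for (int64_t i = 0; i < length; ++i) {
    std::memcpy(out.data() + i * item_size, data + i * byte_stride,
                static_cast<size_t>(item_size));
  }
  return out;
}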

Member

Wouldn't it also make sense to add the stride of a dimension to the TensorDim?

wesm (Member Author)

That is possible, but a single stride doesn't have much meaning without looking at all of the dimensions.

robertnishihara (Contributor)

Nice, this looks reasonable!

SylvainCorlay commented Mar 24, 2017

Another option would be to not have strides at all and only allow two orders for serialization:

  • last-index-varies-faster (commonly called C order or row-major)
  • first-index-varies-faster (commonly called Fortran order or column-major).

A single byte would be sufficient for that, and it would improve the memory footprint while covering most use cases.
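To illustrate why a single byte suffices: given the shape, an order flag is all a reader needs to reconstruct contiguous strides. A rough C++ sketch, with names that are illustrative rather than taken from the spec:

#include <cstdint>
#include <vector>

enum class TensorOrder : int8_t { ROW_MAJOR, COLUMN_MAJOR };  // illustrative only

// Compute contiguous byte strides from a shape and an order flag.
std::vector<int64_t> ContiguousStrides(const std::vector<int64_t>& shape,
                                       int64_t item_size, TensorOrder order) {
  std::vector<int64_t> strides(shape.size());
  int64_t step = item_size;
  if (order == TensorOrder::ROW_MAJOR) {
    // Last index varies fastest: fill strides from the last axis backwards.
    for (size_t i = shape.size(); i-- > 0;) {
      strides[i] = step;
      step *= shape[i];
    }
  } else {
    // First index varies fastest: fill strides from the first axis forwards.
    for (size_t i = 0; i < shape.size(); ++i) {
      strides[i] = step;
      step *= shape[i];
    }
  }
  return strides;
}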

wesm (Member Author) commented Mar 24, 2017

@SylvainCorlay that had been my initial thought. I'll change the spec to be either row- or column-major, and if there is sufficient demand in the future we can add non-contiguous/strided support. I agree that arbitrary striding mostly only makes sense in memory.

SylvainCorlay

If by "either row/column-major" you mean with a boolean switch between them, I am 👍 . ( there is probably a good demand for both row-major (e.g. numpy) and column-major (e.g. julia). )

Change-Id: I2ffa55305e770d9aa5c1ecdea74ee65b91589a8a
wesm (Member Author) commented Mar 24, 2017

@SylvainCorlay I made it an enum so that we can add a TensorOrder_STRIDED option later if there is demand

name: string;
}

enum TensorOrder : short {
SylvainCorlay Mar 24, 2017

short is two bytes long (in most implementations).

Could this be a 1-byte bool, char, or unsigned char enum?

SylvainCorlay Mar 24, 2017

bool:

enum TensorOrder : bool {
    ROW_MAJOR = false,
    COLUMN_MAJOR = true,
};

char:

enum TensorOrder : char {
    ROW_MAJOR = 'r',
    COLUMN_MAJOR = 'c',
};

JohanMabille (Contributor) Mar 24, 2017

I would prefer char over bool; it is more expressive and easier to extend.

wesm (Member Author)

I changed it from short to byte (there is no char in Flatbuffers)

Change-Id: I129312f80ea6f561ee5a56b2393bda2ddcddefca
SylvainCorlay

Does Flatbuffers require buffers to be aligned to a certain number of bytes, or is it up to this spec to specify padding at the end of the header so that the data is aligned at a given level?

wesm (Member Author) commented Mar 24, 2017

What kind of buffers are you referring to? When we write the Flatbuffer in the stream/file format with a length prefix, it includes padding to an 8-byte boundary:

https://github.com/apache/arrow/blob/master/cpp/src/arrow/ipc/writer.cc#L169
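The calculation itself is just rounding the metadata length up to the next multiple of 8. A rough sketch of the idea, not the actual writer.cc code:

#include <cstdint>

// Round a length up to the next multiple of 8, as is done for the
// length-prefixed flatbuffer metadata (sketch only).
constexpr int64_t PaddedTo8(int64_t length) {
  return (length + 7) & ~static_cast<int64_t>(7);
}

static_assert(PaddedTo8(13) == 16, "13 bytes pad up to 16");
static_assert(PaddedTo8(16) == 16, "already-aligned lengths are unchanged");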

wesm (Member Author) commented Mar 24, 2017

Merging this so I can move on to a prototype of writing and reading tensors to shared memory; I will hopefully have a patch together in the next few days.

asfgit closed this in dc3cb30 Mar 24, 2017
SylvainCorlay commented Mar 24, 2017

SIMD intrinsics for loading data into a SIMD register come in aligned and unaligned versions. The former is generally much faster than the latter.

So it helps if the main buffer of the tensor can be aligned on a

  • 16-byte (SSE2)
  • 32-byte (AVX)

boundary.

If the entire buffer (including shape and order) is 32-byte aligned, it is good for performance that the offset of the data also has 32-byte alignment.

So adding some padding after the type / shape / order information to align the data buffer can make things faster at the library level.
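To illustrate the aligned/unaligned distinction, here is a minimal sketch assuming AVX (compiled with e.g. -mavx); this is not Arrow code:

#include <cstdint>
#include <immintrin.h>

// AVX has both aligned and unaligned load intrinsics; the aligned form is
// only valid when the address is a multiple of 32 bytes.
__m256 LoadEightFloats(const float* p) {
  if (reinterpret_cast<std::uintptr_t>(p) % 32 == 0) {
    return _mm256_load_ps(p);   // aligned load
  }
  return _mm256_loadu_ps(p);    // unaligned load: always valid, often slower
}

If the writer pads so that the data buffer always starts on a 32-byte boundary, the fast path above can be taken unconditionally.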

wesm deleted the ARROW-550 branch March 24, 2017 16:00
wesm (Member Author) commented Mar 24, 2017

@SylvainCorlay sounds good -- I will plan to ensure that the tensor memory is written with at least 32-byte alignment and padding.

SylvainCorlay commented Mar 24, 2017

@JohanMabille has written a standard STL allocator for aligned allocation which, when used, guarantees that the buffer (including type/shape/order) is aligned.

So all that would be required from the Arrow spec for good SIMD performance is that the offset of the beginning of the memory is a multiple of 32 bytes.
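A minimal sketch of such an aligned STL allocator, relying on C++17 std::aligned_alloc; this is only an illustration of the idea, not the allocator referred to above:

#include <cstdlib>
#include <new>
#include <vector>

// Allocator that guarantees every allocation is aligned to `Alignment` bytes.
// Note: std::aligned_alloc is C++17 and is not available on MSVC, which
// provides _aligned_malloc instead.
template <class T, std::size_t Alignment = 32>
struct AlignedAllocator {
  using value_type = T;

  template <class U>
  struct rebind { using other = AlignedAllocator<U, Alignment>; };

  AlignedAllocator() = default;
  template <class U>
  AlignedAllocator(const AlignedAllocator<U, Alignment>&) {}

  T* allocate(std::size_t n) {
    // aligned_alloc requires the size to be a multiple of the alignment.
    std::size_t bytes = ((n * sizeof(T) + Alignment - 1) / Alignment) * Alignment;
    if (void* p = std::aligned_alloc(Alignment, bytes)) return static_cast<T*>(p);
    throw std::bad_alloc();
  }
  void deallocate(T* p, std::size_t) noexcept { std::free(p); }
};

template <class T, class U, std::size_t A>
bool operator==(const AlignedAllocator<T, A>&, const AlignedAllocator<U, A>&) { return true; }
template <class T, class U, std::size_t A>
bool operator!=(const AlignedAllocator<T, A>&, const AlignedAllocator<U, A>&) { return false; }

// Usage: a vector whose buffer is always 32-byte aligned.
// std::vector<float, AlignedAllocator<float>> v(1024);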
