GH-34971: [Format] Add non-CPU version of C Data Interface #34972
Conversation
I have a few questions about how `ArrowDeviceArrayStream` works.
cpp/src/arrow/c/abi.h (outdated):

/// The next call to `get_next` should provide an ArrowDeviceArray whose
/// device_id matches what is provided here, and whose device_type is the
/// same as the device_type member of this stream.
I'm not certain I follow. Isn't the `ArrowDeviceArray` passed to `get_next` an "out" parameter? Are you saying that the `ArrowDeviceArray` struct itself (not the buffers) needs to be allocated on the device?
No, I was referring to the `device_id` member and `device_type` member that get populated in the `ArrowDeviceArray` that is returned from `get_next`.
This is a weird API choice, all because you want the consumer to pass its CUDA stream of choice...
Ultimately this is a consequence of the fact that the existing frameworks and APIs don't provide any good way to manage a stream's lifetime, which makes having the consumer pass the stream the safest route to take.

I'm absolutely open to suggestions to make this better, as long as the consumer is able to pass in the desired stream.
> the existing frameworks and APIs don't provide any good way to manage the stream's lifetime easily

What do you mean by that? Would you care to give a more concrete example? For example, CUDA allows you to destroy a stream:
https://docs.nvidia.com/cuda/cuda-runtime-api/group__CUDART__STREAM.html#group__CUDART__STREAM_1gfda584f1788ca983cb21c5f4d2033a62
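As a minimal sketch of that lifecycle (standard CUDA runtime calls; the helper function name is made up, and the open question in this thread is *when* the destroy can safely happen):

```c
#include <cuda_runtime.h>

/* Minimal sketch: a producer-owned CUDA stream has an explicit lifecycle. */
static void produce_and_clean_up(void) {
  cudaStream_t s;
  cudaStreamCreate(&s);
  /* ... enqueue kernels / async copies that fill the exported buffers ... */
  /* The difficulty for producers is not the API but the timing: the stream
   * must not be destroyed while a consumer may still be ordering work
   * against it. */
  cudaStreamSynchronize(s);  /* or defer destruction via events / refcounting */
  cudaStreamDestroy(s);
}
```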
There's a lot of discussion in that issue related to internal stream handling under contexts and schedulers and similar terms. It all boils down to the same discussion of many producers being unable to release or share ownership of their streams.
I think this comment does a good job of summarizing the options that were considered: dmlc/dlpack#57 (comment)
And then this comment summarizes discussion of those options: dmlc/dlpack#57 (comment)
The lifetime management of streams as defined by the Numba documentation for `__cuda_array_interface__` (https://numba.readthedocs.io/en/stable/cuda/cuda_array_interface.html#streams) requires that keeping the object (typically an array) that produces the `__cuda_array_interface__` alive also keeps the stream alive. In most cases libraries don't associate a stream with the object, since it's valid to use multiple streams with a single object.

Here's the current state of things across a handful of projects:

- Numba
  - Handles stream lifetime properly for `__cuda_array_interface__`, since they store a stream object as part of their array: https://github.com/numba/numba/blob/008077553b558bd183668ecd581d4d0bc54bd32c/numba/cuda/cudadrv/devicearray.py#L119-L139
  - Doesn't support `__dlpack__`
- CuPy
  - Doesn't implement stream lifetime properly for `__cuda_array_interface__` as far as I can tell: they just get a stream ptr as an integer, and if it's not the default stream someone could change the current stream ptr and end up having it be destroyed out from underneath it: https://github.com/cupy/cupy/blob/c92d5bc16293300297b843b4ebb364125697c131/cupy/_core/core.pyx#L258-L262 (cc @leofang)
  - Handles `__dlpack__` properly: https://github.com/cupy/cupy/blob/c92d5bc16293300297b843b4ebb364125697c131/cupy/_core/core.pyx#L285-L327
- PyTorch
  - Doesn't support passing the stream in `__cuda_array_interface__` currently: https://github.com/pytorch/pytorch/blob/def50d253401540cfdc6c0fffa444d0ee643cc11/torch/_tensor.py#L1005-L1066
  - Handles `__dlpack__` properly: https://github.com/pytorch/pytorch/blob/def50d253401540cfdc6c0fffa444d0ee643cc11/torch/_tensor.py#L1345
- Tensorflow and JAX
  - Don't support passing the stream in `__cuda_array_interface__` currently: https://github.com/tensorflow/tensorflow/blob/ea6a0f282d2b7ce20891dfc24ec8fe107eeaf22d/tensorflow/compiler/xla/python/py_buffer.cc#L254-L300
  - Don't support `__dlpack__`, but have explicit `to` and `from` functions for using dlpack pycapsules; these do not handle streams: https://github.com/tensorflow/tensorflow/blob/6a050b6c15ed2a545693bc171f5e95dacbe05839/tensorflow/compiler/xla/python/dlpack.cc#L285-L363
- cuDF
  - Doesn't support passing the stream in `__cuda_array_interface__` currently: https://github.com/rapidsai/cudf/blob/50718e673ff53b18706cf66c6e02cda8e30681fe/python/cudf/cudf/core/column/numerical.py#L169-L191
  - Doesn't support `__dlpack__`, but has explicit `to` and `from` functions for using dlpack pycapsules; these do not handle streams: https://github.com/rapidsai/cudf/blob/50718e673ff53b18706cf66c6e02cda8e30681fe/cpp/src/interop/dlpack.cpp#L218-L294
Thanks for the pointers.

> I think this comment does a good job of summarizing the options that were considered: dmlc/dlpack#57 (comment)

Yes, I read this. It looks like solution S1, which is also the one I'm proposing, is considered the most flexible (I don't understand the "harder for compilers" comment, though).

> And then this comment summarizes discussion of those options: dmlc/dlpack#57 (comment)

I read this too, but it doesn't actually mention S1, for reasons I wasn't able to understand.

> In most cases libraries don't associate a stream with the object since it's valid to use multiple streams with a single object.

But you have to actually synchronize on the right stream before being able to use the object, right? How does the user know which stream to synchronize on, if they didn't produce the data themselves?
> Yes, I read this. It looks like solution S1, which is also the one I'm proposing, is considered the most flexible (I don't understand the "harder for compilers" comment, though).

From dmlc/dlpack#57 (comment):

> It also brings extra burden to the compilers themselves. The compiler will need to generate optional synchronization code based on the streams, which is non-trivial.

I believe the compilers being referred to here are deep learning compilers like XLA, which do things like kernel fusion and set up execution graphs of kernels that use streams internally to parallelize the execution of said graphs.

> But you have to actually synchronize on the right stream before being able to use the object, right?

Something / someone needs to guarantee that there isn't a data race with regard to using multiple non-blocking streams, yes. That could be done with events, stream synchronization, or device synchronization.

> How does the user know which stream to synchronize on, if they didn't produce the data themselves?

If you're staying within your framework / library, then the expectation is for the framework / library to handle things for the user. If crossing framework / library boundaries, then the expectation is to rely on things like interchange protocols to handle the synchronization semantics.
> do you happen to know if there was any other discussion captured that could be linked here regarding the decision to have a consumer hand a stream to the producer

Sorry I wasn't able to respond promptly. Is the question still open?

In the case of CAI, it is required that someone handles the exporting stream's lifetime properly:

> Like data, CUDA streams also have a finite lifetime. It is therefore required that a Producer exporting data on the interface with an associated stream ensures that the exported stream's lifetime is equal to or surpasses the lifetime of the object from which the interface was exported.

and this was considered a burden when discussing the DLPack support. A few libraries like Numba, for example, had to hold a reference to the underlying stream. I believe this was the main concern for DLPack to place the requirement on the consumer instead of the producer.
cpp/src/arrow/c/abi.h (outdated):

/// \param[in] queue_ptr The appropriate queue, stream, or
/// equivalent object for the device that the data is allocated on
/// to indicate where the consumer wants the data to be accessible.
/// If queue_ptr is NULL then the default stream (e.g. CUDA stream 0)
/// should be used to ensure that the memory is accessible from any stream.
I'm a little confused here. It sounds like I need to call `get_next_device_id` to determine which queue to use, and then I need to pass that queue on to the call to `get_next`. But why? Why isn't the producer managing the queues?

If the producer controls which device id gets used (`get_next_device_id` seems to suggest this), then why does the consumer need to give it the queue? For example, if I were merging streams from two different devices it seems like I would do something like (apologies for the butchered pseudo-code)...
```cpp
// Dummy class merging two infinite streams in an inefficient round-robin fashion
class MergedStream {
 public:
  int get_next(ArrowDeviceArray* out) {
    if (merged_arrays_.empty()) {
      ArrowDeviceArray arr;
      left_.get_next(&arr);
      merged_arrays_.push(arr);
      right_.get_next(&arr);
      merged_arrays_.push(arr);
    }
    *out = merged_arrays_.front();
    merged_arrays_.pop();
    return 0;
  }

 private:
  ChildStream left_, right_;                    // upstream sources (pseudo-code)
  std::queue<ArrowDeviceArray> merged_arrays_;  // buffered chunks
};
```
@westonpace The idea here is that the consumer of the interface provides a queue to the producer and the producer is responsible for ensuring that the data is safe to consume on the provided queue.
The reason for doing this instead of the producer returning a pointer to a queue that the data is safe to consume on is that frameworks generally manage these queues internally and don't have a mechanism to share a queue and control its lifetime over a boundary like this.
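A rough sketch of that flow from the consumer's side follows; the `get_next` signature taking a queue pointer reflects the draft being discussed here and is illustrative only, with the struct declarations assumed from the draft abi.h and error handling simplified:

```c
#include <cuda_runtime.h>

/* Hypothetical consumer-side flow for a consumer-provided queue. */
static int consume_one_chunk(struct ArrowDeviceArrayStream* producer) {
  cudaStream_t my_stream;
  cudaStreamCreate(&my_stream);

  struct ArrowDeviceArray chunk;
  /* The consumer hands its own stream to the producer; the producer is then
   * responsible for making the returned buffers safe to use on that stream
   * (e.g. by ordering its pending work ahead of the consumer's). */
  int err = producer->get_next(producer, (void*)my_stream, &chunk);

  if (err == 0) {
    /* ... launch the consumer's kernels on my_stream using chunk's buffers ... */
    if (chunk.array.release != NULL) chunk.array.release(&chunk.array);
  }
  cudaStreamDestroy(my_stream);
  return err;
}
```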
The standard "mechanism to share a queue and control its lifetime over a boundary like this" in the C Data Interface would be the release callback.
That doesn't make sense, does it? How is the consumer supposed to manage the stream's lifetime if "there isn't a mechanism that you could call in the release callback to cleanly control the lifetime"?
Why wouldn't they? They can easily refcount the usage of their own CUDA streams.
> Why wouldn't they? They can easily refcount the usage of their own CUDA streams.

I think that is making a lot of assumptions about how folks use and manage CUDA streams 😄. Again, some places use them similarly to thread pools and only control the lifetime of the pool.
I tried to dig through Tensorflow's code to figure out exactly how they're managing the lifetime of their streams, but I'm not confident; everything I say below may not be correct:

- Something eventually calls down to `AllocateStream` and `DeallocateStream` (https://github.com/tensorflow/tensorflow/blob/b9fc6a9b611ec373c02e5b5ab432b1d7aff9392e/tensorflow/compiler/xla/stream_executor/cuda/cuda_gpu_executor.cc#L759-L774) to create and destroy CUDA streams.
- These operate on raw ptrs, and it looks like there's a class that wraps these, `Stream` (https://github.com/tensorflow/tensorflow/blob/b9fc6a9b611ec373c02e5b5ab432b1d7aff9392e/tensorflow/compiler/xla/stream_executor/stream.cc#L262-L286), which has constructor, destructor, and init functions roughly as you'd expect.
- I believe these `Stream` objects are managed in a `StreamPool` (https://github.com/tensorflow/tensorflow/blob/b9fc6a9b611ec373c02e5b5ab432b1d7aff9392e/tensorflow/compiler/xla/service/stream_pool.h#L27-L59), which then allows "borrowing" the streams using unique ptrs.

I guess in theory, if they ultimately have `Stream` objects being used, one could be moved into the private data used by the release callback.
> I tried to dig through Tensorflow's code to figure exactly how they're managing the lifetime of their streams but I'm not confident

The fact that they're handling those lifetimes should be enough to plug in a refcounting mechanism (or interact with the GC, in the case of a managed language). This is already necessary to manage the lifetime of data exported through the C Data Interface.

I understand that they might not have a refcounting mechanism in place already, but that's basic engineering anyway.
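For instance, a producer whose streams are shared across several exported arrays could hold one reference per export and only destroy (or return to its pool) the stream when the last release callback fires. A minimal sketch of that idea (all names are illustrative, and a real implementation would make the count atomic or lock-protected):

```c
#include <stdlib.h>
#include <cuda_runtime.h>

/* Sketch of refcounting a shared producer-owned stream; each exported array
 * would hold one reference via its private_data and drop it in release. */
struct SharedStream {
  cudaStream_t stream;
  int refcount;  /* guard with an atomic or mutex in real code */
};

static struct SharedStream* shared_stream_ref(struct SharedStream* s) {
  s->refcount++;
  return s;
}

static void shared_stream_unref(struct SharedStream* s) {
  if (--s->refcount == 0) {
    cudaStreamDestroy(s->stream);
    free(s);
  }
}
```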
Okay, regardless: if we take the producer-provided path, then I think it makes a lot more sense for the producer to share an Event than a Stream.

An Event can be waited on via `cudaStreamWaitEvent` / `hipStreamWaitEvent`, which does a device-side wait with minimal overhead if it's the same stream, or via `cudaEventSynchronize` / `hipEventSynchronize` if blocking host code is desired.
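On the consumer side, that looks roughly like the following sketch; how the event actually crosses the ABI (here assumed to arrive as a `void*` carried alongside the `ArrowDeviceArray`) is the part still being settled in this thread:

```c
#include <cuda_runtime.h>

/* Consumer-side sketch: wait on a producer-recorded event before using the
 * exported buffers. The function name and the void* hand-off are assumptions. */
static void wait_for_producer(void* producer_event, cudaStream_t my_stream,
                              int block_host) {
  cudaEvent_t ev = (cudaEvent_t)producer_event;
  if (ev == NULL) return;  /* nothing to wait on */
  if (block_host) {
    cudaEventSynchronize(ev);               /* host blocks until the event fires */
  } else {
    cudaStreamWaitEvent(my_stream, ev, 0);  /* device-side wait; cheap if same stream */
  }
}
```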
Since it seems we're going to take the producer-provides-an-event path, there isn't really a need for the `get_next_device_id` callback anymore, correct? Or am I missing something?

@pitrou I think the other thing that hasn't been discussed in the above threads with regards to producer- vs consumer-provided stream is the development burden that will be incurred by the consumer in the different situations. In producer-provided stream / event:

In consumer-provided stream:
Ok, who is supposed to be the consumer of the C Device Data Interface? I would expect it to be an Arrow implementation, or an Arrow-compatible library. Realistically they probably already deal with CUDA streams if they support CUDA?

(also, to make things clear, I am not saying this proposal is wrong; I just want to make sure we evaluate the issues carefully and accurately - hence the questions)
Some minor comments, but LGTM overall.
Benchmark runs are scheduled for baseline = 9fb8697 and contender = 105b9df. 105b9df is a master commit associated with this PR. Results will be available as each benchmark for each run completes.
After:

- https://github.com/zeroshade/arrow-non-cpu/tree/main
- https://lists.apache.org/thread/o2hsw7o1gm3qgw5z51rmz6zqxh0p7bvk
- apache/arrow#34972

Still in very much draft form; however, it *does* implement arbitrary ArrowArray copy to/from `ARROW_DEVICE_METAL`, `ARROW_DEVICE_CUDA`, `ARROW_DEVICE_CUDA_HOST`, and `ARROW_DEVICE_CPU`.

The nanoarrow_device extension as drafted here serves a similar purpose to nanoarrow: a means by which to create and consume the C ABI with the intention of shipping those structures to other libraries to do transformations, and potentially retrieving them again after the computation is complete. Perhaps another way to put it is that nanoarrow is designed to help at the edges: it can create and consume. Similarly, the nanoarrow_device extension is designed to help at the edges: it can copy/move arrays to and from CPU-land.

With this PR, you can currently do something like:

```c
struct ArrowDevice* gpu = ArrowDeviceMetalDefaultDevice();
// Alternatively, ArrowDeviceCuda(ARROW_DEVICE_CUDA, 0)
// or ArrowDeviceCuda(ARROW_DEVICE_CUDA_HOST, 0)
struct ArrowDevice* cpu = ArrowDeviceCpu();
struct ArrowArray array;
struct ArrowDeviceArray device_array;
struct ArrowDeviceArrayView device_array_view;

// Build a CPU array
ASSERT_EQ(ArrowArrayInitFromType(&array, NANOARROW_TYPE_STRING), NANOARROW_OK);
ASSERT_EQ(ArrowArrayStartAppending(&array), NANOARROW_OK);
ASSERT_EQ(ArrowArrayAppendString(&array, ArrowCharView("abc")), NANOARROW_OK);
ASSERT_EQ(ArrowArrayAppendString(&array, ArrowCharView("defg")), NANOARROW_OK);
ASSERT_EQ(ArrowArrayAppendNull(&array, 1), NANOARROW_OK);
ASSERT_EQ(ArrowArrayFinishBuildingDefault(&array, nullptr), NANOARROW_OK);

// Convert to a DeviceArray, still on the CPU
ArrowDeviceArrayInit(&device_array, cpu);
ArrowArrayMove(&array, &device_array.array);

// Parse contents into a view that can be copied to another device
ArrowDeviceArrayViewInit(&device_array_view);
ArrowArrayViewInitFromType(&device_array_view.array_view, string_type);
ASSERT_EQ(ArrowDeviceArrayViewSetArray(&device_array_view, &device_array, nullptr),
          NANOARROW_OK);

// Try to zero-copy move to another device, or copy if that is not possible.
// Zero-copy move is implemented for ARROW_DEVICE_METAL and ARROW_DEVICE_CUDA_HOST
// for the gpu -> cpu case.
struct ArrowDeviceArray device_array2;
device_array2.array.release = nullptr;
ASSERT_EQ(
    ArrowDeviceArrayTryMove(&device_array, &device_array_view, gpu, &device_array2),
    NANOARROW_OK);
```

In concrete terms, that means we need to know enough about a device to (1) copy and/or move an arbitrary `ArrowArray`/`ArrowSchema` pair to a device from the CPU and (2) copy/move an arbitrary `ArrowDeviceArray`/`ArrowSchema` pair back to the CPU. The three types of copying I support (and maybe there could be fewer/need to be more) are:

- `ArrowDeviceBufferInit()`: Make a non-owning buffer into an owning buffer on a device. The entry point if you want to take a slice of an `ArrowArrayView` and ship it to a device.
- `ArrowDeviceBufferMove()`: Move an existing (owning) buffer to a device. For devices like the CPU, this is a true zero-copy move; for shared memory this can also sometimes be zero-copy (e.g., Apple Metal -> CPU) but might also involve a copy.
- `ArrowDeviceBufferCopy()`: Copy a section of a buffer into a preallocated section of another buffer. I'm envisioning this to be necessary when copying a String, Binary, or List: we need the first and last values of the offsets buffer in order to know what portion of the data buffer to copy.

It seems unnecessary to copy 4 bytes of a buffer into an owning variant covered by the first bullet, but 🤷.

This PR currently provides support for the CPU device, Apple Metal, CUDA, and CUDA_HOST (i.e., CPU memory that has been registered with CUDA, which CUDA copies under the hood).

---------

Co-authored-by: Keith Kraus <keith.j.kraus@gmail.com>
… Data support (#40708)

### Rationale for this change

We defined a protocol exposing the C Data Interface (schema, array and stream) in Python through PyCapsule objects and dunder methods `__arrow_c_schema/array/stream__` (#35531 / #37797). We also expanded the C Data Interface with device capabilities: https://arrow.apache.org/docs/dev/format/CDeviceDataInterface.html (#34972). This expands the Python exposure of the interface with support for the newer Device structs.

### What changes are included in this PR?

Update the specification to define two additional dunders:

* `__arrow_c_device_array__` returns a pair of PyCapsules containing a C ArrowSchema and ArrowDeviceArray, where the latter uses "arrow_device_array" for the capsule name
* `__arrow_c_device_stream__` returns a PyCapsule containing a C ArrowDeviceArrayStream, where the capsule must have a name of "arrow_device_array_stream"

### Are these changes tested?

Spec-only change

* GitHub Issue: #38325

Lead-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
Co-authored-by: Dewey Dunnington <dewey@dunnington.ca>
Co-authored-by: Antoine Pitrou <pitrou@free.fr>
Co-authored-by: Matt Topol <zotthewizard@gmail.com>
Signed-off-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
### Rationale for this change

In order to support non-CPU devices and memory usage, we can add new `ArrowDeviceArray` and `ArrowDeviceArrayStream` structs to the C Data Interface in order to allow for handling these types of memory.

### What changes are included in this PR?

Definitions for the new `ArrowDeviceArray` and `ArrowDeviceArrayStream` structs and the `ArrowDeviceType` enum.
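For reference, a sketch of roughly what the added definitions look like (abridged; the authoritative versions live in `cpp/src/arrow/c/abi.h` and the C Device Data Interface documentation, and the device-type list here is truncated):

```c
// Abridged sketch of the added definitions; see cpp/src/arrow/c/abi.h for the
// authoritative versions.
typedef int32_t ArrowDeviceType;

#define ARROW_DEVICE_CPU 1        // CPU device
#define ARROW_DEVICE_CUDA 2       // CUDA GPU device
#define ARROW_DEVICE_CUDA_HOST 3  // pinned CUDA host memory
// ... additional values for OpenCL, Vulkan, Metal, ROCm, etc.

struct ArrowDeviceArray {
  struct ArrowArray array;      // the array data itself (buffers are device pointers)
  int64_t device_id;            // device id the buffers are allocated on
  ArrowDeviceType device_type;  // type of device holding the buffers
  void* sync_event;             // optional event the consumer should wait on
  int64_t reserved[3];          // reserved for future expansion
};

struct ArrowDeviceArrayStream {
  ArrowDeviceType device_type;  // device type all produced arrays will live on
  int (*get_schema)(struct ArrowDeviceArrayStream* self, struct ArrowSchema* out);
  int (*get_next)(struct ArrowDeviceArrayStream* self, struct ArrowDeviceArray* out);
  const char* (*get_last_error)(struct ArrowDeviceArrayStream* self);
  void (*release)(struct ArrowDeviceArrayStream* self);
  void* private_data;
};
```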