Implemented kDLCPUPinned (cudaMallocHost) #4985

jmorrill · 2020-03-04T19:55:14Z

Host memory allocated via cudaMallocHost is page-locked (pinned), which is generally faster for transferring data to and from a CUDA device than ordinary pageable memory. It was not implemented in the TVM runtime.

Each DeviceAPI treats a DLDeviceType as its own device, so kDLCPUPinned felt a bit out of place: it is host memory, like kDLCPU, but it is allocated and owned through the CUDA API, like kDLGPU.

I felt the least complicated path was to register "device_api.cpu_pinned" as an alias for "device_api.gpu" and implement the kDLCPUPinned logic in CUDADeviceAPI.
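To illustrate the aliasing idea, here is a minimal, GPU-free sketch and not the code from this PR. `SketchCudaDeviceAPI`, `Context`, and `LookupDeviceAPI` are hypothetical names invented for the example, and `std::malloc`/`std::free` stand in for the CUDA calls named in the comments. It only shows the shape of the approach: one object registered under two names, branching on the device type when allocating.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>
#include <functional>
#include <map>
#include <string>

// The three DLPack device type values relevant to this discussion.
enum DeviceType { kDLCPU = 1, kDLGPU = 2, kDLCPUPinned = 3 };

// Hypothetical, simplified stand-in for a device context.
struct Context {
  DeviceType device_type;
  int device_id;
};

// Hypothetical stand-in for the CUDA device API. It never touches a GPU:
// std::malloc/std::free simulate the real allocation calls.
class SketchCudaDeviceAPI {
 public:
  int pinned_allocs = 0;  // how many pinned-host allocations were requested
  int device_allocs = 0;  // how many device allocations were requested

  void* AllocDataSpace(Context ctx, std::size_t nbytes) {
    if (ctx.device_type == kDLCPUPinned) {
      ++pinned_allocs;  // a real implementation would call cudaMallocHost here
    } else {
      assert(ctx.device_type == kDLGPU);
      ++device_allocs;  // a real implementation would call cudaMalloc here
    }
    return std::malloc(nbytes);
  }

  void FreeDataSpace(Context ctx, void* ptr) {
    // A real implementation would pick cudaFreeHost or cudaFree by device type.
    assert(ctx.device_type == kDLCPUPinned || ctx.device_type == kDLGPU);
    std::free(ptr);
  }

  static SketchCudaDeviceAPI* Global() {
    static SketchCudaDeviceAPI inst;
    return &inst;
  }
};

using Factory = std::function<SketchCudaDeviceAPI*()>;

// Hypothetical name-based registry: both names resolve to the same object,
// which is the "alias" described above.
SketchCudaDeviceAPI* LookupDeviceAPI(const std::string& name) {
  static const std::map<std::string, Factory> registry = {
      {"device_api.gpu", [] { return SketchCudaDeviceAPI::Global(); }},
      {"device_api.cpu_pinned", [] { return SketchCudaDeviceAPI::Global(); }},
  };
  return registry.at(name)();
}
```

With this layout, a lookup of either "device_api.gpu" or "device_api.cpu_pinned" returns the same instance, and the pinned-versus-device decision lives in a single place inside the allocation method.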

A few small checks also had to be modified. I'm not sure whether I missed any.

I'm open to suggestions if my implementation is way off.

tqchen · 2020-03-10T02:39:50Z

Thanks @jmorrill! This is merged.

* implement kDLCPUPinned
* Fix line endings
* Fix whitespace for linter
* cleanup up allocdataspace method

jmorrill added 4 commits March 4, 2020 11:38

* 3a3d60f implement kDLCPUPinned
* ee5e8de Fix line endings
* 2b877fd Fix whitespace for linter
* 7577b2a cleanup up allocdataspace method

tqchen approved these changes Mar 10, 2020


tqchen merged commit fd39c5c into apache:master Mar 10, 2020

tqchen added the status: accepted label Mar 10, 2020

trevor-m pushed a commit to trevor-m/tvm that referenced this pull request Apr 16, 2020

Implemented kDLCPUPinned (cudaMallocHost) (apache#4985)

c0eeab1


zhiics pushed a commit to neo-ai/tvm that referenced this pull request Apr 17, 2020

Implemented kDLCPUPinned (cudaMallocHost) (apache#4985)

12cd550


ZihengJiang mentioned this pull request Sep 25, 2020

TVM v0.7 Release Note Candidate #6486

Closed
