[BYOC][TRT] Allocate GPU data buffers and transfer data when needed #6872
Conversation
Only a few comments.
@@ -106,9 +104,11 @@ class TensorRTRuntime : public JSONRuntimeBase {
 #ifdef TVM_GRAPH_RUNTIME_TENSORRT
   /*! \brief Run inference using built engine. */
   void Run() override {
     BuildEngine();
Is the reason for moving BuildEngine from Init to Run that you need subgraph-specific information (e.g., I/O data entry IDs) to allocate device buffers?
Thanks @comaniac for the review! Yes, to allocate the device buffers we need the DLTensor context and shape. data_entry_ in the JSON runtime isn't initialized until Run(), so I had to move BuildEngine.
In the future we plan to build engines dynamically for different input shapes in order to handle subgraphs with dynamic input sizes, so moving it would be needed for that anyway.
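For illustration, a minimal sketch of the lazy-build pattern being discussed, assuming an engine_built_ flag as the one-time guard (names other than Run, BuildEngine, and data_entry_ are hypothetical, not the PR's exact code):

```cpp
// Sketch only: BuildEngine() moves from Init() to Run() because data_entry_
// is populated by the JSON runtime just before the first Run(), not at Init().
// A guard keeps subsequent Run() calls reusing the cached engine.
void Run() override {
  BuildEngine();  // builds on the first call, no-op afterwards
  // ... bind input/output buffers and execute the TensorRT engine ...
}

void BuildEngine() {
  if (engine_built_) return;  // hypothetical flag guarding the one-time build
  // data_entry_ is valid here, so DLTensor context and shape information is
  // available for allocating device buffers.
  // ... translate the JSON subgraph into a TensorRT network and build it ...
  engine_built_ = true;
}

bool engine_built_ = false;  // hypothetical member of TensorRTRuntime
```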
LGTM
…pache#6872)
* Allocate data buffers for gpu fix
* Rename AllocateDeviceBuffer, update docstrings
* Remove unneeded cast
This PR enables the TRT BYOC integration to be used with target="llvm" (previously only "cuda" could be used).
If an input or output DLTensor is not located on the GPU, we now allocate a GPU buffer to pass to TensorRT and transfer the data to/from the DLTensor accordingly. Since data_entry_ is now needed during BuildEngine, BuildEngine had to move from JsonRuntime::Init to the first run.
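As a rough sketch of what "allocate a GPU buffer and transfer the data" entails: the PR's helper is named AllocateDeviceBuffer, but the free functions and the GetDataSizeBytes helper below are hypothetical, written against the dlpack DLTensor layout and the CUDA runtime API (newer dlpack spells the fields device/kDLCUDA; PR-era code used ctx/kDLGPU):

```cpp
#include <cuda_runtime.h>
#include <dlpack/dlpack.h>
#include <cstddef>

// Number of bytes held by a DLTensor (dense layout assumed).
size_t GetDataSizeBytes(const DLTensor& t) {
  size_t size = (t.dtype.bits * t.dtype.lanes + 7) / 8;
  for (int i = 0; i < t.ndim; ++i) size *= static_cast<size_t>(t.shape[i]);
  return size;
}

// If t is not already on the GPU, allocate a device staging buffer of the
// same size; otherwise TensorRT can consume t.data directly.
void* AllocateDeviceBufferIfNeeded(const DLTensor& t) {
  if (t.device.device_type == kDLCUDA) return t.data;
  void* device_buf = nullptr;
  cudaMalloc(&device_buf, GetDataSizeBytes(t));
  return device_buf;
}

// Copy a host-resident input into its staging buffer before inference.
void CopyInputToDevice(const DLTensor& t, void* device_buf) {
  if (t.device.device_type != kDLCUDA) {
    cudaMemcpy(device_buf, t.data, GetDataSizeBytes(t), cudaMemcpyHostToDevice);
  }
}

// Copy results back into a host-resident output tensor after inference.
void CopyOutputFromDevice(DLTensor* t, const void* device_buf) {
  if (t->device.device_type != kDLCUDA) {
    cudaMemcpy(t->data, device_buf, GetDataSizeBytes(*t), cudaMemcpyDeviceToHost);
  }
}
```

When the tensor is already on the GPU, no extra buffer or copy is needed, so the pure-"cuda" path keeps its previous behavior.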
This is a prerequisite for using TRT BYOC in combination with the Relay VM, which in general requires the llvm target.
Thanks @ylc for the original implementation: neo-ai#147