
TRTIS should support variable-sized input and output tensor dimensions #8

Closed
deadeyegoodwin opened this issue Nov 30, 2018 · 15 comments
Labels
enhancement New feature or request

Comments

@deadeyegoodwin
Contributor

Currently TRTIS only allows the first dimension of an input/output tensor to be variable-sized, and only when that dimension represents batching. TRTIS should allow variable-sized dimensions in other cases, since these are supported by some of the frameworks (e.g. TensorFlow), and the lack of support limits which models can easily run on TRTIS.
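As a rough sketch of the limitation (hypothetical model and tensor names, using the model configuration format that appears later in this thread): every non-batch dimension must currently be given a fixed size, and only the implicit batch dimension, bounded by max_batch_size, may vary per request.

name: "my_detector"              # hypothetical model name
platform: "tensorflow_graphdef"
max_batch_size: 8                # only the batch dimension may vary
input [
  {
    name: "image"
    data_type: TYPE_FP32
    dims: [ 300, 300, 3 ]        # all other dimensions must be fixed sizes
  }
]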

@dcyoung

dcyoung commented Nov 30, 2018

The described limitation is stalling our team's migration from tf-serving to tensorrt-inference-server. Glad to hear the need is understood and we will be following the progress eagerly!

@xinli94

xinli94 commented Dec 3, 2018

Thanks for taking this into consideration! This has blocked me from switching to the TensorRT Inference Server for quite a while, and it also makes deploying detection models a pain. Hope it gets supported soon!

@deadeyegoodwin added the enhancement (New feature or request) label Dec 15, 2018
@tilaba

tilaba commented Dec 25, 2018

Hi, can TRTIS support models with multiple outputs?

@bezero

bezero commented Dec 25, 2018

Hi @tilaba. Yes, it does support multiple outputs. For example, in the case of the TensorFlow Object Detection API, you would set the outputs in the config file as follows:
output [
  {
    name: "detection_boxes"
    data_type: TYPE_FP32
    dims: [ 100, 4 ]
  },
  {
    name: "detection_scores"
    data_type: TYPE_FP32
    dims: [ 100 ]
  },
  {
    name: "detection_classes"
    data_type: TYPE_FP32
    dims: [ 100 ]
  }
]
It is an array of outputs. Later, when sending a request, you can use:
results = ctx.run(
    { input_name: input_data },
    { output: InferContext.ResultFormat.RAW for output in output_names },
    batch_size)
where output_names = ["detection_boxes", "detection_scores", "detection_classes"] and input_data is the list of input arrays for the request.

@tilaba

tilaba commented Dec 25, 2018

it works, thanks @bezero

@tilaba

tilaba commented Dec 25, 2018

Has this issue been fixed?

@blackarrow3542

This will be really useful for CTPN and CRNN models for OCR.

@deadeyegoodwin
Contributor Author

The inference server now supports variable-size input and output tensor dimensions for backends that support them. As of now that is Tensorflow, Caffe2, and custom (assuming your custom backend handles them correctly). You specify such a dimension by using -1 in the model configuration for the appropriate dimension.
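For example, a minimal sketch of such a configuration (hypothetical model and tensor names), using -1 for each dimension that may vary per request:

name: "my_ocr_model"             # hypothetical model name
platform: "tensorflow_graphdef"
max_batch_size: 8
input [
  {
    name: "image"
    data_type: TYPE_FP32
    dims: [ -1, -1, 3 ]          # variable height and width, fixed channel count
  }
]
output [
  {
    name: "text_logits"
    data_type: TYPE_FP32
    dims: [ -1 ]                 # variable-length output
  }
]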

This support is on the master branch and will be in the 19.02 release. Please give it a try and report any issues.

@dcyoung

dcyoung commented Feb 1, 2019

@deadeyegoodwin With this feature, our team is excited to explore a migration from tf-serving to TRTIS. Thank you for responding to the community feedback. It is much appreciated.

@bezero

bezero commented Feb 22, 2019

@deadeyegoodwin When will the TRTIS container for the 19.02 release be available?

@deadeyegoodwin
Contributor Author

The monthly container releases are typically available around the 25th. So, following typical practice, 19.02 would be available around Monday 2/25. But this month I think it may be delayed till the end of that week.

@ziyuang

ziyuang commented Apr 19, 2019

> The inference server now supports variable-size input and output tensor dimensions for backends that support them. As of now that is Tensorflow, Caffe2, and custom (assuming your custom backend handles them correctly). You specify such a dimension by using -1 in the model configuration for the appropriate dimension.
>
> This support is on the master branch and will be in the 19.02 release. Please give it a try and report any issues.

How come TRTIS supports dynamic input size while TensorRT itself doesn't?

@bezero

bezero commented Apr 19, 2019

@ziyuang TRTIS does not support only TensorRT models; it supports several platforms (tensorrt_plan, tensorflow_graphdef, tensorflow_savedmodel, caffe2_netdef, or custom). Some of these platforms support dynamic input sizes. To sum up, TRTIS allows you to specify dynamic input sizes for models that are able to handle such inputs.
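As a sketch (hypothetical model name; the platform value is one of those listed above), the same -1 notation applies regardless of which supported backend runs the model, provided that backend can actually execute variable shapes:

name: "my_savedmodel"                # hypothetical model name
platform: "tensorflow_savedmodel"    # selects the serving backend/framework
max_batch_size: 4
input [
  {
    name: "input_tensor"
    data_type: TYPE_UINT8
    dims: [ -1, -1, 3 ]              # the SavedModel itself must accept variable shapes
  }
]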

@ziyuang

ziyuang commented Apr 19, 2019

> @ziyuang TRTIS does not support only TensorRT models; it supports several platforms (tensorrt_plan, tensorflow_graphdef, tensorflow_savedmodel, caffe2_netdef, or custom). Some of these platforms support dynamic input sizes. To sum up, TRTIS allows you to specify dynamic input sizes for models that are able to handle such inputs.

Good. Would the computation graph still be optimized if I use a model format other than a TensorRT PLAN?

@deadeyegoodwin
Contributor Author

Each backend/framework (TensorRT, TensorFlow, Caffe2) has its own optimization techniques that it applies to the model before execution. Typically, the optimizations performed by TensorRT provide significant speedups relative to the other frameworks, but TensorFlow does some optimization as well. There is also the TensorRT-TensorFlow integration, which lets you get many of the benefits of TensorRT while still using TensorFlow. TRTIS fully supports TensorFlow models that have been optimized with TensorRT: https://github.com/tensorflow/tensorrt
