Skip to content

Latest commit

 

History

History
25 lines (20 loc) · 1.38 KB

README.md

File metadata and controls

25 lines (20 loc) · 1.38 KB

TensorRT 6.0.1 preformance regression against TensorFlow with 3D convolutions and pooling

This repository is meant to contain a dataset and scripts to reproduce the performance regression we have observed across TF and TF-TRT. It appears that running a network that processes 3D convolutions and pooling layers is executed faster through TF than through TF-TRT. We have observed this behavious on three GPUs: GTX 1080 8GB, RTX 2080 Ti 12GB and V100 16GB. In our measurements the TF-TRT slowdown is around 27-44%. We have used batchsize=1 and FP32 everywhere.

TensorFlow needs to built against TensorRT 6 and needs to contain at least 4297539768bfb6d45d3248fc6471e84e260efc6c.

Usage

  • benchmark.sh: run a TF 3D convnet (model_100.pb) through TF benchmark tool
  • convert.py: convert TF model to TF-TRT (model_100.pb -> model_100-tf-trt.pb)
  • benchmark-tf-trt.sh: run the converted TF-TRT 3D convnet (model_100-tf-trt.pb) through TF benchmarking tool

Links

https://devtalk.nvidia.com/default/topic/1064822/tensorrt/tensorrt-6-slower-than-tensorflow-with-3d-convolutions-and-pooling/ https://stackoverflow.com/questions/58607849/how-to-avoid-tensorrt-6-0-1-preformance-regression-against-tensorflow-with-3d-co