This repository contains a dataset and scripts to reproduce a performance regression we have observed between TF and TF-TRT. A network built from 3D convolutions and pooling layers runs faster through native TF than through TF-TRT. We have observed this behaviour on three GPUs: GTX 1080 8GB, RTX 2080 Ti 11GB and V100 16GB. In our measurements the TF-TRT slowdown is around 27-44%. We used batch size 1 and FP32 everywhere.
TensorFlow needs to be built against TensorRT 6 and must include at least commit 4297539768bfb6d45d3248fc6471e84e260efc6c.
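As a quick sanity check that the installed build actually links against TensorRT 6, something like the snippet below can be used. This is a hedged sketch, not part of the repository: it assumes a TF 1.15/2.0-era build in which the `trt_convert` module exposes `get_linked_tensorrt_version()` and `get_loaded_tensorrt_version()`; if those helpers are not present in your build, check the build configuration instead.

```python
# Sanity check (assumption: TF 1.15/2.0-era API where trt_convert exposes
# get_linked_tensorrt_version / get_loaded_tensorrt_version).
import tensorflow as tf
from tensorflow.python.compiler.tensorrt import trt_convert as trt

print("TensorFlow version:", tf.__version__)
print("TensorRT linked against:", trt.get_linked_tensorrt_version())
print("TensorRT loaded at runtime:", trt.get_loaded_tensorrt_version())
```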
- benchmark.sh: run the TF 3D convnet (model_100.pb) through the TF benchmark tool
- convert.py: convert the TF model to TF-TRT (model_100.pb -> model_100-tf-trt.pb); a hedged sketch of this conversion follows the list
- benchmark-tf-trt.sh: run the converted TF-TRT 3D convnet (model_100-tf-trt.pb) through the same TF benchmark tool
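For reference, the TF-to-TF-TRT conversion in convert.py amounts to something like the following. This is a minimal sketch rather than a copy of the script: it assumes the TF 1.15-era `TrtGraphConverter` API, and the output node name `'output'` is a placeholder that has to be replaced with the real output node of model_100.pb.

```python
# Minimal TF-TRT conversion sketch (assumption: TF 1.15-era TrtGraphConverter
# API; 'output' is a placeholder for the real output node of model_100.pb).
import tensorflow as tf
from tensorflow.python.compiler.tensorrt import trt_convert as trt

# Load the frozen TF graph.
with tf.io.gfile.GFile('model_100.pb', 'rb') as f:
    frozen_graph = tf.compat.v1.GraphDef()
    frozen_graph.ParseFromString(f.read())

# Convert with the same settings as our measurements: FP32, batch size 1.
converter = trt.TrtGraphConverter(
    input_graph_def=frozen_graph,
    nodes_blacklist=['output'],   # placeholder output node name
    precision_mode='FP32',
    max_batch_size=1)
trt_graph = converter.convert()

# Write the converted graph next to the original.
with tf.io.gfile.GFile('model_100-tf-trt.pb', 'wb') as f:
    f.write(trt_graph.SerializeToString())
```

The benchmark scripts then feed model_100.pb and model_100-tf-trt.pb to the same TF benchmark tool, so only the graph under test differs between the two runs.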
Related discussions:
- https://devtalk.nvidia.com/default/topic/1064822/tensorrt/tensorrt-6-slower-than-tensorflow-with-3d-convolutions-and-pooling/
- https://stackoverflow.com/questions/58607849/how-to-avoid-tensorrt-6-0-1-preformance-regression-against-tensorflow-with-3d-co