GitHub - jeng1220/trt-se-resnext: a sample, running se-resnext on TensorRT

Requirement

TensorRT 5.0 GA
Tensorflow with GPU support
PyCUDA
Python3
Cmake (>= 3.8)

Assume that the PB file is located at <path to this project>/data and named se-resnext.pb

Get UFF from Tensorflow protobuf

$ cd <path to this project>/data
$ python3 <path to uff-converter-tf>/convert_to_uff.py <your PB file> -p preprocess.py

For instance:

$ cd data
$ python3 /usr/local/lib/python3.5/dist-packages/uff/bin/convert_to_uff.py se-resnext.pb -p preprocess.py
# or
$ python3 /usr/lib/python3.5/dist-packages/uff/bin/convert_to_uff.py se-resnext.pb -p preprocess.py

You should get an UFF file which may be named se-resnext.uff in data folder

Verify TensorRT result in FP32 mod

Run Tensorflow

$ cd <path to this project>/verification
$ python3 tf_sample.py

Run TensorRT

$ cd <path to this project>/verification
$ python3 trt_sample.py

The results should be same.

Run performance benchmark

$ cd <path to this project>/data
$ <path to TensorRT>/trtexec --uff=<your UFF file> --output=softmax --uffInput=<input name>,3,224,224 --batch=<batch size>

For instance:

$ cd data
$ /usr/src/tensorrt/bin/trtexec --uff=se-resnext.uff --output=softmax --uffInput=tf_feed_image,3,224,224 --batch=32

Build TensorRT engine (C++)

$ mkdir <path to this project>/build
$ cd <path to this project>/build
$ cmake ..
$ make -j2

Run TensorRT engine

$ cd <path to this project>/build
# It is slow at first time because of generating TensorRT engine binary
$ ./trt_se_resnext
# The executable will use TensorRT engine binary at second time. It will be much faster in initialization
$ ./trt_se_resnext

Test environment

CUDA 10
cuDNN 7.3.1
TensorRT 5.0 GA
Tensorflow 18.11-py3 from NGC
Ubuntu 16.04
Cmake 3.8

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
data		data
trt-se-resnext		trt-se-resnext
verification		verification
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Requirement

Get UFF from Tensorflow protobuf

Verify TensorRT result in FP32 mod

Run Tensorflow

Run TensorRT

Run performance benchmark

Build TensorRT engine (C++)

Run TensorRT engine

Test environment

About

Releases

Packages

Languages

jeng1220/trt-se-resnext

Folders and files

Latest commit

History

Repository files navigation

Requirement

Get UFF from Tensorflow protobuf

Verify TensorRT result in FP32 mod

Run Tensorflow

Run TensorRT

Run performance benchmark

Build TensorRT engine (C++)

Run TensorRT engine

Test environment

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages