Caffe-Int8-To-NCNN

This tools is base on TensorRT 2.0 Int8 calibration tools,which use the KL algorithm to find the suitable threshold to quantize the activions from Float32 to Int8(-127 - 127).

This code modify base on [https://github.com/BUG1989/caffe-int8-convert-tools]

Reference

For details, please read the following PDF:

8-bit Inference with TensorRT

MXNet quantization implement:

Quantization module for generating quantized (INT8) models from FP32 models

An introduction to the principles of a Chinese blog written by bruce.zhang:

The implement of Int8 quantize base on TensorRT

HowTo

The purpose of this tool(caffe-int8-to-ncnn.py) is to save the caffemodel as an int8 ncnn model and deploy it to ncnn.

This format is already supported in the ncnn latest version.

python caffe-int8-convert-tool-dev-weight.py -h
usage: caffe-int8-convert-tool-dev-weight.py [-h] [--proto PROTO] [--model MODEL]
                                  [--mean MEAN MEAN MEAN] [--norm NORM]
                                  [--images IMAGES] [--output_param OUTPUT_PARAM]
                                  [--output_bin OUTPUT_BIN] [--group GROUP]
                                  [--gpu GPU]

find the pretrained caffemodel int8 quantize scale value

optional arguments:
  -h, --help            	show this help message and exit
  --proto PROTO         	path to deploy prototxt.
  --model MODEL         	path to pretrained caffemodel
  --mean MEAN           	value of mean
  --norm NORM           	value of normalize(scale value or std value)
  --images IMAGES       	path to calibration images
  --output_param OUTPUT_PARAM     path to output ncnn param file
  --output_bin OUTPUT_BIN       path to output ncnn bin file
  --group GROUP         enable the group scale(0:disable,1:enable,default:1)
  --gpu GPU             use gpu to forward(0:disable,1:enable,default:0)
python caffe-int8-convert-tool-dev-weight.py --proto=test/models/mobilenet_v1.prototxt --model=test/models/mobilenet_v1.caffemodel --mean 103.94 116.78 123.68 --norm=0.017 --images=test/images/ output_param=pnet.param output_param=pnet.bin --group=1 --gpu=1

License

BSD 3 Clause

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitattributes		.gitattributes
LICENSE		LICENSE
README.md		README.md
caffe-int8-to-ncnn.py		caffe-int8-to-ncnn.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Caffe-Int8-To-NCNN

Reference

HowTo

License

About

Releases

Packages

Languages

License

w8501/caffe-int8-to-ncnn

Folders and files

Latest commit

History

Repository files navigation

Caffe-Int8-To-NCNN

Reference

HowTo

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages