Block Floating Point (BFP) and DSConv with GPU support
This repo implements BFP and DSConv in cuda kernels to be used with PyTorch
- Python >= 3.6
- PyTorch >= 1.0
- CUB == 1.8
- Download CUB and put it in
/home/your_username/libs/
(or the file indicated atNUQ/BlackBox/Quantization/src/setup.py:22
) cd /path/to/NUQ/BlackBox/Quantization/src/
thenpython build_ext --inplace
.