Refactor int4 and int8 weight only quantization to use quantize
#1037
| Job | Run time |
|---|---|
| | 3m 32s |
| | 26s |
| | 0s |
| Total | 3m 58s |