Repositories list
13 repositories
compressa-perf (Public)
compressa-ai.github.io (Public)
compressa-deploy (Public)
vllm (Public)
langchain_compressa (Public)
qlora (Public)
llm-awq (Public)
OmniQuant (Public)
rulm (Public)
AutoAWQ (Public)
smoothquant (Public)
peft (Public)
neural-compressor (Public): Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool) aims to provide unified APIs for network compression techniques, such as low-precision quantization, sparsity, pruning, and knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.