Skip to content

CTranslate2 1.11.0

Compare
Choose a tag to compare
@guillaumekln guillaumekln released this 29 Jun 16:02
· 1235 commits to master since this release

New features

  • Add tokenization and detokenization hooks for file translation APIs
  • Add alternatives to Intel MKL:
    • Integrate oneDNN for GEMM functions
    • Implement vectorized operators that automatically select the instruction set architecture (ISA) (can be manually controlled with the CT2_FORCE_CPU_ISA environment variable)
  • When alternatives are available, avoid using Intel MKL on non Intel processors (can be manually controlled with the CT2_USE_MKL environment variable)
  • Enable a verbose mode with the environment variable CT2_VERBOSE=1 to help debugging the run configuration (e.g. the detected CPU, whether Intel MKL is being used, etc.)

Fixes and improvements

  • Improve numerical precision of SoftMax and LogSoftMax layers on CPU
  • Parallelize INT16 quantization/dequantization and ReLU on CPU
  • Add back the translation client in CentOS 7 Docker images