Skip to content

CTranslate2 1.16.0

Compare
Choose a tag to compare
@guillaumekln guillaumekln released this 18 Nov 16:13
· 1099 commits to master since this release

Changes

  • Drop support for Python 2.7 and 3.5

New features

  • Add Docker images using CUDA 11.0

Fixes and improvements

  • [Python] Enable parallel CPU translations from translate_batch when setting inter_threads > 1 and max_batch_size > 0
  • Improve GPU performance on Turing architecture when using a Docker image or the Python package
  • Pad size of INT8 matrices to a multiple of 16 when the GPU has INT8 Tensor Cores
  • Add information about detected GPU devices in CT2_VERBOSE output
  • Update oneDNN to 1.7
  • [Python] Improve type checking for some arguments