CTranslate2 1.2.1
CTranslate2 1.2.1
Fixes and improvements
- Avoid copying decoder states when possible to improve decoding performance (10% to 20% faster)
- Fix execution profiling on GPU (device was not synchronized before measuring the time)
- Include
Mul
operation in profiling report - Add a Python 3 wheel in Ubuntu Docker images