CTranslate2 1.10.1
Fixes and improvements
- Force `intra_threads` to 1 when running a model on GPU to prevent high CPU load
- Improve handling of decoding length constraints when using a target prefix
- Do not raise an error when setting `use_vmap` but no vocabulary map exists
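
As a rough illustration of the first and last items, the resulting behavior can be sketched in plain Python. This is not the library's implementation: the helper names `effective_intra_threads` and `resolve_use_vmap` are hypothetical, and the `"cuda"` device string is assumed here only for the example.

```python
def effective_intra_threads(device: str, requested: int) -> int:
    """Hypothetical helper: pick the intra-op thread count for a device."""
    # On GPU, the requested value is overridden and forced to 1
    # to prevent high CPU load.
    if device == "cuda":
        return 1
    return requested


def resolve_use_vmap(use_vmap: bool, vmap_exists: bool) -> bool:
    """Hypothetical helper: decide whether the vocabulary map is applied."""
    # Requesting the map when none exists is no longer an error;
    # in this sketch the option is simply ignored.
    return use_vmap and vmap_exists


print(effective_intra_threads("cuda", 4))
print(effective_intra_threads("cpu", 4))
print(resolve_use_vmap(True, False))
```

Under these assumptions, a caller that passes `intra_threads=4` with a GPU device ends up with a single thread, and one that sets `use_vmap` without a vocabulary map proceeds without it.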