Beam search algorithm implementation for TDT models #6299

L2_Megatron_GPT_with_ResetLR_Pretraining_and_Resume_Training_TP2  /  main

succeeded Nov 13, 2024 in 4m 9s