Add modified beam search for pruned rnn-t. #248

csukuangfj · 2022-03-12T03:29:24Z

Training command

./pruned_transducer_stateless/train.py \
  --world-size 8 \
  --num-epochs 60 \
  --start-epoch 0 \
  --exp-dir pruned_transducer_stateless/exp \
  --full-libri 1 \
  --max-duration 300 \
  --prune-range 5 \
  --lr-factor 5 \
  --lm-scale 0.25

Tensorboard log https://tensorboard.dev/experiment/WKRFY5fYSzaVBHahenpNlA/

Decoding command

for epoch in 42; do
  for avg in 11; do
    ./pruned_transducer_stateless/decode.py \
      --epoch $epoch \
      --avg $avg \
      --exp-dir ./pruned_transducer_stateless/exp \
      --max-duration 100 \
      --decoding-method greedy_search \
      --max-sym-per-frame 1
  done
done

for epoch in 42; do
  for avg in 11; do
    ./pruned_transducer_stateless/decode.py \
      --epoch $epoch \
      --avg $avg \
      --exp-dir ./pruned_transducer_stateless/exp \
      --max-duration 100 \
      --decoding-method modified_beam_search \
      --beam-size 4
  done
done

for epoch in 42; do
  for avg in 11; do
    ./pruned_transducer_stateless/decode.py \
      --epoch $epoch \
      --avg $avg \
      --exp-dir ./pruned_transducer_stateless/exp \
      --max-duration 100 \
      --decoding-method beam_search \
      --beam-size 4
  done
done

Decoding results:

decoding method	test-clean	test-other	comment
greedy search (--max-sym-per-frame 1)	2.62	6.37	--epoch 42 --avg 11 --max-duration 100
greedy search (--max-sym-per-frame 2)	2.62	6.37	--epoch 42 --avg 11 --max-duration 100
greedy search (--max-sym-per-frame 3)	2.62	6.37	--epoch 42 --avg 11 --max-duration 100
modified beam search (--beam-size 4)	2.56	6.27	--epoch 42 --avg 11 --max-duration 100
beam search (--beam-size 4)	2.57	6.27	--epoch 42 --avg 11 --max-duration 100

Note:

The model is not trained using modified transducer.
By modified beam search, it means it hardcodes --max-sym-per-frame=1 during beam search.
The current implementation of beam search is super slow and we recommend using only modified beam search.
For the decoding time of test-clean and test-other, see the table listed as follows:

decoding method	test-clean (seconds)	test-other (seconds)
greedy search (--max-sym-per-frame=1)	160	159
greedy search (--max-sym-per-frame=2)	184	177
greedy search (--max-sym-per-frame=3)	210	213
modified beam search (--beam-size 4)	273	269
beam search (--beam-size 4)	2741	2221

Will update RESULTS.md and upload the pre-trained model to hugging face later today.

pkufool · 2022-03-12T03:54:33Z

Does this model contain extra nn.Linear()?

csukuangfj · 2022-03-12T03:55:58Z

Does this model contain extra nn.Linear()?

Yes. It uses the code from the master.

csukuangfj · 2022-03-12T07:24:29Z

A pre-trained model, the decoding logs, and the decoding results are uploaded to
https://huggingface.co/csukuangfj/icefall-asr-librispeech-pruned-transducer-stateless-2022-03-12

danpovey · 2022-03-12T09:03:03Z

A great feature!

csukuangfj added 2 commits March 12, 2022 10:42

Add modified beam search for pruned rnn-t.

bd033de

Fix style issues.

d9beb73

Update RESULTS.md.

c5291c8

csukuangfj added 3 commits March 12, 2022 15:26

Fix typos.

33c0f8f

Minor fixes.

949b532

Test the pre-trained model using GitHub actions.

25643e0

csukuangfj added the ready label Mar 12, 2022

csukuangfj added 2 commits March 12, 2022 15:59

Let the user install optimized_transducer on her own.

c0f4f62

Fix errors in GitHub CI.

4b606dd

csukuangfj added ready and removed ready labels Mar 12, 2022

csukuangfj merged commit bb7f6ed into k2-fsa:master Mar 12, 2022

csukuangfj deleted the modified-beam-search-for-pruned-rnnt branch March 12, 2022 08:16

pkufool mentioned this pull request Mar 14, 2022

Add fast beam search decoding #250

Merged

csukuangfj mentioned this pull request Mar 14, 2022

[Not for Merge]: Visualize the gradient of each node in the lattice. #251

Open

This was referenced Mar 21, 2022

Implement greedy search in batch mode for transducer decoding. #262

Merged

Support modified beam search in batch mode. #264

Merged

danpovey mentioned this pull request Apr 4, 2022

Reworked model #288

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add modified beam search for pruned rnn-t. #248

Add modified beam search for pruned rnn-t. #248

csukuangfj commented Mar 12, 2022 •

edited

Loading

pkufool commented Mar 12, 2022

csukuangfj commented Mar 12, 2022

csukuangfj commented Mar 12, 2022

danpovey commented Mar 12, 2022

Add modified beam search for pruned rnn-t. #248

Add modified beam search for pruned rnn-t. #248

Conversation

csukuangfj commented Mar 12, 2022 • edited Loading

pkufool commented Mar 12, 2022

csukuangfj commented Mar 12, 2022

csukuangfj commented Mar 12, 2022

danpovey commented Mar 12, 2022

csukuangfj commented Mar 12, 2022 •

edited

Loading