
Conformer lm #54

Open · wants to merge 29 commits into master

Conversation

danpovey (Collaborator):

No description provided.

@csukuangfj (Collaborator):

We will add a decoding script to it.

@danpovey (Collaborator, Author):

See also https://tensorboard.dev/experiment/unF4gSyjRjua2DSKgb3BMg/
and /ceph-dan/icefall/egs/librispeech/ASR/conformer_m/exp_6

csukuangfj self-assigned this Sep 25, 2021
bos or eos symbols).
"""
# in future will just do:
#return self.words[self.sentences[i]].tolist()
Collaborator:

Supported by k2-fsa/k2#833
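
For illustration, here is a minimal plain-PyTorch sketch (not the k2 API; the helper names here are made up for this example) of what the commented-out one-liner would compute, assuming words and sentences are ragged arrays stored as (values, row_splits) pairs:

import torch

def ragged_index(values: torch.Tensor, row_splits: torch.Tensor, i: int) -> torch.Tensor:
    # Return the i-th sub-list of a ragged array stored as (values, row_splits).
    return values[row_splits[i]:row_splits[i + 1]]

def sentence_token_ids(words_values, words_row_splits,
                       sents_values, sents_row_splits, i: int) -> list:
    # Word indices that make up sentence i.
    word_ids = ragged_index(sents_values, sents_row_splits, i)
    # Concatenate the token IDs of each word, i.e. words[sentences[i]],
    # and convert to a Python list as the one-liner's .tolist() would do.
    return torch.cat([ragged_index(words_values, words_row_splits, int(w))
                      for w in word_ids]).tolist()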

@danpovey (Collaborator, Author) commented Sep 25, 2021 via email

cnn_module_kernel,
)
self.encoder = MaskedLmConformerEncoder(encoder_layer, num_encoder_layers,
norm=nn.LayerNorm(d_model))
Collaborator:

We are using pre-normalization here.
You have placed a layer norm at the end of the encoder layer, and you are using an extra layer norm here, which means you are doing

x = layernorm(layernorm(x))

See

if self.norm is not None:
    x = self.norm(x)

I just realized that it is even worse.

You are using a layer norm at both ends of an encoder layer, but the encoder layers are connected end-to-end, which means the output of the layer norm from the previous encoder layer is used as the input of the layer norm of the next encoder layer.

Collaborator (Author):

This "even worse" part is not quite right, because there are bypass connections, so there are paths involving residuals where the input is used without layer norm.

Collaborator (Author):

.. but yes, I agree that the LayerNorm at the end of the conformer encoder is redundant. I have since stopped using that. But in this particular case, it would take a long time to retrain the model if we were to fix it, so I'd say leave it as-is for now.

Collaborator:

so I'd say leave it as-is for now.

Yes, I agree. After finishing the decoding script, I would recommend removing it and re-running the whole pipeline.

lm_dir=data/lm_training_${vocab_size}
mkdir -p $lm_dir
log "Stage 9: creating $lm_dir/lm_data.pt"
./local/prepare_lm_training_data.py data/lang_bpe_${vocab_size}/bpe.model download/lm/librispeech-lm-norm.txt $lm_dir/lm_data.pt
Collaborator:

Suggested change:
- ./local/prepare_lm_training_data.py data/lang_bpe_${vocab_size}/bpe.model download/lm/librispeech-lm-norm.txt $lm_dir/lm_data.pt
+ ./local/prepare_lm_training_data.py $lang_dir/bpe.model $dl_dir/lm/librispeech-lm-norm.txt $lm_dir/lm_data.pt


# Calling this on all copies of a DDP setup will sync the sizes so that
# all copies have the exact same number of batches. I think
# this needs to be called with the GPU device, not sure if it would
# work otherwise.
Collaborator:

It will not work for CPU devices as DDP requires GPU devices.


def _sync_sizes(self, device: torch.device = torch.device('cuda')):
    # Calling this on all copies of a DDP setup will sync the sizes so that
    # all copies have the exact same number of batches. I think
Collaborator:

Shall we mention that without doing this, the training process
will hang indefinitely at the end?

Collaborator (Author):

Mm, sure...
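
For context, a rough sketch (not the actual implementation; the helper name and details are assumptions) of how such a size sync can be done with torch.distributed, taking the minimum batch count over all ranks on a CUDA tensor so the NCCL all-reduce works and no rank is left waiting at the end of the epoch:

import torch
import torch.distributed as dist

def sync_num_batches(num_batches: int,
                     device: torch.device = torch.device('cuda')) -> int:
    # No-op when not running under torch.distributed.
    if not (dist.is_available() and dist.is_initialized()):
        return num_batches
    # NCCL requires CUDA tensors, hence the GPU device argument.
    t = torch.tensor([num_batches], dtype=torch.int64, device=device)
    # Use the minimum so every rank iterates the same number of batches;
    # otherwise ranks with fewer batches finish the epoch early and the
    # remaining ranks block indefinitely in their next collective call.
    dist.all_reduce(t, op=dist.ReduceOp.MIN)
    return int(t.item())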

csukuangfj added a commit to csukuangfj/icefall that referenced this pull request Nov 1, 2021
csukuangfj added a commit to csukuangfj/icefall that referenced this pull request Nov 17, 2021