
Refactor decode.py to make it more readable and more modular. #44

Merged (19 commits) on Sep 20, 2021

Conversation

csukuangfj (Collaborator)

No description provided.

nbest2 = nbest.intersect(lattice)
tot_scores = nbest2.tot_scores()
argmax = tot_scores.argmax()
best_path = k2.index_fsa(nbest2.fsa, argmax)
Collaborator (Author)

This shows how to do nbest decoding.

It is more readable than before, I think.
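
For readers skimming the thread, here is the same snippet with comments; the comments are my reading of the pattern, not documentation of the Nbest API:

# Commented restatement of the review snippet above; `nbest` is an Nbest built
# from the decoding lattice, and `lattice` is the decoding lattice (a k2.Fsa).
nbest2 = nbest.intersect(lattice)             # attach lattice scores to every n-best path
tot_scores = nbest2.tot_scores()              # total score of each path, grouped by utterance
argmax = tot_scores.argmax()                  # index of the best-scoring path per utterance
best_path = k2.index_fsa(nbest2.fsa, argmax)  # select those paths from the FsaVec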

@pkufool (Collaborator) commented on Sep 13, 2021

There is an implementation of rescoring an Nbest with the attention decoder in #5; its return value is also an Nbest.

icefall/icefall/decode.py

Lines 892 to 900 in 355e324

def rescore_nbest_with_attention_decoder(
nbest: Nbest,
model: nn.Module,
memory: torch.Tensor,
memory_key_padding_mask: torch.Tensor,
sos_id: int,
eos_id: int,
) -> Nbest:
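
A hedged sketch of how a function with this signature might be called; the names encoder_memory and encoder_padding_mask are illustrative assumptions, not identifiers from the PR:

# Hypothetical call based only on the signature quoted above. The tensors
# `encoder_memory` and `encoder_padding_mask` are assumed to come from the
# encoder forward pass, and `nbest` from a first decoding pass.
rescored = rescore_nbest_with_attention_decoder(
    nbest=nbest,
    model=model,
    memory=encoder_memory,
    memory_key_padding_mask=encoder_padding_mask,
    sos_id=sos_id,
    eos_id=eos_id,
)
# The result is again an Nbest, so the argmax/index_fsa pattern shown earlier
# can be reused to pick the best path per utterance.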

Collaborator

That's cool.
I'll let Fangjun decide what to do about this.

Yes, I think the code in this PR is clearer than before.

Collaborator

OK, Fangjun can refactor the decoding as he plans first, and I will merge with his code later. It will take more time to make that PR work, I am afraid :(

Nbest.fsa should always have token IDs as labels and
word IDs as aux_labels.
@danpovey (Collaborator)

Why not use the Nbest that's in k2? Just wondering.
Also, this seems to only be a partial replacement of the current code?

@danpovey (Collaborator)

.. of course if it's more convenient to work on the class in Icefall, to avoid compatibility issues, that's fine with me.

@danpovey (Collaborator) commented on Sep 13, 2021 via email

@csukuangfj changed the title from "Refactor decode.py to make it more readable and more modular." to "WIP: Refactor decode.py to make it more readable and more modular." on Sep 13, 2021
levenshtein_graphs = [levenshtein_graph(ids) for ids in word_ids_list]
refs = k2.Fsa.from_fsas(levenshtein_graphs).to(device)

# Now compute the edit distance between hyps and refs
Collaborator (Author)

This shows how to use k2 to compute the Levenshtein distance among multiple sequences at the same time.
(The ideas are from @danpovey.)

If anyone is interested, I wrote a Colab notebook demonstrating it step by step:
https://colab.research.google.com/drive/1hAo5RUb8cMY4qhdn8HASJu9foXrRdo6m?usp=sharing
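
Since k2.levenshtein_graph and k2.levenshtein_alignment are mentioned later in this thread, here is a minimal, self-contained sketch of the batched edit-distance computation, assuming k2 >= 1.9; the toy word IDs and the sign convention for the scores are assumptions, not code from this PR:

import torch
import k2

refs_ids = [[1, 2, 3, 4], [5, 6, 7]]   # reference word IDs (toy data)
hyps_ids = [[1, 2, 4], [5, 8, 7]]      # hypothesis word IDs (toy data)

refs = k2.levenshtein_graph(refs_ids)  # one levenshtein graph per reference
hyps = k2.levenshtein_graph(hyps_ids)  # one levenshtein graph per hypothesis

# Align hypothesis i with reference i.
hyp_to_ref_map = torch.tensor([0, 1], dtype=torch.int32)
alignment = k2.levenshtein_alignment(
    refs=refs, hyps=hyps, hyp_to_ref_map=hyp_to_ref_map, sorted_match_ref=True
)

# Read the edit distances off the tropical total scores, assuming the corrected
# scores equal minus the edit distance (check the k2 docs for the exact convention).
scores = alignment.get_tot_scores(use_double_scores=True, log_semiring=False)
edit_distances = -scores               # expected [1., 1.] for the toy pairs above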

Collaborator

That’s pretty cool, does it mean we could use it directly to optimize WER as a loss function?

Collaborator

There would have to be some kind of expectation involved for that to work. There are other uses for this though.
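
One way to make that expectation concrete (a hedged illustration in the spirit of minimum-WER training, not code from this PR): weight the edit distance of each n-best hypothesis by its posterior probability, which yields a scalar that is differentiable with respect to the model scores.

import torch

def expected_edit_distance(hyp_scores: torch.Tensor,
                           edit_distances: torch.Tensor) -> torch.Tensor:
    # hyp_scores: (N,) model log-scores for the N hypotheses of one utterance.
    # edit_distances: (N,) edit distance of each hypothesis against the reference,
    # treated as constants (e.g. computed with k2 as discussed above).
    posterior = torch.softmax(hyp_scores, dim=0)  # distribution over hypotheses
    return (posterior * edit_distances).sum()     # differentiable w.r.t. hyp_scores

# Toy usage: three hypotheses with model scores and edit distances.
scores = torch.tensor([2.0, 1.0, 0.5], requires_grad=True)
dists = torch.tensor([0.0, 1.0, 2.0])
loss = expected_edit_distance(scores, dists)
loss.backward()  # gradients flow back into the hypothesis scores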

@csukuangfj changed the title from "WIP: Refactor decode.py to make it more readable and more modular." to "Refactor decode.py to make it more readable and more modular." on Sep 18, 2021
@csukuangfj (Collaborator, Author)

This pull request is ready for review.
(Will use k2.levenshtein_alignment and k2.levenshtein_graph once k2-fsa/k2#828 gets merged.)

There is less code duplication, and I believe it is more readable.

It produces the same WER as the previous code. It would be great if someone could test it with an existing model.

I can move the Nbest class to k2 if you think that looks nicer, @danpovey.
(The Nbest code here has several new functions, e.g., compute_am_scores and compute_lm_scores.)

@danpovey (Collaborator)

Cool!
Regarding moving that code to k2, we should definitely do that at some point, but I'm OK with keeping it here temporarily if that would reduce compatibility issues in the short term. We can also duplicate it both here and in k2, and later remove it from k2.

@csukuangfj added the ready label and removed it on Sep 20, 2021
@csukuangfj removed the ready label on Sep 20, 2021
@csukuangfj (Collaborator, Author)

Ok, I am merging it.

icefall now requires k2 >= v1.9.
