
MASS-supNMT: Args say word_mask_keep_rand but code is word_mask_rand_keep #139

Open
JasonVann opened this issue Apr 30, 2020 · 0 comments
Hi there, I think I found a typo, or else I'm confused. In MASS-supNMT, in xmasked_seq2seq.py, word_mask_keep_rand defaults to '0.1, 0.1, 0.8'.

This mask_keep_rand value is then assigned to args as "pred_probs" on line 119, which is in turn passed to MaskedLanguagePairDataset.

In masked_language_pair_dataset.py, random_word() uses pred_probs like this:

```python
cands = [self.vocab.mask_index, np.random.randint(self.vocab.nspecial, len(self.vocab)), w]
prob = torch.multinomial(self.pred_probs, 1, replacement=True)
```

From this code we see that pred_probs is actually interpreted in the order mask, random, keep, not mask, keep, rand. This implies the default '0.1, 0.1, 0.8' is not mask 0.1, keep 0.1, rand 0.8, but actually mask 0.1, rand 0.1, keep 0.8, which is quite different from what the variable name says.

In short, word_mask_keep_rand should have been named as word_mask_rand_keep.
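To make the mismatch concrete, here is a minimal, self-contained sketch of how that multinomial index maps onto the candidate list. The vocab-related constants (MASK_IDX, NSPECIAL, VOCAB_SIZE) are stand-ins for illustration, not the real values from the MASS codebase:

```python
import numpy as np
import torch

# Hypothetical stand-ins for self.vocab.mask_index, self.vocab.nspecial,
# and len(self.vocab) in masked_language_pair_dataset.py.
MASK_IDX = 0
NSPECIAL = 4
VOCAB_SIZE = 100

# The defaults from --word_mask_keep_rand: '0.1, 0.1, 0.8'
pred_probs = torch.tensor([0.1, 0.1, 0.8])

def random_word(w, pred_probs):
    # cands[0] = mask token, cands[1] = random word, cands[2] = keep w.
    # So pred_probs[0] is the mask prob, pred_probs[1] the random prob,
    # and pred_probs[2] the keep prob -- i.e. mask, rand, keep.
    cands = [MASK_IDX, np.random.randint(NSPECIAL, VOCAB_SIZE), w]
    prob = torch.multinomial(pred_probs, 1, replacement=True)
    return cands[prob.item()]
```

With the default '0.1, 0.1, 0.8', roughly 80% of sampled calls return the original word w, confirming that the third entry acts as the keep probability rather than the rand probability.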

@JasonVann JasonVann changed the title MASS-supNMT: Args says word_mask_keep_rand but code is word_mask_rand_keep MASS-supNMT: Args say word_mask_keep_rand but code is word_mask_rand_keep Apr 30, 2020