
Add warning and info message for beta and gamma parameters #33192

Open · wants to merge 14 commits into main

Conversation

zly-idleness

@zly-idleness zly-idleness commented Aug 29, 2024

What does this PR do?

This PR adds a warning message to notify users that the gamma and beta parameters are renamed internally, both at initialisation and when loading weights.
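For context, the renaming this warning surfaces can be sketched as follows. This is an illustrative stand-in for the library's internal `_fix_key` behaviour (transformers maps checkpoint keys containing "gamma" to "weight" and "beta" to "bias"); the function name and sample keys here are made up:

```python
def fix_key(key: str) -> str:
    """Illustrative sketch of the internal gamma/beta renaming."""
    if "beta" in key:
        return key.replace("beta", "bias")
    if "gamma" in key:
        return key.replace("gamma", "weight")
    return key

loaded_keys = [
    "tts.dvae.decoder.decoder_block.0.gamma",  # hypothetical checkpoint keys
    "encoder.layer_norm.beta",
    "lm_head.weight",
]

# Keys whose names change are the ones the new warning would report.
renamed = {k: fix_key(k) for k in loaded_keys if fix_key(k) != k}
```

Before this PR, the renaming happened silently, and the renamed parameters were misleadingly reported as "newly initialized", as in the log above.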

before:

(vqa-audio) (base) jeeves@notebook-5064-cadence:~/ChatTTS/rhapsodyaudio$ python tmp_save_pretrain.py 
bash: warning: setlocale: LC_ALL: cannot change locale (en_US.UTF-8)
Loading checkpoint shards: 100%|██████████████████████████████████████████████████████████████████████████████| 8/8 [00:16<00:00,  2.02s/it]
Some weights of Qwen2AudioForConditionalChatTTS were not initialized from the model checkpoint at /mnt/data/user/tc_agi/luoyuanZ/ChatTTS_default and are newly initialized: 

['tts.dvae.decoder.decoder_block.0.gamma', 'tts.dvae.decoder.decoder_block.1.gamma', 'tts.dvae.decoder.decoder_block.10.gamma', 'tts.dvae.decoder.decoder_block.11.gamma', 'tts.dvae.decoder.decoder_block.2.gamma', 'tts.dvae.decoder.decoder_block.3.gamma', 'tts.dvae.decoder.decoder_block.4.gamma', 'tts.dvae.decoder.decoder_block.5.gamma', 'tts.dvae.decoder.decoder_block.6.gamma', 'tts.dvae.decoder.decoder_block.7.gamma', 'tts.dvae.decoder.decoder_block.8.gamma', 'tts.dvae.decoder.decoder_block.9.gamma', 'tts.dvae.encoder.decoder_block.0.gamma', 'tts.dvae.encoder.decoder_block.1.gamma', 'tts.dvae.encoder.decoder_block.10.gamma', 'tts.dvae.encoder.decoder_block.11.gamma', 'tts.dvae.encoder.decoder_block.2.gamma', 'tts.dvae.encoder.decoder_block.3.gamma', 'tts.dvae.encoder.decoder_block.4.gamma', 'tts.dvae.encoder.decoder_block.5.gamma', 'tts.dvae.encoder.decoder_block.6.gamma', 'tts.dvae.encoder.decoder_block.7.gamma', 'tts.dvae.encoder.decoder_block.8.gamma', 'tts.dvae.encoder.decoder_block.9.gamma']

after:

(vqa-audio) (base) jeeves@notebook-5064-cadence:~/ChatTTS/rhapsodyaudio$ python tmp_save_pretrain.py 
bash: warning: setlocale: LC_ALL: cannot change locale (en_US.UTF-8)
This model <class 'muffin.model.infer_qwen2tts.Qwen2AudioForConditionalChatTTS'>contains parameters that have been renamed internally (a few are listed below but more are present in the model):

Loading checkpoint shards: 100%|██████████████████████████████████████████████████████████████████████████████| 8/8 [00:14<00:00,  1.81s/it]

Fixes #29554 and #33190.

Before submitting

  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@zly-idleness
Author

cc @amyeroberts

Collaborator

@amyeroberts left a comment

Thanks for adding this @zly-idleness!

At the moment, this involves iterating over the loaded keys twice - once on L4116 and again on L4146. It also involves a redefinition of original_loaded_keys. We should rework the code to remove the duplicated logic.

@zly-idleness
Author

Thanks for adding this @zly-idleness!

At the moment, this involves iterating over the loaded keys twice - once on L4116 and again on L4146. It also involves a redefinition of original_loaded_keys. We should rework the code to remove the duplicated logic.

Thank you for pointing that out. I'll refactor the code to eliminate redundancy ☺️

old_keys.append(key)
new_keys.append(new_key)
renamed_keys[key] = new_key
loaded_keys[i] = new_key
Collaborator

This doesn't work - it will modify both loaded_keys and original_loaded_keys:

In [1]: li_0 = list(range(10))

In [2]: li_1 = li_0

In [3]: for i in range(10):
   ...:     if i % 2:
   ...:         li_0[i] = -1
   ...:

In [4]: li_0
Out[4]: [0, -1, 2, -1, 4, -1, 6, -1, 8, -1]

In [5]: li_1
Out[5]: [0, -1, 2, -1, 4, -1, 6, -1, 8, -1]

Author

Thank you! You are totally right, I forgot to add a copy() to ensure that original_loaded_keys remains unaltered.
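The fix described above can be sketched as follows - a minimal, self-contained illustration of the aliasing issue and the copy() fix; the key names are hypothetical:

```python
loaded_keys = ["block.0.gamma", "norm.beta", "head.weight"]

# Without .copy(), both names would point at the same list object,
# and the in-place renames below would also show up in original_loaded_keys
# (exactly the aliasing pitfall demonstrated in the IPython session above).
original_loaded_keys = loaded_keys.copy()

for i, key in enumerate(loaded_keys):
    if "gamma" in key:
        loaded_keys[i] = key.replace("gamma", "weight")
    elif "beta" in key:
        loaded_keys[i] = key.replace("beta", "bias")
```

A shallow copy is enough here because the elements are immutable strings; rebinding `loaded_keys[i]` never touches the copied list.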

Collaborator

@amyeroberts left a comment

Thanks for iterating!

At the moment, the logic is being forced around the old renaming code. There are also changes to the olmoe model in this PR, which should be removed.

Collaborator

There shouldn't be any changes to this file in the PR

Comment on lines +4149 to +4154
warning_msg += 'contains parameters that have been renamed internally ("gamma" and "beta" in parameters) (a few are listed below but more are present in the model):\n'
logger.warning(warning_msg)
for old_key, new_key in renamed_keys.items():
    warning_msg += f"* `{old_key}` -> `{new_key}`\n"
warning_msg += "If you are using a model from the Hub, consider submitting a PR to adjust these weights and help future users."
logger.info(warning_msg)
Collaborator

This message isn't consistent - at the moment all of the renamed keys will be listed. Like in the other places where this logic is added, let's just take the first renames.

return None

for i, key in enumerate(loaded_keys):
    new_key = _fix_key(key)
Collaborator

Rather than trying to reuse the existing _fix_key logic, it would be better to rework this to:

  • Not use copy
  • Only add the first renaming case for gamma and beta respectively
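A minimal sketch of what such a rework might look like, assuming the gamma -> weight / beta -> bias mapping and recording only the first rename of each kind. The function name and structure are hypothetical, not the PR's actual code:

```python
def collect_first_renames(loaded_keys):
    # Record at most one example rename each for "gamma" and "beta" -
    # that is all the warning message needs to show, so there is no
    # second pass over the keys and no copied list.
    renamed_keys = {}
    for key in loaded_keys:
        if "gamma" in key and "gamma" not in renamed_keys:
            renamed_keys["gamma"] = (key, key.replace("gamma", "weight"))
        elif "beta" in key and "beta" not in renamed_keys:
            renamed_keys["beta"] = (key, key.replace("beta", "bias"))
        if len(renamed_keys) == 2:
            break  # one example of each is enough
    return renamed_keys
```

This keeps the warning short regardless of how many gamma/beta parameters the checkpoint contains, addressing both review points at once.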


Successfully merging this pull request may close these issues.

Can't load models with a gamma or beta parameter