

Textual Inversion's dlr0 diverges to inf #222

Closed
and1011 opened this issue Feb 22, 2023 · 2 comments

Comments


and1011 commented Feb 22, 2023

I hadn't tested the algorithm before D-Adaptation was implemented, but now, when I try to train a textual inversion, dlr0 starts low (which I assume is correct) and then increases exponentially with each step until it hits the float limit.
After the value diverges to inf, the embedding becomes unusable.
I may have misunderstood what could be going wrong, but in theory shouldn't the dlr converge to some value?
Is anyone else having this problem, or could anyone help?
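To illustrate the failure mode described above (not kohya_ss code): an LR estimate that is multiplied by a growth factor each step overflows a double to `inf` after only ~1000 steps, even from a tiny starting value. The `diverging_dlr` function and its parameters below are hypothetical, purely to show how fast unchecked exponential growth reaches the float limit:

```python
import math

def diverging_dlr(d0=1e-6, growth=2.0, max_steps=10_000):
    """Hypothetical model of the reported behavior: an LR estimate that
    grows exponentially each step. Returns the step at which it overflows
    to float inf, or None if it never does within max_steps."""
    d = d0
    for step in range(max_steps):
        if math.isinf(d):
            return step
        d *= growth  # unchecked exponential growth -> divergence
    return None

step = diverging_dlr()
print(step)  # a double overflows past ~1.8e308, so this happens long before 10_000 steps
```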

@bmaltais (Owner)

This is usually because the LR is too high... You could try to lower it a bit.
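One generic way to act on this advice in training code (illustrative only; this is not what kohya_ss or D-Adaptation does, and `guarded_update` and `d_max` are hypothetical names) is to clamp the adapted LR estimate to a finite ceiling instead of letting it run off to inf:

```python
import math

def guarded_update(d, growth, d_max=1e3):
    """Hypothetical guard: apply the growth step, but clamp the LR
    estimate to a finite ceiling if it overflows or exceeds it."""
    d_new = d * growth
    if not math.isfinite(d_new) or d_new > d_max:
        return d_max  # clamp instead of diverging
    return d_new

d = 1e-6
for _ in range(2000):
    d = guarded_update(d, 2.0)
print(d)  # stays at the ceiling instead of overflowing to inf
```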


and1011 commented Feb 23, 2023

I saw that you fixed it in commit 34ab844.

Now that I've updated, the bug is fixed and I can use AdamW normally.
Thank you.

@and1011 and1011 closed this as completed Feb 24, 2023
Cauldrath pushed a commit to Cauldrath/kohya_ss that referenced this issue Apr 5, 2023
fix training instability issue, add metadata