May version finetuning degrades models capabilities. April version works nice. #786

kgonia · 2023-05-12T11:01:26Z

So I have April version on my local machine and on AWS AMI

commit 1e645deebec16ea5c3740ed0903ebbe21f273e1b
Merge: af0df34 cd8f85d
Author: bmaltais <[email protected]>
Date:   Sat Apr 1 16:52:37 2023 -0400

    Merge pull request #512 from bmaltais/dev

    v21.4.0

with this version I'm able to finetune with specific topic without degrading model capabilities.

Recently I updated code base to newer version

 commit 959ef18e2dec8cbdefa3571bbb84cd0d01ff9890 (origin/master, origin/HEAD)
Merge: fa41e40 eb2ca44
Author: bmaltais <[email protected]>
Date:   Wed May 3 13:22:06 2023 -0400

    Merge pull request #706 from chiragjn/patch-1

    Add `--no-cache-dir` to reduce image size

and it's degrades model capability.

I'm using same data and same configuration but got totally different result.

Below example of test generation from same prompt during training after first epoch

new code base

old code base

Was there any changes that can affect finetuning?

The text was updated successfully, but these errors were encountered:

bmaltais · 2023-05-12T16:42:11Z

Indeed Kohya constantly update his code... Probably a change that is having a negative side effect. Really hard to pinpoint what it might be....

rafstahelin · 2023-05-23T09:34:14Z

So I have April version on my local machine and on AWS AMI
commit 1e645deebec16ea5c3740ed0903ebbe21f273e1b

Merge: af0df34 cd8f85d

Author: bmaltais <[email protected]>

Date:   Sat Apr 1 16:52:37 2023 -0400



    Merge pull request #512 from bmaltais/dev



    v21.4.0
with this version I'm able to finetune with specific topic without degrading model capabilities.

Recently I updated code base to newer version
 commit 959ef18e2dec8cbdefa3571bbb84cd0d01ff9890 (origin/master, origin/HEAD)

Merge: fa41e40 eb2ca44

Author: bmaltais <[email protected]>

Date:   Wed May 3 13:22:06 2023 -0400



    Merge pull request #706 from chiragjn/patch-1



    Add `--no-cache-dir` to reduce image size
and it's degrades model capability.

I'm using same data and same configuration but got totally different result.

Below example of test generation from same prompt during training after first epoch

new code base

old code base

Was there any changes that can affect finetuning?

My Kohya can't even fine tune. Lora is fine, but get error message when I start fine tune message.
Should I be on 3.10.6 Python rather than 3.10.9?

kgonia · 2023-05-23T21:05:57Z

I'm using 3.10.10 curently

Support JPEG XL

bmaltais pushed a commit that referenced this issue Sep 3, 2023

Merge pull request #786 from Isotr0py/jxl

497051c

Support JPEG XL

bmaltais mentioned this issue Sep 3, 2023

v21.8.9 #1476

Merged

bmaltais closed this as completed Jan 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

May version finetuning degrades models capabilities. April version works nice. #786

May version finetuning degrades models capabilities. April version works nice. #786

kgonia commented May 12, 2023

bmaltais commented May 12, 2023

rafstahelin commented May 23, 2023

kgonia commented May 23, 2023

May version finetuning degrades models capabilities. April version works nice. #786

May version finetuning degrades models capabilities. April version works nice. #786

Comments

kgonia commented May 12, 2023

bmaltais commented May 12, 2023

rafstahelin commented May 23, 2023

kgonia commented May 23, 2023