
update create_model_card to properly save peft details when using Trainer with PEFT #27754

Merged: 3 commits merged into main from smangrul/fix-peft-model-card on Dec 7, 2023

Conversation

pacman100 (Contributor)

What does this PR do?

  1. When using Trainer with PEFT, model.save_pretrained in PEFT adds PEFT-specific details to the existing model card, or creates a new model card containing them. However, trainer.create_model_card is called after the model is saved and overwrites the entire file, thereby nullifying the PEFT-related additions such as the library name, the quantization used, and the PEFT version.
  2. This PR fixes the above issue by adding the PEFT details back to the model card. This helps with better organization and understanding of PEFT usage on the Hub (see the sketch after this list).
  3. Example of a repo produced with this PR: https://huggingface.co/smangrul/mistral_lora_clm_with_added_tokens
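
To make the mechanism concrete, here is a minimal sketch of the idea behind the fix, not the actual diff in this PR: after Trainer.create_model_card has overwritten README.md, the PEFT-specific metadata is added back. The helper name, its parameters, and the use of huggingface_hub.ModelCard are assumptions made for illustration.

```python
# Illustrative sketch only: the helper name and its arguments are assumptions,
# not the exact code merged in this PR.
import os

from huggingface_hub import ModelCard


def readd_peft_details(output_dir: str, base_model_name: str, peft_version: str) -> None:
    """Re-add the PEFT metadata that an overwritten README.md lost."""
    card_path = os.path.join(output_dir, "README.md")
    card = ModelCard.load(card_path)

    # Mark the repo as a PEFT artifact so the Hub can categorize it correctly.
    card.data.library_name = "peft"
    card.data.base_model = base_model_name

    # Restore the framework-version note that the overwrite removed.
    card.text += f"\n### Framework versions\n\n- PEFT {peft_version}\n"
    card.save(card_path)
```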

@muellerzr (Contributor) left a comment


Solution makes sense to me! Thanks!

@BenjaminBossan (Member) left a comment


I'm not very knowledgeable about Trainer, so I can't comment on the whole integration.

From my understanding, this seems a little bit hacky to me, because we first create the model card with PEFT, then completely overwrite it with Trainer, then re-add PEFT-related content. It feels like the proper solution would be for the Trainer to update the model card if it already exists. But I understand that this would be more work, so I'd be okay with this more hacky approach.

src/transformers/trainer.py: review comments (outdated, resolved)
@pacman100 (Contributor, Author)

> From my understanding, this seems a little bit hacky to me, because we first create the model card with PEFT, then completely overwrite it with Trainer, then re-add PEFT-related content. It feels like the proper solution would be for the Trainer to update the model card if it already exists. But I understand that this would be more work, so I'd be okay with this more hacky approach.

Yes, the Trainer should ideally update the README if it already exists, but it rewrites everything from the TrainerState, so appending or updating would be trickier. Open to ideas on making this cleaner.
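
For reference, here is a rough sketch of the "update instead of overwrite" alternative discussed above. This is not what Trainer or this PR does; merge_model_card and trainer_card_content are hypothetical names, and letting the existing (PEFT-written) metadata win is just one possible merge strategy.

```python
# Hypothetical illustration of merging into an existing model card instead of
# overwriting it; NOT the approach taken in Trainer or in this PR.
import os

from huggingface_hub import ModelCard, ModelCardData


def merge_model_card(output_dir: str, trainer_card_content: str) -> None:
    card_path = os.path.join(output_dir, "README.md")
    trainer_card = ModelCard(trainer_card_content)

    if not os.path.exists(card_path):
        trainer_card.save(card_path)
        return

    existing = ModelCard.load(card_path)
    # Let the PEFT-written metadata (library_name, base_model, ...) win, and only
    # fill in the keys that the Trainer would add on top of them.
    merged = {**trainer_card.data.to_dict(), **existing.data.to_dict()}
    existing.data = ModelCardData(**merged)
    # Append the Trainer-generated sections (hyperparameters, results, ...) rather
    # than discarding the PEFT-written body.
    existing.text = existing.text.rstrip() + "\n\n" + trainer_card.text.lstrip()
    existing.save(card_path)
```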

@BenjaminBossan (Member) left a comment


Thanks for the update.

> Yes, the Trainer should ideally update the README if it already exists, but it rewrites everything from the TrainerState, so appending or updating would be trickier. Open to ideas on making this cleaner.

I'm not very knowledgeable about Trainer, so I don't have any suggestion for a better solution.

@muellerzr muellerzr requested review from ArthurZucker and removed request for amyeroberts December 4, 2023 18:49
@muellerzr (Contributor)

I can't really see a better solution currently either, so what we have here works for now.

@pacman100 pacman100 merged commit 5324bf9 into main Dec 7, 2023
21 checks passed
@pacman100 pacman100 deleted the smangrul/fix-peft-model-card branch December 7, 2023 12:06
nevikw39 pushed a commit to NTHU-ML-2023-team19/transformers that referenced this pull request Dec 7, 2023
update `create_model_card` to properly save peft details when using Trainer with PEFT (huggingface#27754)

* update `create_model_card` to properly save peft details when using Trainer with PEFT

* nit

* Apply suggestions from code review

Co-authored-by: Benjamin Bossan <[email protected]>

---------

Co-authored-by: Benjamin Bossan <[email protected]>