Stable Diffusion XL fine-tuning with Dreambooth & Lora: how to structure local dataset for fine-tuning with ROI #6890

SylwiaNowakowska · 2024-02-07T08:26:09Z

SylwiaNowakowska
Feb 7, 2024

Hi!

I am using https://github.com/huggingface/diffusers/blob/main/examples/dreambooth/train_dreambooth_lora_sdxl.py
I am having a custom dataset locally, which is structured in a following way:

data/metadata.jsonl
data/train/01.png
data/train/02.png
....

With the metadata.json file as follows:
{"file_name": "train/01.png", "text": "In the style of MAMM, a view X of a healthy patient"}
{"file_name": "train/02.png", "text": "In the style of MAMM, a view Y of a healthy patient"}
....

The fine-tuning works.

In the next step, I would like to take the fine-tuned model and train on images which show abnormality.
I have ROI coordinates for the abnormality.

I would be grateful, if you could tell me how shall I structure the dataset for this task?

Shall I add the ROI coordinates to a prompt in similar way as here: https://huggingface.co/docs/datasets/image_dataset (section: "object detection")?
If yes, should it look sth like this?
{"file_name": "train/01.png", "text": "In the style of MAMM, a view X, "objects" : {"bbox": [[302.0, 109.0, 73.0, 52.0]], "categories": [abnormality A]}}}

Or to add the ROIs as a segmentation map in form of an image? (How to structure the folder then and the metadata.jsonl?)

I would appreciate your help.

Answered by lhoestq

Feb 14, 2024

I think this script is to train a dreambooth model only using text and image - it doesn't seem to support bounding boxes or categories.

Maybe you can simply filter your images to keep the ones with abnormalities and use this filtered dataset instead ? You could even mention the abnormality name in the texts.

View full answer

sayakpaul · 2024-02-07T10:20:55Z

sayakpaul
Feb 7, 2024
Collaborator

Cc: @lhoestq from the datasets team.

7 replies

SylwiaNowakowska Feb 14, 2024
Author

I would appreciate your input on that.

lhoestq Feb 14, 2024

The format you mentioned sounds good to me :) but it also depends on the training script you want to use, since it may expect the data in a specific format.

Which training script are you using for object detection ?

SylwiaNowakowska Feb 14, 2024
Author

Thx for your feedback.
I am using: I am using https://github.com/huggingface/diffusers/blob/main/examples/dreambooth/train_dreambooth_lora_sdxl.py

Could you let me know, if the ROI should be better in the metadata.jsonl or saved as masks in .png format?

lhoestq Feb 14, 2024

I think this script is to train a dreambooth model only using text and image - it doesn't seem to support bounding boxes or categories.

Maybe you can simply filter your images to keep the ones with abnormalities and use this filtered dataset instead ? You could even mention the abnormality name in the texts.

Answer selected by SylwiaNowakowska

SylwiaNowakowska Feb 15, 2024
Author

Thank you for the answer.
Do you plan to extend the script to support training with ROI?

sayakpaul Feb 15, 2024
Collaborator

No, we don't plan on doing that because our scripts are not meant to be exhaustive.

SylwiaNowakowska Feb 15, 2024
Author

Thx!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stable Diffusion XL fine-tuning with Dreambooth & Lora: how to structure local dataset for fine-tuning with ROI #6890

{{title}}

Replies: 1 comment 7 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

Select a reply

Stable Diffusion XL fine-tuning with Dreambooth & Lora: how to structure local dataset for fine-tuning with ROI #6890

SylwiaNowakowska Feb 7, 2024

Replies: 1 comment · 7 replies

sayakpaul Feb 7, 2024 Collaborator

SylwiaNowakowska Feb 14, 2024 Author

lhoestq Feb 14, 2024

SylwiaNowakowska Feb 14, 2024 Author

lhoestq Feb 14, 2024

SylwiaNowakowska Feb 15, 2024 Author

sayakpaul Feb 15, 2024 Collaborator

SylwiaNowakowska Feb 15, 2024 Author

SylwiaNowakowska
Feb 7, 2024

Replies: 1 comment 7 replies

sayakpaul
Feb 7, 2024
Collaborator

SylwiaNowakowska Feb 14, 2024
Author

SylwiaNowakowska Feb 14, 2024
Author

SylwiaNowakowska Feb 15, 2024
Author

sayakpaul Feb 15, 2024
Collaborator

SylwiaNowakowska Feb 15, 2024
Author