Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TFRecords size increase #92

Open
LGTiscar opened this issue Apr 21, 2021 · 1 comment
Open

TFRecords size increase #92

LGTiscar opened this issue Apr 21, 2021 · 1 comment

Comments

@LGTiscar
Copy link

Hello, I have a folder with 5,4k JPG files, and I want to train a StyleGAN 2 ADA model with a custom dataset made from this images. Problem is, I am working on a google collab notebook with only 40 GB free disk space (I'm from Spain and Google Collab PRO isnt avaible here yet...). After some preprocessing to crop and resize them to 1024x1024, the folder size is 400MB aprox.

The problem is, when decoding the JPEG to the TFRecords to create the dataset using:

!python dataset_tool.py create_from_images ./datasets/{dataset_name} {dataset_path}

The size of the TFRecord dataset grows up to more 40 GB , and the disk space in the free google colab notebook saturates.

Is there anyway to fix this or to change the dataset_tool.py code to encode the bytes in the TFRecord as JPG does?

@Tabriz12
Copy link

You can transorm images to tfrecords on your local machine, put them on drive, and access from there.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants