Datasets

This page lists some of the standard datasets used to train models for published papers. They will not suit every use case, but they can serve as a reference for building your own dataset.

To build your own dataset, you can fetch images from sources such as Kaggle, Flickr (API), Pixiv, Danbooru, or any other site, depending on the purpose of the model you want to train.

Super-Resolution

Several standard SR datasets are listed below.

| Name | Datasets | Short Description | Google Drive | Other |
| --- | --- | --- | --- | --- |
| Classical SR Training | T91 | 91 images for training | Google Drive | Other |
| | BSDS200 | A subset (train) of BSD500 for training | | |
| | General100 | 100 images for training | | |
| Classical SR Testing | Set5 | Set5 test dataset | | |
| | Set14 | Set14 test dataset | | |
| | BSDS100 | A subset (test) of BSD500 for testing | | |
| | urban100 | 100 building images for testing (regular structures) | | |
| | manga109 | 109 images of Japanese manga for testing | | |
| | historical | 10 grayscale LR images without ground truth | | |
| 2K Resolution | DIV2K | Proposed in NTIRE17 (800 training and 100 validation images) | Google Drive | Other |
| | Flickr2K | 2650 2K images from Flickr for training | | |
| | DF2K | A merged training dataset of DIV2K and Flickr2K | | |
| OST (Outdoor Scenes) | OST Training | Images from 7 categories with rich textures | Google Drive | Other |
| OST (Outdoor Scenes) | OST300 | 300 test images of outdoor scenes | Google Drive | Other |
| PIRM | PIRM | PIRM self-val, val, test datasets | Google Drive | Other |
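
DF2K is described above as a merge of DIV2K and Flickr2K. If you want to combine training sets of your own in the same way, the following is a minimal sketch of one way to do it. The `merge_datasets` helper and the folder names in the usage note are hypothetical and are not part of any official dataset tooling; the prefixing scheme is just an assumption to keep file names from colliding.

```python
import shutil
from pathlib import Path

# Assumption: only these extensions count as images.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg"}


def merge_datasets(sources, destination):
    """Copy images from several folders into one destination folder.

    Each copied file is prefixed with the name of its source folder so that
    identically named files (e.g. 0001.png) do not overwrite each other.
    Returns the number of images copied.
    """
    destination = Path(destination)
    destination.mkdir(parents=True, exist_ok=True)
    copied = 0
    for source in map(Path, sources):
        for image in sorted(source.iterdir()):
            # Skip sub-folders and non-image files.
            if not image.is_file() or image.suffix.lower() not in IMAGE_EXTENSIONS:
                continue
            shutil.copy2(image, destination / f"{source.name}_{image.name}")
            copied += 1
    return copied
```

For example, `merge_datasets(["DIV2K_train_HR", "Flickr2K_HR"], "DF2K")` would gather both training sets into one folder, assuming those (hypothetical) folder names match your local layout.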

Image to image translation

| Name | Datasets | Short Description | Google Drive |
| --- | --- | --- | --- |
| Pix2pix (paired¹) | facades | 400 images from the CMP Facades dataset. | Server |
| | maps | 1096 training images scraped from Google Maps. | |
| | edges2shoes | 50k training images from UT Zappos50K dataset. Edges are computed with HED edge detector + post-processing. | |
| | edges2handbags | 137K Amazon Handbag images from iGAN project. Edges are computed with HED edge detector + post-processing. | |
| | night2day (day2night) | Around 20K natural scene images from Transient Attributes dataset. | |
| CycleGAN (unpaired) | facades | 400 images from the CMP Facades dataset. | Server |
| | maps | 1096 training images scraped from Google Maps. | |
| | horse2zebra | 939 horse images and 1177 zebra images downloaded from ImageNet using the keywords "wild horse" and "zebra". | |
| | apple2orange | 996 apple images and 1020 orange images downloaded from ImageNet using the keywords "apple" and "navel orange". | |
| | summer2winter_yosemite | 1273 summer Yosemite images and 854 winter Yosemite images downloaded using the Flickr API. | |
| | monet2photo, vangogh2photo, ukiyoe2photo, cezanne2photo | The art images were downloaded from WikiArt. The real photos were downloaded from Flickr using the combination of the tags "landscape" and "landscapephotography". The training set size of each class is Monet: 1074, Cezanne: 584, Van Gogh: 401, Ukiyo-e: 1433, Photographs: 6853. | |
| | iphone2dslr_flower | Both classes of images were downloaded from Flickr. The training set size of each class is iPhone: 1813, DSLR: 3316. | |
| Cityscapes | Cityscapes | 2975 images from the Cityscapes dataset.² | Server |

¹ To use these datasets, set the `dataroot_AB` path and the `outputs: AB` option so that the image pairs are split automatically during training. To swap A and B, you can also set the optional `direction: BtoA` option.

² The Cityscapes dataset requires preprocessing before use; see this script.
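
To illustrate what "automatically split" means for the paired datasets in footnote ¹, here is a minimal sketch. It assumes the pix2pix-style layout in which each file stores A and B side by side (A on the left, B on the right). The `split_ab` helper is hypothetical and is not the training code's actual loader; its `direction` argument only mirrors the `direction: BtoA` option for illustration. The image is represented as a plain list of pixel rows to keep the example dependency-free.

```python
def split_ab(image, direction="AtoB"):
    """Split a side-by-side A|B image into an (input, target) pair.

    `image` is a list of pixel rows; the left half of every row is taken
    as A and the right half as B (an assumption about the file layout).
    """
    width = len(image[0])
    if width % 2 != 0:
        raise ValueError("paired image width must be even")
    half = width // 2
    a = [row[:half] for row in image]
    b = [row[half:] for row in image]
    if direction == "AtoB":
        return a, b
    if direction == "BtoA":
        # Swap roles: B becomes the input and A the target.
        return b, a
    raise ValueError(f"unknown direction: {direction!r}")


# A tiny 2x4 "image": columns 0-1 belong to A, columns 2-3 to B.
paired = [[1, 2, 3, 4],
          [5, 6, 7, 8]]
model_input, target = split_ab(paired, direction="BtoA")
```

With `direction="BtoA"`, `model_input` holds the right-hand half and `target` the left-hand half, which is the swap the optional setting describes.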

Video

| Name | Datasets | Short Description | Google Drive |
| --- | --- | --- | --- |
| REDS | Multiple | REDS video dataset. Includes deblurring, super-resolution and high FPS datasets. | Server |
