In this page are some of the standard datasets used to train models for publication of papers. They are not meant to work for every case, but can serve as a reference for how to build your own dataset.
In order to build your own dataset, you can fetch images from places like Kaggle, Flickr (API), Pixiv, Danbooru, or any other, according to the purpose of the model you want to train.
Several standard SR datasets are listed below.
Name | Datasets | Short Description | Google Drive | Other |
---|---|---|---|---|
Classical SR Training | T91 | 91 images for training | Google Drive | Other |
BSDS200 | A subset (train) of BSD500 for training | |||
General100 | 100 images for training | |||
Classical SR Testing | Set5 | Set5 test dataset | ||
Set14 | Set14 test dataset | |||
BSDS100 | A subset (test) of BSD500 for testing | |||
urban100 | 100 building images for testing (regular structures) | |||
manga109 | 109 images of Japanese manga for testing | |||
historical | 10 gray LR images without the ground-truth | |||
2K Resolution | DIV2K | proposed in NTIRE17(800 train and 100 validation) | Google Drive | Other |
Flickr2K | 2650 2K images from Flickr for training | |||
DF2K | A merged training dataset of DIV2K and Flickr2K | |||
OST (Outdoor Scenes) | OST Training | 7 categories images with rich textures | Google Drive | Other |
OST300 | 300 test images of outdoor scences | |||
PIRM | PIRM | PIRM self-val, val, test datasets | Google Drive | Other |
Name | Datasets | Short Description | Google Drive |
---|---|---|---|
Pix2pix (paired*1) | facades | 400 images from the CMP Facades dataset. | Server |
maps | 1096 training images scraped from Google Maps. | ||
edges2shoes | 50k training images from UT Zappos50K dataset. Edges are computed with HED edge detector + post-processing. | ||
edges2handbags | 137K Amazon Handbag images from iGAN project. Edges are computed with HED edge detector + post-processing. | ||
night2day (day2night) | around 20K natural scene images from Transient Attributes dataset. | ||
CycleGAN (unpaired) | facades | 400 images from the CMP Facades dataset. | Server |
maps | 1096 training images scraped from Google Maps. | ||
horse2zebra | 939 horse images and 1177 zebra images downloaded from ImageNet using keywords wild horse and zebra | ||
apple2orange | 996 apple images and 1020 orange images downloaded from ImageNet using keywords apple and navel orange | ||
summer2winter_yosemite | 1273 summer Yosemite images and 854 winter Yosemite images were downloaded using Flickr API. | ||
monet2photo, vangogh2photo, ukiyoe2photo, cezanne2photo | The art images were downloaded from WikiArt. The real photos are downloaded from Flickr using the combination of the tags landscape and landscapephotography. The training set size of each class is Monet:1074, Cezanne:584, Van Gogh:401, Ukiyo-e:1433, Photographs:6853. | ||
iphone2dslr_flower | both classes of images were downloaded from Flickr. The training set size of each class is iPhone:1813, DSLR:3316. | ||
Cityscapes | Cityscapes | 2975 images from the Cityscapes dataset.*2 | Server |
1 In order to use these datasets, you need to use the dataroot_AB
path and outputs: AB
options so the image pairs will be automatically split during training. In order to switch A with B, you can also use the optional direction: BtoA
option.
2 Cityscapes dataset requires processing before using, see this script.
Name | Datasets | Short Description | Google Drive |
---|---|---|---|
REDS | Multiple | REDS video dataset. Includes deblurring, super-resolution and high FPS datasets. | Server |