Skip to content

Latest commit

 

History

History
193 lines (171 loc) · 7.55 KB

datasets.md

File metadata and controls

193 lines (171 loc) · 7.55 KB

Datasets

In this page are some of the standard datasets used to train models for publication of papers. They are not meant to work for every case, but can serve as a reference for how to build your own dataset.

In order to build your own dataset, you can fetch images from places like Kaggle, Flickr (API), Pixiv, Danbooru, or any other, according to the purpose of the model you want to train.

Super-Resolution

Several standard SR datasets are listed below.

Name Datasets Short Description Google Drive Other
Classical SR Training T91 91 images for training Google Drive Other
BSDS200 A subset (train) of BSD500 for training
General100 100 images for training
Classical SR Testing Set5 Set5 test dataset
Set14 Set14 test dataset
BSDS100 A subset (test) of BSD500 for testing
urban100 100 building images for testing (regular structures)
manga109 109 images of Japanese manga for testing
historical 10 gray LR images without the ground-truth
2K Resolution DIV2K proposed in NTIRE17(800 train and 100 validation) Google Drive Other
Flickr2K 2650 2K images from Flickr for training
DF2K A merged training dataset of DIV2K and Flickr2K
OST (Outdoor Scenes) OST Training 7 categories images with rich textures Google Drive Other
OST300 300 test images of outdoor scences
PIRM PIRM PIRM self-val, val, test datasets Google Drive Other

Image to image translation

Name Datasets Short Description Google Drive
Pix2pix (paired*1) facades 400 images from the CMP Facades dataset. Server
maps 1096 training images scraped from Google Maps.
edges2shoes 50k training images from UT Zappos50K dataset. Edges are computed with HED edge detector + post-processing.
edges2handbags 137K Amazon Handbag images from iGAN project. Edges are computed with HED edge detector + post-processing.
night2day (day2night) around 20K natural scene images from Transient Attributes dataset.
CycleGAN (unpaired) facades 400 images from the CMP Facades dataset. Server
maps 1096 training images scraped from Google Maps.
horse2zebra 939 horse images and 1177 zebra images downloaded from ImageNet using keywords wild horse and zebra
apple2orange 996 apple images and 1020 orange images downloaded from ImageNet using keywords apple and navel orange
summer2winter_yosemite 1273 summer Yosemite images and 854 winter Yosemite images were downloaded using Flickr API.
monet2photo, vangogh2photo, ukiyoe2photo, cezanne2photo The art images were downloaded from WikiArt. The real photos are downloaded from Flickr using the combination of the tags landscape and landscapephotography. The training set size of each class is Monet:1074, Cezanne:584, Van Gogh:401, Ukiyo-e:1433, Photographs:6853.
iphone2dslr_flower both classes of images were downloaded from Flickr. The training set size of each class is iPhone:1813, DSLR:3316.
Cityscapes Cityscapes 2975 images from the Cityscapes dataset.*2 Server

1 In order to use these datasets, you need to use the dataroot_AB path and outputs: AB options so the image pairs will be automatically split during training. In order to switch A with B, you can also use the optional direction: BtoA option.

2 Cityscapes dataset requires processing before using, see this script.

Video

Name Datasets Short Description Google Drive
REDS Multiple REDS video dataset. Includes deblurring, super-resolution and high FPS datasets. Server