A PyTorch implementation of LDAST
Paper | Project | Slide | Video
LDAST is an implementation of
"Language-Driven Artistic Style Transfer"
Tsu-Jui Fu, Xin Eric Wang, and William Yang Wang
in European Conference on Computer Vision (ECCV) 2022
Language visual artist (LVA) extracts content structures from the content image C and visual patterns from the style instruction X to perform LDAST. LVA adopts a patch-wise style discriminator D to connect the extracted visual semantics to patches of the paired style image (PS). Contrastive reasoning (CR) compares the contrastive pairs C1-X1, C2-X1, and C2-X2 of content image and style instruction.
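To make the pairing in the overview above concrete, here is a minimal sketch that builds the three contrastive pairs C1-X1, C2-X1, and C2-X2 and reports what any two of them have in common. It only illustrates the bookkeeping implied by the pair names. The helper names `contrastive_pairs` and `shared_factor` are hypothetical and do not come from this repository, and the sketch makes no claim about how the CR loss itself is computed.

```python
def contrastive_pairs(contents, instructions):
    """Build the three (content, instruction) pairs named in the overview.

    Hypothetical helper: expects exactly two content images and two
    style instructions, given as any comparable identifiers.
    """
    c1, c2 = contents
    x1, x2 = instructions
    return [(c1, x1), (c2, x1), (c2, x2)]


def shared_factor(pair_a, pair_b):
    """Report which element two (content, instruction) pairs share."""
    same_content = pair_a[0] == pair_b[0]
    same_instruction = pair_a[1] == pair_b[1]
    if same_content and same_instruction:
        return "both"
    if same_content:
        return "content"
    if same_instruction:
        return "instruction"
    return "none"


pairs = contrastive_pairs(("C1", "C2"), ("X1", "X2"))
# C1-X1 and C2-X1 share the instruction; C2-X1 and C2-X2 share the content.
print(shared_factor(pairs[0], pairs[1]))
print(shared_factor(pairs[1], pairs[2]))
```

Under this reading, C1-X1 and C2-X1 differ only in content, C2-X1 and C2-X2 differ only in instruction, and C1-X1 and C2-X2 share neither.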
This code is implemented under Python 3.8, PyTorch 1.7, and Torchvision 0.8.
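A quick way to compare a local setup against the versions listed above is sketched below. It is not part of this repository. The helpers `version_tuple`, `meets_minimum`, and `report_environment` are hypothetical names, and treating the listed versions as minimums is an assumption, since the README only states what the code was implemented under.

```python
import re
import sys


def version_tuple(version):
    """Turn '1.7.1+cu110' into (1, 7, 1), ignoring any local build suffix."""
    core = version.split("+", 1)[0]
    return tuple(int(part) for part in re.findall(r"\d+", core))


def meets_minimum(installed, required):
    """True if the installed version is at least the required one."""
    return version_tuple(installed) >= version_tuple(required)


def report_environment():
    """Collect installed versions; packages that cannot be imported map to None."""
    found = {"python": "{}.{}".format(*sys.version_info[:2])}
    for name in ("torch", "torchvision"):
        try:
            found[name] = __import__(name).__version__
        except ImportError:
            found[name] = None
    return found


# Versions taken from the README line above, assumed here to be minimums.
REQUIRED = {"python": "3.8", "torch": "1.7", "torchvision": "0.8"}
```

Calling `report_environment()` and passing each entry that is not `None` to `meets_minimum` together with the matching `REQUIRED` value flags an older install. Newer releases pass this check, but nothing here confirms that the code runs unchanged on them.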
The dataset includes content images and visual attribute instructions (DTD).
Please visit WikiArt and here for emotional effect instructions (ArtEmis).
Put sanet.pt in ./_ckpt and dtd.pkl in ./_data.
python train_lva.py
python train_ctr.py
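Before launching the training scripts above, it can help to confirm that the required files are where the scripts expect them. The sketch below is not part of this repository. The file locations come from the instructions above, while the helper `missing_files` and the constant `TRAINING_FILES` are hypothetical names introduced for illustration.

```python
from pathlib import Path

# Locations taken from the instructions above.
TRAINING_FILES = ("_ckpt/sanet.pt", "_data/dtd.pkl")


def missing_files(root, relative_paths=TRAINING_FILES):
    """Return the expected files that do not exist under root, in order."""
    root = Path(root)
    return [path for path in relative_paths if not (root / path).is_file()]
```

Running `missing_files(".")` from the repository root returns an empty list when both files are in place. The same helper can cover other steps by passing a different tuple of paths.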
Put clva_dtd.pt in ./_ckpt.
python inference.py
python gui.py
@inproceedings{fu2022ldast,
author = {Tsu-Jui Fu and Xin Eric Wang and William Yang Wang},
title = {{Language-Driven Artistic Style Transfer}},
booktitle = {European Conference on Computer Vision (ECCV)},
year = {2022}
}
This code is based on SANet