Skip to content

Latest commit

 

History

History
 
 

vit_adapter

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 
 
 

Vision Transformer Adapter for Dense Predictions

Reference

Chen, Zhe, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu, Jifeng Dai, and Yu Qiao. "Vision Transformer Adapter for Dense Predictions." arXiv preprint arXiv:2205.08534 (2022).

Prerequesites

Download the ms_deform_attn.zip (https://paddleseg.bj.bcebos.com/dygraph/customized_ops/ms_deform_attn.zip), and then refer to the readme to install ms_deform_attn lib.

Performance

ADE20K

Model Backbone Resolution Training Iters mIoU mIoU (flip) mIoU (ms+flip) Links
UPerNetViTAdapter ViT-Adapter-Tiny 512x512 160000 41.90% - - model | log | vdl