
Pre-trained weights? #2

Open
hzhang57 opened this issue May 1, 2021 · 11 comments

hzhang57 commented May 1, 2021

Hi, I want to extend the model to my own task. Will you release the pre-trained weights?

danczs (Owner) commented May 1, 2021

Because of our institution's policy, we cannot release the pre-trained models directly. We plan to find some GPU servers outside, but that will take time, so we are afraid the models will not be released soon.

hzhang57 (Author) commented

Hi, I trained a model with the provided code on ImageNet-1k only, using 4x 2080 Ti GPUs (batch size 100), and finally reached around 82.0% top-1. I uploaded this temporary alternative to Google Drive to help anyone who needs it: https://drive.google.com/drive/folders/18GpH1SeVOsq3_2QGTA5Z_3O1UFtKugEu?usp=sharing
I also suspect the model has further potential if pre-trained on ImageNet-21k.
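For anyone picking this checkpoint up, here is a minimal loading sketch. It assumes (unverified here) that the repo registers visformer_small with timm the way the DeiT codebase does, and that the checkpoint stores its weights under a 'model' key:

```python
# Loading sketch for the shared checkpoint. Assumptions (not confirmed by
# this thread): the repo's models.py registers `visformer_small` via timm's
# @register_model, and the file follows the DeiT-style 'model' key layout.
import torch
from timm.models import create_model
import models  # the Visformer repo's model definitions

model = create_model('visformer_small', num_classes=1000)
ckpt = torch.load('visformer_small.pth', map_location='cpu')
state_dict = ckpt.get('model', ckpt)  # also tolerate a raw state dict
model.load_state_dict(state_dict)
model.eval()
```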

danczs (Owner) commented May 11, 2021

That's great! I will add it to the README for anyone who needs it. Thanks a lot!

amaarora commented

> Hi, I trained a model

Assuming this is Visformer-small?

hzhang57 (Author) commented

Yes, I trained the Visformer-small at 224×224: visformer_small

developer0hye (Contributor) commented Oct 1, 2021

@danczs @amaarora

Thanks for sharing your work! I really love the architecture and the experiments you did; they showed me how convolutional layers can improve the performance of transformer models.

I trained Visformer-tiny at 224×224. If I upload the pretrained weights, would that help other researchers? The model's best top-1 accuracy reached 78.3%, and the weights saved at the last epoch reached 78.1%.

danczs (Owner) commented Oct 1, 2021

Thanks for your attention! At the moment only the Visformer-small weights are available, so I think the tiny weights would help someone. By the way, for the tiny model, setting '--drop-path 0.0' can slightly improve performance.
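For context on that flag: "drop path" is stochastic depth, which randomly zeroes a sample's residual branch during training. A minimal sketch of the standard technique, in the style of timm's DropPath (not the repo's exact code):

```python
# Stochastic depth ("drop path"): with probability drop_prob, a sample's
# residual branch is zeroed; survivors are rescaled by 1/keep_prob.
# Passing --drop-path 0.0 disables this regularization entirely, which can
# suit smaller models such as Visformer-tiny.
import torch

def drop_path(x: torch.Tensor, drop_prob: float, training: bool) -> torch.Tensor:
    if drop_prob == 0.0 or not training:
        return x
    keep_prob = 1.0 - drop_prob
    shape = (x.shape[0],) + (1,) * (x.ndim - 1)  # one Bernoulli draw per sample
    mask = x.new_empty(shape).bernoulli_(keep_prob)
    return x * mask / keep_prob
```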

developer0hye (Contributor) commented

@danczs

I trained the model with the command below, with '--drop-path' set to 0.

python -m torch.distributed.launch --nproc_per_node=4 --use_env main.py --model visformer_tiny --batch-size 256 --drop-path 0.0 --data-path /path/to/imagenet --output_dir /path/to/save

Please check my weights and share the link in the README!

https://drive.google.com/file/d/1LLBGbj7-ok1fDvvMCab-Fn5T3cjTzOKB/view?usp=sharing
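For anyone verifying these weights: since the code builds on the DeiT codebase, main.py presumably supports evaluation via --eval and --resume (an assumption; check the actual flags in main.py), e.g.:

python main.py --eval --model visformer_tiny --resume /path/to/visformer_tiny.pth --data-path /path/to/imagenet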

danczs (Owner) commented Oct 1, 2021

I have added it. Thanks for sharing!
In addition, we will slightly update the model in the next few days so that Visformer can use AMP (automatic mixed precision). At that point the old weights may not work well; we will test them and report the results here.
Thanks!

developer0hye (Contributor) commented

@danczs Okay! Thanks!

danczs (Owner) commented Oct 12, 2021

After a slight adjustment to the model, Visformer can now use AMP. During inference, the old weights can use AMP as well; see the README for details.
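For reference, a minimal AMP inference sketch using the standard torch.cuda.amp.autocast API; this illustrates generic PyTorch mixed precision, not the repo's specific changes, and again assumes the models are timm-registered:

```python
# Mixed-precision inference with PyTorch AMP: autocast runs eligible ops
# in float16 on CUDA. See the repo's README for its own AMP setup.
import torch
from timm.models import create_model
import models  # Visformer repo's model definitions (assumed timm-registered)

model = create_model('visformer_small').cuda().eval()
images = torch.randn(8, 3, 224, 224, device='cuda')  # dummy batch
with torch.no_grad(), torch.cuda.amp.autocast():
    logits = model(images)
print(logits.dtype)  # typically torch.float16 under autocast
```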
