
Hi, very glad to see this new version of Swin-Trans. Could I ask a question about using mixed-precision training? #35

Open
SudongCAI opened this issue Apr 29, 2022 · 2 comments

Comments

@SudongCAI

Dear author,

I notice that the repo recommends using Apex mixed precision for fine-tuning.
How about training from scratch on ImageNet-1K (should I also enable Apex mixed-precision training in that case)?
Previously, I found that mixed precision could degrade results when training CNNs from scratch on ImageNet.
Hence, I wonder whether mixed-precision training was the default setting in the CSWin (or Swin) experiments.
Thank you so much!

@Andy1621

Here I can share some experience from my UniFormer; you can also follow our work to do it~

Mixed precision is a common trick for training Vision Transformers; in our experiments, it does not hurt performance. Both Apex mixed precision and PyTorch's native AMP work!
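For reference, here is a minimal sketch of one training step with PyTorch's native AMP (`torch.cuda.amp`); the `model`, `loader`, `optimizer`, and `criterion` names are placeholders, not from this repo:

```python
import torch

# Assumed placeholders: model, loader, optimizer, criterion.
scaler = torch.cuda.amp.GradScaler()

for images, targets in loader:
    images, targets = images.cuda(), targets.cuda()
    optimizer.zero_grad()
    # The forward pass runs in mixed precision under autocast.
    with torch.cuda.amp.autocast():
        outputs = model(images)
        loss = criterion(outputs, targets)
    # Scale the loss so fp16 gradients do not underflow,
    # then unscale, step the optimizer, and update the scaler.
    scaler.scale(loss).backward()
    scaler.step(optimizer)
    scaler.update()
```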
But sometimes mixed precision will cause the loss to go to NaN, and layer scale is another trick to handle that.
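In case it helps, a minimal layer-scale sketch (learnable per-channel scaling of the residual branch, as in CaiT); the block wiring shown in the comment is illustrative, not CSWin's actual code:

```python
import torch
import torch.nn as nn

class LayerScale(nn.Module):
    """Learnable per-channel scaling for a residual branch (CaiT-style)."""
    def __init__(self, dim, init_value=1e-5):
        super().__init__()
        # A small initial gamma keeps residual updates tiny early in
        # training, which helps avoid fp16 loss blow-ups (NaN loss).
        self.gamma = nn.Parameter(init_value * torch.ones(dim))

    def forward(self, x):
        return self.gamma * x

# Illustrative use inside a Transformer block:
#   x = x + ls1(attn(norm1(x)))
#   x = x + ls2(mlp(norm2(x)))
```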

@SudongCAI
Author

Understood. Thanks so much for your kind reply!
