You do not use NativeScaler and ModelEma for training? #1

wangpichao · 2021-04-28T20:02:26Z

You do not use NativeScaler and ModelEma for training?

danczs · 2021-04-29T03:09:49Z

Yeah, we do not use amp, beacause it causes NAN values sometimes. Thus we removed amp and Scaler in our code. See also in deit issue: facebookresearch/deit#29. Without this acceleration，our code takes more time during training than Deit. But amp can be freely used after training, so the the inference is as efficient.
For ModelEma, we do not use it as in Deit.
Thanks for your question.

m-aliabbas · 2021-04-29T06:23:32Z

Hello, I am getting the error.
NameError: name 'model_ema' is not defined
How can I train the model without it. Please help

danczs · 2021-04-29T06:43:23Z

Hello, I am getting the error.
NameError: name 'model_ema' is not defined
How can I train the model without it. Please help

Sorry about this error. The code of Scaler and model_ema is not cleared completely . We have fixed it by removing the code in main.py(ling 287,288)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

You do not use NativeScaler and ModelEma for training? #1

You do not use NativeScaler and ModelEma for training? #1

wangpichao commented Apr 28, 2021

danczs commented Apr 29, 2021

m-aliabbas commented Apr 29, 2021

danczs commented Apr 29, 2021 •

edited

Loading

You do not use NativeScaler and ModelEma for training? #1

You do not use NativeScaler and ModelEma for training? #1

Comments

wangpichao commented Apr 28, 2021

danczs commented Apr 29, 2021

m-aliabbas commented Apr 29, 2021

danczs commented Apr 29, 2021 • edited Loading

danczs commented Apr 29, 2021 •

edited

Loading