Can I feed more than 3 channels to this model, e.g. (6, 244, 244) instead of RGB data in (3, 244, 244)? #4
Comments
Yes. If you change the input channel setting of the stem layer, the model can handle a different number of channels.
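To illustrate why only the stem needs to change, here is a minimal sketch in plain Python of the shape arithmetic for a single stem convolution. The function name `stem_conv_shapes` and the default hyperparameters (32 output channels, 7x7 kernel, stride 2, padding 3) are assumptions for illustration, not values confirmed in this thread. The input size 244 is taken from the question.

```python
def stem_conv_shapes(in_chans, out_chans=32, kernel=7, stride=2, padding=3, size=244):
    """Return (weight_shape, output_shape) for one stem convolution.

    All defaults are illustrative assumptions, not the repo's confirmed settings.
    """
    # The number of input channels appears only in the weight shape.
    weight_shape = (out_chans, in_chans, kernel, kernel)
    # Standard convolution output-size formula (dilation 1).
    out_size = (size + 2 * padding - kernel) // stride + 1
    # The output shape does not depend on in_chans at all.
    output_shape = (out_chans, out_size, out_size)
    return weight_shape, output_shape


for chans in (3, 6, 16):
    weight_shape, output_shape = stem_conv_shapes(chans)
    print(chans, weight_shape, output_shape)
```

The output shape is the same for 3, 6, or 16 input channels, so the layers after the stem do not need to change. Only the stem's weights differ, which also means 3-channel pretrained weights for that layer cannot be loaded as-is. If you use the timm port, models there generally accept an `in_chans` argument for this purpose, but please check this against the version you have installed.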
Thanks for your swift reply.
This _cfg function should work well. I checked the repo and did not find any differences. I will try training the models from timm's repo and report the results here.
Thank you for the time you put into this. Looking forward to your test results.
We trained timm-visformer-S and got 82.154%, which is similar to our newly reported result of 82.19% (average of 3 runs). So there is basically no difference between the two models.
Hi,
I understand that Visformer accepts input in a 3D format such as (3, 244, 244). Can I feed data with more than 3 channels into the model, e.g. (16, 244, 244)?