Hi, I have a question. The paper says that in the bridge adapter, after the zoom layer and the linear layer, each modality passes through its own MHSA before entering the MHCA. This is what the provided ResNet code does in class Interactor:
ETRIS/model/layers.py
Line 346 in 11cd4db
For ViT, however, the code uses class InteractorT, which performs MHCA directly, without MHSA:
ETRIS/model/layers.py
Line 326 in 11cd4db
Why is there no MHSA in the ViT-based version, as there is in the ResNet one?
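To make the question concrete, here is a minimal PyTorch sketch of the two variants as I understand them. It is not the code from ETRIS/model/layers.py: the class names SelfThenCross and CrossOnly, the residual connections, and the tensor sizes are my own assumptions for illustration, and the zoom and linear layers are left out.

```python
import torch
import torch.nn as nn


class SelfThenCross(nn.Module):
    """Hypothetical ResNet-style variant: MHSA per modality, then MHCA."""

    def __init__(self, dim: int, heads: int):
        super().__init__()
        self.self_v = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.self_t = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_v = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_t = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, vis, txt):
        # Self-attention within each modality (residuals are an assumption).
        vis = vis + self.self_v(vis, vis, vis)[0]
        txt = txt + self.self_t(txt, txt, txt)[0]
        # Cross-attention: each modality queries the other one.
        vis_out = vis + self.cross_v(vis, txt, txt)[0]
        txt_out = txt + self.cross_t(txt, vis, vis)[0]
        return vis_out, txt_out


class CrossOnly(nn.Module):
    """Hypothetical ViT-style variant: MHCA applied directly, no MHSA."""

    def __init__(self, dim: int, heads: int):
        super().__init__()
        self.cross_v = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_t = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, vis, txt):
        vis_out = vis + self.cross_v(vis, txt, txt)[0]
        txt_out = txt + self.cross_t(txt, vis, vis)[0]
        return vis_out, txt_out


# Illustrative sizes only: batch of 2, 49 visual tokens, 17 text tokens, width 64.
vis = torch.randn(2, 49, 64)
txt = torch.randn(2, 17, 64)
for block in (SelfThenCross(64, 4), CrossOnly(64, 4)):
    v, t = block(vis, txt)
    print(type(block).__name__, tuple(v.shape), tuple(t.shape))
```

Both variants keep the token shapes unchanged; the only difference in this sketch is the extra pair of self-attention layers, which is the part I cannot find in InteractorT.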