Question:

Hi, thanks a lot for your great work. I have some questions about top_down_transform. Why do we multiply masked_x by top_down_transform again, given that we have already obtained the selected features?

top_down_transform = prompt[..., None] @ prompt[..., None].transpose(-1, -2)
x = x @ top_down_transform * 5

Answer:

Hi, that's a good question. This step selects the relevant features along the channel dimension, while the previous selection operates on the spatial dimension. We find that selecting along both dimensions enhances the effect of top-down attention.
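To see why this acts as a channel-dimension selection: the outer product prompt @ prompt^T is a rank-1 matrix, so multiplying each token's feature vector by it keeps only the component of that vector along the prompt direction. The sketch below illustrates this with NumPy in place of PyTorch; the shapes, the random data, and the unit-normalization of the prompt are assumptions for clarity, not taken from the repository.

```python
import numpy as np

# Hypothetical shapes: batch of 2, 4 tokens, 3 channels.
rng = np.random.default_rng(0)
x = rng.standard_normal((2, 4, 3))       # token features (after spatial masking)
prompt = rng.standard_normal((2, 3))     # one prompt vector per batch element
# Unit-normalize so p @ p.T is an exact orthogonal projection (assumption).
prompt /= np.linalg.norm(prompt, axis=-1, keepdims=True)

# Rank-1 outer product p p^T, shape (2, 3, 3) -- the channel-selection matrix.
top_down_transform = prompt[..., None] @ prompt[..., None].swapaxes(-1, -2)

# Each token's feature vector is projected onto the prompt direction.
# (The original snippet additionally scales the result by 5.)
y = x @ top_down_transform

# Equivalent closed form: y[b, t] = (x[b, t] . p[b]) * p[b],
# i.e. only the channel component aligned with the prompt survives.
expected = (x @ prompt[..., None]) * prompt[:, None, :]
assert np.allclose(y, expected)
```

So the spatial mask decides which tokens contribute, while this projection decides which channel directions within each token survive, which is why both steps appear even though a selection has already happened.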