Input of Linguistic Branch #17

JJ-res101 · 2021-11-19T07:16:47Z

Thank you for your excellent work! How does the model get the box of a certain phrase in a sentence? Right now it seems to me that the model can't do that. Is that right?

djiajunustc · 2021-11-20T16:36:03Z

The box is not annotated to match a certain phrase, but the whole sentence.

JJ-res101 · 2021-11-21T01:58:22Z

I think the box is annotated to each phrase in Flickr30K Entities data. As said in your paper, "Flickr30K Entities [38] augments the original Flickr30K [58] with short region phrase correspondence annotations."
Maybe the 'Flickr' dataset you use is one box annotation per sentence. Is that right?:)

jianghaojun · 2022-03-25T02:47:13Z

Just as you cited, "Flickr30K Entities [38] augments the original Flickr30K [58] with short region phrase correspondence annotations." which means the original sentences of Flickr30K are splited to short phrases and each phrase is annotated with a bbox. When training on Flickr30K Entities, each sample is consists of a phrase and a bbox.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Input of Linguistic Branch #17

Input of Linguistic Branch #17

JJ-res101 commented Nov 19, 2021

djiajunustc commented Nov 20, 2021

JJ-res101 commented Nov 21, 2021 •

edited

Loading

jianghaojun commented Mar 25, 2022

Input of Linguistic Branch #17

Input of Linguistic Branch #17

Comments

JJ-res101 commented Nov 19, 2021

djiajunustc commented Nov 20, 2021

JJ-res101 commented Nov 21, 2021 • edited Loading

jianghaojun commented Mar 25, 2022

JJ-res101 commented Nov 21, 2021 •

edited

Loading