-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Could you provide the innotation .json file for each dataset (unc, gref, gref_umd)? #3
Comments
The annotations of visual grounding tasks are included in the split files. These files are released by TransVG https://drive.google.com/file/d/1fVwdDvXNbH8uuq_pHD_o5HI7yqeuz0yS/view |
I found that the Flickr30k dataset have 427,193 training pairs. But it suppose to be 29,783 training pairs. Is it some mistake of split file of Flickr30k? |
Sorry, I mean the .json file for pre-training the weights of swin-Transformer. |
You can generate the
|
Dear authors:
Thanks for your work in visual grounding, I also feel interested in this field, and I have a question:
Could you provide the innotation .json file for each dataset (unc, gref, gref_umd)?
The text was updated successfully, but these errors were encountered: