QuerYD_download

This is a tool to allow for easy download of the videos forming the QuerYD dataset.

Installing necessary libraries:

argparse
pytube (install with: python -m pip install git+https://github.com/nficano/pytube)
pathlib
logging
multiprocessing
requests
tqdm
json
zsvision
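
Before running the downloader, it can help to confirm that the listed modules are importable. The snippet below is a minimal sketch and not part of this repository: the helper name missing_modules is hypothetical, and the check only looks for each module on the current Python path without importing it. Several of the listed names (argparse, pathlib, logging, multiprocessing, json) ship with the Python standard library, so in practice only the third-party ones should ever show up as missing.

```python
import importlib.util

# Module names exactly as listed in this README.
REQUIRED = [
    "argparse", "pytube", "pathlib", "logging", "multiprocessing",
    "requests", "tqdm", "json", "zsvision",
]


def missing_modules(names):
    """Return the module names that cannot be located on the current Python path."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules are importable.")
```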

Version 2 of the dataset was added on 8th of April.

Downloading videos
To test the download videos script for the QuerYD dataset simply run:

python download_queryd.py --txt_file relevant-video-links-test.txt --task download_videos

This creates a folder called videos in your current folder and saves the videos there. To download the full set of videos, run:

python download_queryd.py --txt_file relevant-video-links-{version either v1 or v2}.txt --task download_videos

To download only the videos with non-English descriptions, run:

python download_queryd.py --txt_file relevant-non-en-links.txt --task download_videos

To attempt each download more than once, set the --tries flag to the desired number of attempts. The default value is 2. For example:

python download_queryd.py --txt_file relevant-video-links-{version either v1 or v2}.txt --tries 3 --task download_videos

To re-download all files, use the --refresh flag. For example:

python download_queryd.py --txt_file relevant-video-links-{version either v1 or v2}.txt --refresh --task download_videos

Downloading json metadata
To download the .json file containing information about the described videos run:

wget http://www.robots.ox.ac.uk/~vgg/research/collaborative-experts/data/QuerYD/json_metadata-{version either v1 or v2}.zip
mv json_metadata-{version either v1 or v2}.zip json_metadata.zip
unzip json_metadata.zip
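
Once the archive is unzipped, the metadata can be inspected from Python. The following is a rough sketch under stated assumptions: the folder name json_metadata and the helper load_json_files are hypothetical (this README does not say what the archive unpacks to), and nothing is assumed about the structure of the JSON content beyond it being valid JSON.

```python
import json
from pathlib import Path


def load_json_files(folder):
    """Load every .json file found under `folder`, keyed by its relative path."""
    folder = Path(folder)
    if not folder.is_dir():
        return {}
    loaded = {}
    for path in sorted(folder.rglob("*.json")):
        with open(path, "r", encoding="utf-8") as handle:
            loaded[path.relative_to(folder).as_posix()] = json.load(handle)
    return loaded


if __name__ == "__main__":
    # "json_metadata" is a guess at the folder produced by unzip; adjust as needed.
    metadata = load_json_files("json_metadata")
    for name, content in metadata.items():
        print(name, type(content).__name__)
```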

Downloading audio description files
Audio files can be downloaded only after downloading the .json metadata using the previous step.
To download the audio description files corresponding to each video, run:

python download_queryd.py --txt_file relevant-video-links-{version either v1 or v2}.txt --task download_wavs

To download only the non-English audio descriptions, run:

python download_queryd.py --txt_file relevant-non-en-links.txt --task download_wavs

To use more processes, add the --processes flag with the number of CPUs available. For example:

python download_queryd.py --txt_file relevant-video-links-{version either v1 or v2}.txt --task download_wavs --processes 2
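
The commands in the sections above differ only in their flags, so they can also be assembled programmatically, for instance to loop over both dataset versions. This is a minimal sketch: build_command is a hypothetical helper, it uses only the flags shown in this README (--txt_file, --task, --tries, --refresh, --processes), and whether every flag is honoured by every task is an assumption that should be checked against download_queryd.py.

```python
import subprocess
import sys


def build_command(txt_file, task, tries=None, refresh=False, processes=None):
    """Assemble the argument list for download_queryd.py from the flags in this README."""
    cmd = [sys.executable, "download_queryd.py", "--txt_file", txt_file, "--task", task]
    if tries is not None:
        cmd += ["--tries", str(tries)]
    if refresh:
        cmd.append("--refresh")
    if processes is not None:
        cmd += ["--processes", str(processes)]
    return cmd


if __name__ == "__main__":
    version = "v2"  # either "v1" or "v2"
    cmd = build_command(f"relevant-video-links-{version}.txt", "download_wavs", processes=2)
    print(" ".join(cmd))
    # Uncomment to start the download (requires download_queryd.py in the current folder):
    # subprocess.run(cmd, check=True)
```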

Downloading transcribed descriptions and corresponding time-stamps
The transcribed version of the audio descriptions can be downloaded as a pickle file by accessing the following link:

http://www.robots.ox.ac.uk/~vgg/research/collaborative-experts/data/QuerYD/raw_captions_combined_filtered-{version either v1 or v2}.pkl
mv raw_captions_combined_filtered-{version either v1 or v2}.pkl raw_captions_combined_filtered.pkl

The corresponding time-stamps in the same order are provided in this pickle file:

http://www.robots.ox.ac.uk/~vgg/research/collaborative-experts/data/QuerYD/times_captions_combined_filtered-{version either v1 or v2}.pkl
mv times_captions_combined_filtered-{version either v1 or v2}.pkl times_captions_combined_filtered.pkl

The confidence scores of the transcriptions, in the same order as the transcriptions, can be found here:

http://www.robots.ox.ac.uk/~vgg/research/collaborative-experts/data/QuerYD/confidence_captions_combined_filtered-{version either v1 or v2}.pkl
mv confidence_captions_combined_filtered-{version either v1 or v2}.pkl confidence_captions_combined_filtered.pkl
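
Since the three pickle files are described as being in the same order, a quick sanity check after downloading is to load them and compare their sizes. The sketch below makes no assumption about the container type stored in each file beyond it supporting len(); load_pickle and same_length are hypothetical helper names. As with any pickle, only load files obtained from a source you trust.

```python
import pickle
from pathlib import Path


def load_pickle(path):
    """Load a single pickle file from disk."""
    with open(path, "rb") as handle:
        return pickle.load(handle)


def same_length(*objects):
    """Coarse alignment check: True if all objects report the same len()."""
    return len({len(obj) for obj in objects}) <= 1


if __name__ == "__main__":
    # File names after the mv commands given in this README.
    names = [
        "raw_captions_combined_filtered.pkl",
        "times_captions_combined_filtered.pkl",
        "confidence_captions_combined_filtered.pkl",
    ]
    if all(Path(name).is_file() for name in names):
        captions, times, confidence = (load_pickle(name) for name in names)
        print(type(captions).__name__, len(captions))
        print("Same number of entries:", same_length(captions, times, confidence))
    else:
        print("Pickle files not found in the current folder.")
```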

Downloading video features, descriptions and train/val/test splits
To download QuerYD data:

wget http://www.robots.ox.ac.uk/~vgg/research/collaborative-experts/data/features-v2/QuerYD-experts-{version either v1 or v2}.tar.gz
mv QuerYD-experts-{version either v1 or v2}.tar.gz QuerYD-experts.tar.gz

To download QuerYDSegments data (localised clips and their descriptions):

wget http://www.robots.ox.ac.uk/~vgg/research/collaborative-experts/data/features-v2/QuerYDSegments-experts-{version either v1 or v2}.tar.gz
mv QuerYDSegments-experts-{version either v1 or v2}.tar.gz QuerYDSegments-experts.tar.gz
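
The downloaded .tar.gz archives still need to be unpacked. This README does not prescribe how; one option, sketched here with Python's standard tarfile module, is shown below. The helper name extract_archive and the destination folder data are hypothetical, and the layout inside the archives is not assumed.

```python
import tarfile
from pathlib import Path


def extract_archive(archive_path, dest_dir="."):
    """Extract a gzip-compressed tar archive into dest_dir and return the member names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with tarfile.open(archive_path, "r:gz") as archive:
        names = archive.getnames()
        archive.extractall(dest)
    return names


if __name__ == "__main__":
    # File name after the mv command given in this README.
    archive = Path("QuerYD-experts.tar.gz")
    if archive.is_file():
        members = extract_archive(archive, "data")
        print(f"Extracted {len(members)} entries into data/")
    else:
        print("Archive not found in the current folder.")
```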

More information and the scripts used are available at https://github.com/albanie/collaborative-experts#queryd (QuerYD section). For training and testing, follow the steps at https://github.com/albanie/collaborative-experts#evaluating-a-pretrained-model and replace MSVD with QuerYD or QuerYDSegments. Take model names from the retrieval results tables at https://github.com/albanie/collaborative-experts#queryd or https://github.com/albanie/collaborative-experts#querydsegments as appropriate.

References

[1] If you find this code useful, please consider citing:

@misc{oncescu2021queryd,
      title={QuerYD: A video dataset with high-quality text and audio narrations}, 
      author={Andreea-Maria Oncescu and João F. Henriques and Yang Liu and Andrew Zisserman and Samuel Albanie},
      year={2021},
      eprint={2011.11071},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

[2] If you find this code useful or use the extracted features, please consider citing:

@inproceedings{Liu2019a,
  author    = {Liu, Y. and Albanie, S. and Nagrani, A. and Zisserman, A.},
  booktitle = {arXiv preprint arxiv:1907.13487},
  title     = {Use What You Have: Video retrieval using representations from collaborative experts},
  date      = {2019},
}

Acknowledgements

This work is supported by the EPSRC (VisualAI EP/T028572/1 and DTA Studentship), and the Royal Academy of Engineering (DFR05420). We are grateful to Sophia Koepke for her helpful comments and suggestions.

2nd Version of QuerYD retrieval results:

QuerYD

MODEL study on QUERYD

Importance of the model:

Model	Task	R@1	R@5	R@10	R@50	MdR	MnR	Geom	params	Links
HowTo100m S3D	t2v	10.2 (0.0)	24.5 (0.0)	32.7 (0.0)	54.3 (0.0)	38.0 (0.0)	82.1 (0.0)	20.2 (0.0)	1	config, model, log
CE - P,CG	t2v	29.8 (0.3)	63.8 (0.5)	74.9 (0.3)	93.0 (0.1)	3.0 (0.0)	15.1 (0.4)	52.3 (0.3)	57.75M	config, model, log
CE	t2v	31.9 (1.5)	64.5 (1.4)	76.1 (0.8)	93.8 (0.9)	3.0 (0.0)	13.1 (0.8)	53.9 (0.7)	30.82M	config, model, log
HowTo100m S3D	v2t	10.0 (0.0)	25.7 (0.0)	32.3 (0.0)	53.2 (0.0)	42.0 (0.0)	81.7 (0.0)	20.2 (0.0)	1	config, model, log
CE - P,CG	v2t	28.6 (1.1)	62.4 (0.5)	73.6 (0.8)	92.9 (0.1)	3.0 (0.0)	14.7 (0.4)	50.8 (0.7)	57.75M	config, model, log
CE	v2t	32.9 (1.7)	64.9 (1.1)	76.7 (1.1)	93.6 (0.6)	3.0 (0.0)	12.8 (0.6)	54.7 (1.1)	30.82M	config, model, log
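
The Geom column is not defined in this README. The reported values appear consistent with the geometric mean of R@1, R@5 and R@10; treat this as an assumption rather than a documented definition. The sketch below (with a hypothetical helper name) reproduces the figure for the CE t2v row to within rounding. Exact agreement is not expected for every row, since the table entries are rounded and may be averaged over several runs.

```python
def geometric_mean_recall(r1, r5, r10):
    """Geometric mean of three recall values given in percent."""
    return (r1 * r5 * r10) ** (1.0 / 3.0)


if __name__ == "__main__":
    # CE, t2v row of the table above: R@1 = 31.9, R@5 = 64.5, R@10 = 76.1.
    value = geometric_mean_recall(31.9, 64.5, 76.1)
    print(f"Geometric mean of R@1, R@5, R@10: {value:.1f}")
```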

The influence of different pretrained experts on the performance of the CE model trained on QuerYD is studied. The value and cumulative effect of different experts for scene classification (SCENE), ambient sound classification (AUDIO), image classification (OBJECT), and action recognition (ACTION) are presented. PREV. denotes the experts used in the previous row.

Experts	Task	R@1	R@5	R@10	R@50	MdR	MnR	Geom	params	Links
Scene	t2v	17.0 (0.7)	47.0 (2.4)	60.8 (1.1)	85.4 (1.6)	6.3 (0.6)	27.2 (1.1)	36.5 (1.0)	7.51M	config, model, log
Prev. + Audio	t2v	21.4 (0.2)	53.0 (1.3)	63.9 (0.4)	88.6 (0.3)	5.0 (0.0)	22.2 (0.7)	41.7 (0.4)	17.25M	config, model, log
Prev. + Inst	t2v	32.3 (1.6)	65.5 (1.0)	76.7 (0.9)	93.6 (0.2)	3.0 (0.0)	13.0 (0.3)	54.5 (0.3)	24.63M	config, model, log
Prev. + R2P1D	t2v	31.9 (1.5)	64.2 (1.4)	76.1 (0.7)	93.8 (0.9)	3.0 (0.0)	13.1 (0.8)	53.8 (0.7)	30.82M	config, model, log
Scene	v2t	20.3 (0.5)	47.4 (0.8)	60.0 (0.4)	85.5 (1.6)	6.0 (0.0)	27.0 (0.7)	38.7 (0.3)	7.51M	config, model, log
Prev. + Audio	v2t	23.6 (0.9)	52.2 (1.1)	63.9 (1.3)	89.2 (0.3)	5.0 (0.0)	21.6 (0.8)	42.8 (0.5)	17.25M	config, model, log
Prev. + Inst	v2t	32.6 (1.3)	65.6 (0.3)	77.2 (0.3)	93.7 (0.9)	3.0 (0.0)	12.5 (0.1)	54.8 (0.6)	24.63M	config, model, log
Prev. + R2P1D	v2t	32.9 (1.7)	65.0 (1.0)	76.7 (1.0)	93.6 (0.6)	3.0 (0.0)	12.8 (0.6)	54.7 (1.1)	30.82M	config, model, log

Updated results for QuerYDSegments:

MODEL study on QUERYDSEGMENTS

Importance of the model:

Model	Task	R@1	R@5	R@10	R@50	MdR	MnR	Geom	params	Links
HowTo100m S3D	t2v	6.4 (0.0)	13.8 (0.0)	19.9 (0.0)	36.3 (0.0)	131.0 (0.0)	340.0 (0.0)	12.1 (0.0)	1	config, model, log
CE - P,CG	t2v	21.9 (0.5)	44.5 (0.9)	53.5 (0.0)	72.0 (0.8)	8.3 (0.6)	107.7 (2.6)	37.4 (0.4)	57.75M	config, model, log
CE	t2v	19.2 (0.1)	40.8 (1.6)	49.4 (1.0)	68.7 (0.5)	11.0 (1.0)	125.0 (4.7)	33.8 (0.6)	30.82M	config, model, log
HowTo100m S3D	v2t	7.2 (0.0)	15.1 (0.0)	19.5 (0.0)	34.3 (0.0)	160.0 (0.0)	361.4 (0.0)	12.9 (0.0)	1	config, model, log
CE - P,CG	v2t	20.8 (0.7)	43.8 (1.2)	53.2 (0.8)	72.6 (1.1)	8.3 (0.6)	102.6 (2.9)	36.5 (0.7)	57.75M	config, model, log
CE	v2t	18.5 (0.5)	40.1 (0.6)	49.5 (0.2)	69.0 (0.6)	11.0 (0.0)	112.1 (4.3)	33.2 (0.4)	30.82M	config, model, log
