This repo takes the initial step towards leveraging text learning for online action detection without explicit human supervision.
- [2024/10/04] 🔥 OV-OAD Code release.
git clone https://github.com/OV-OAD
cd OV-OAD
conda create -n ovoad python=3.7 -y
conda activate ovoad
Please checkout the following page for more inference & evaluation details.
If you find it useful for your research and applications, please cite related papers/blogs using this BibTeX:
@article{zhao2024ovoad,
title={Does Video-Text Pretraining Help Open-Vocabulary Online Action Detection?},
author={Qingsong Zhao, Yi Wang, Jilan Xu, Yinan He, Zifan Song, Limin Wang, Yu Qiao, Cairong Zhao},
journal={Advances in neural information processing systems},
year={2024}
}