Abstract

This is a PyTorch implementation for Learning Weakly Supervised Audio-Visual Violence Detection in Hyperbolic Space. The original paper is under review.

Abstract

In recent years, the field of weakly supervised audio-visual violence detection has gained substantial attention. The goal of this task is to identify multimodal violent snippets based on the video-level label. Despite the progress made, traditional Euclidean neural networks used in previous methods still face challenges in capturing discriminative representations. To overcome this limitation, we propose HyperVD, a novel framework that learns snippet embeddings in hyperbolic space to improve model discrimination. Specifically, our framework comprises a detour fusion module for multimodal fusion, which effectively alleviates modality inconsistency, and two branches of fully hyperbolic graph convolutional networks, which excavate feature similarities and temporal relationships among snippets in hyperbolic space. Extensive experiments on the XD-Violence benchmark demonstrate that our method outperforms the state-of-the-art methods by a sizable margin.

Training Stage

Download the extracted I3D features of XD-Violence dataset from here.
Change the file paths of make_list.py in the list folder to generate the training and test list.
The hyperparameters are saved in option.py, where we keep default settings as mentioned in our paper.
Run the following command for training:

python main.py

Test Stage

Change the checkpoint path of infer.py.
Run the following command for test:

python infer.py

Acknowledgements

The implementation mainly references the repositories of XDVioDet and fully-hyperbolic-nn . We greatly appreciate their excellent contribution.

If this work is helpful for your research, please consider citing the following BibTeX entry.

@misc{peng2023learning,
      title={Learning Weakly Supervised Audio-Visual Violence Detection in Hyperbolic Space}, 
      author={Xiaogang Peng and Hao Wen and Yikai Luo and Xiao Zhou and Keyang Yu and Yigang Wang and Zizhao Wu},
      year={2023},
      eprint={2305.18797},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.idea		.idea
__pycache__		__pycache__
ckpt		ckpt
layers		layers
list		list
manifolds		manifolds
models		models
utils		utils
README.md		README.md
dataset.py		dataset.py
infer.py		infer.py
main.py		main.py
model.py		model.py
option.py		option.py
preprocess.py		preprocess.py
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Abstract

Training Stage

Test Stage

Acknowledgements

About

Releases

Packages

Languages

xiaogangpeng/HyperVD

Folders and files

Latest commit

History

Repository files navigation

Abstract

Training Stage

Test Stage

Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages