Skip to content

VividLe/Structured-Attention-Composition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Structured Attention Composition for Temporal Action Localization

This repository is the official implementation of SAC. In this work, we tackle the temporal action localization task from the perspective of modality, and precisely assign frame-modality attention. Paper from arXiv or IEEE.

Illustrating the architecture of the proposed SAC

Requirements

To install requirements:

conda env create -n env_name -f environment.yaml

Before running the code, please activate this conda environment.

Data Preparation

a. Download pre-extracted features from baiduyun (code:6666)

Please ensure the data structure is as below

├── data
   └── thumos
       └── val
           ├── video_validation_0000051_02432.npz
           ├── video_validation_0000051_02560.npz
           ├── ...
       └── test
           ├── video_test_0000004_00000.npz
           ├── video_test_0000004_00256.npz
           ├── ...

Train

a. Config

Adjust configurations.

./experiments/thumos/network.yaml

c. Train

cd tools
bash run.sh

Inference

a. You can download pre-trained models from baiduyun (code:6666), and put the weight file in the folder checkpoint.

  • Performance
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 Average
mAP 75.54 73.65 69.09 61.06 51.44 37.10 22.75 8.63 1.43 44.52

b. Test

cd tools
python eval.py

Related Projects

  • BackTAL: Background-Click Supervision for Temporal Action Localization.
  • A2Net: Revisiting Anchor Mechanisms for Temporal Action Localization.

Contact

For any discussions, please contact [email protected].

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages