简体中文 | English
We implemented action recgonition model and action localization model in this repo.
Field | Model | Config | Dataset | Metrics | ACC% | Download |
---|---|---|---|---|---|---|
action recognition | PP-TSM | pptsm.yaml | Kinetics-400 | Top-1 | 76.16 | ppTSM.pdparams |
action recognition | PP-TSN | pptsn.yaml | Kinetics-400 | Top-1 | 75.06 | ppTSN.pdparams |
action recognition | TimeSformer | timesformer.yaml | Kinetics-400 | Top-1 | 77.29 | TimeSformer.pdparams |
action recognition | SlowFast | slowfast_multigrid.yaml | Kinetics-400 | Top-1 | 75.84 | SlowFast.pdparams |
action recognition | TSM | tsm.yaml | Kinetics-400 | Top-1 | 70.86 | TSM.pdparams |
action recognition | TSN | tsn.yaml | Kinetics-400 | Top-1 | 69.81 | TSN.pdparams |
action recognition | AttentionLSTM | attention_lstm.yaml | Youtube-8M | Hit@1 | 89.0 | AttentionLstm.pdparams |
action detection | BMN | bmn.yaml | ActivityNet | AUC | 67.23 | BMN.pdparams |
-
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification, Xiang Long, Chuang Gan, Gerard de Melo, Jiajun Wu, Xiao Liu, Shilei Wen
-
BMN: Boundary-Matching Network for Temporal Action Proposal Generation, Tianwei Lin, Xiao Liu, Xin Li, Errui Ding, Shilei Wen.
-
SlowFast Networks for Video Recognition, Feichtenhofer C, Fan H, Malik J, et al.
-
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition, Limin Wang, Yuanjun Xiong, Zhe Wang, Yu Qiao, Dahua Lin, Xiaoou Tang, Luc Van Gool
-
Temporal Shift Module for Efficient Video Understanding, Ji Lin, Chuang Gan, Song Han
-
Is Space-Time Attention All You Need for Video Understanding? Gedas Bertasius, Heng Wang, Lorenzo Torresani