Fusion Attention for Action Recognition: Integrating Sparse-Dense and Global Attention for Video Action Recognition

Conventional approaches to video action recognition perform global attention over the entire video patches, which may be ineffective due to the temporal redundancy of video frames. Recent works on masked video modeling adopt a high-ratio tube masking and reconstruction strategy as a pre-training met...

Full description

Saved in:

Bibliographic Details
Main Authors:	Hyun-Woo Kim, Yong-Suk Choi
Format:	Article
Language:	English
Published:	MDPI AG 2024-10-01
Series:	Sensors
Subjects:	action recognition fusion attention temporal redundancy
Online Access:	https://www.mdpi.com/1424-8220/24/21/6842
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://www.mdpi.com/1424-8220/24/21/6842

Fusion Attention for Action Recognition: Integrating Sparse-Dense and Global Attention for Video Action Recognition

Internet

Similar Items