Fusion Attention for Action Recognition: Integrating Sparse-Dense and Global Attention for Video Action Recognition

Conventional approaches to video action recognition perform global attention over all video patches, which may be ineffective due to the temporal redundancy of video frames. Recent works on masked video modeling adopt a high-ratio tube masking and reconstruction strategy as a pre-training met...

Bibliographic Details
Main Authors: Hyun-Woo Kim, Yong-Suk Choi
Format: Article
Language: English
Published: MDPI AG 2024-10-01
Series: Sensors
Online Access: https://www.mdpi.com/1424-8220/24/21/6842