The lip reading method based on Adaptive Pooling Attention Transformer

Lip reading technology establishes the mapping relationship between lip movements and specific language characters by processing a series of consecutive lip images, thereby enabling semantic information recognition. Existing methods mainly use recurrent networks for spatiotemporal modeling of sequen...

Bibliographic Details
Main Authors: YAO Yun, HU Zhenxiao, DENG Tao, WANG Xiao
Format: Article
Language: Chinese
Published: POSTS&TELECOM PRESS Co., LTD 2025-01-01
Series: Chinese Journal of Intelligent Science and Technology (智能科学与技术学报)
Online Access: http://www.cjist.com.cn/zh/article/99639204/