SlowFast-TCN: A Deep Learning Approach for Visual Speech Recognition

Visual Speech Recognition (VSR), commonly referred to as automated lip-reading, is an emerging technology that interprets speech by visually analyzing lip movements. A challenge in VSR where visually distinct words produce similar lip movements is known as homopheme problem. Visemes are the basic vi...

Full description

Saved in:

Bibliographic Details
Main Authors:	Nicole Yah Yie Ha, Lee-Yeng Ong, Meng-Chew Leow
Format:	Article
Language:	English
Published:	Ital Publication 2024-12-01
Series:	Emerging Science Journal
Subjects:	visual speech recognition temporal convolutional network lip reading in wild slowfast network homophemes.
Online Access:	https://ijournalse.org/index.php/ESJ/article/view/2670
Tags:	Add Tag No Tags, Be the first to tag this record!

Be the first to leave a comment!

SlowFast-TCN: A Deep Learning Approach for Visual Speech Recognition

Similar Items