SlowFast-TCN: A Deep Learning Approach for Visual Speech Recognition

Visual Speech Recognition (VSR), commonly referred to as automated lip-reading, is an emerging technology that interprets speech by visually analyzing lip movements. A challenge in VSR, known as the homopheme problem, arises when distinct words produce similar lip movements. Visemes are the basic vi...

Bibliographic Details
Main Authors: Nicole Yah Yie Ha, Lee-Yeng Ong, Meng-Chew Leow
Format: Article
Language: English
Published: Ital Publication 2024-12-01
Series: Emerging Science Journal
Online Access: https://ijournalse.org/index.php/ESJ/article/view/2670