SlowFast-TCN: A Deep Learning Approach for Visual Speech Recognition

SlowFast-TCN: A Deep Learning Approach for Visual Speech Recognition

Visual Speech Recognition (VSR), commonly referred to as automated lip-reading, is an emerging technology that interprets speech by visually analyzing lip movements. A challenge in VSR where visually distinct words produce similar lip movements is known as homopheme problem. Visemes are the basic vi...

Full description

Saved in:

Bibliographic Details
Main Authors:	Nicole Yah Yie Ha, Lee-Yeng Ong, Meng-Chew Leow
Format:	Article
Language:	English
Published:	Ital Publication 2024-12-01
Series:	Emerging Science Journal
Subjects:	visual speech recognition temporal convolutional network lip reading in wild slowfast network homophemes.
Online Access:	https://ijournalse.org/index.php/ESJ/article/view/2670
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Classification of Speech Emotion State Based on Feature Map Fusion of TCN and Pretrained CNN Model From Korean Speech Emotion Data
by: A-Hyeon Jo, et al.
Published: (2025-01-01)

JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition
by: Chang Sun, et al.
Published: (2024-01-01)

Analysis for speech and esthetics in sixty consecutive patients with cleft lip and palate
by: Mahantesh S Shiraganvi, et al.
Published: (2011-10-01)

LipBengal: Pioneering Bengali lip-reading dataset for pronunciation mapping through lip gesturesHugging Face
by: Md. Tanvir Rahman Sahed, et al.
Published: (2025-02-01)

Deep Transfer Learning for Lip Reading Based on NASNetMobile Pretrained Model in Wild Dataset
by: Ashwaq Waleed Abdul Ameer, et al.
Published: (2025-01-01)

Lip-Reading Classification of Turkish Digits Using Ensemble Learning Architecture Based on 3DCNN
by: Ali Erbey, et al.
Published: (2025-01-01)

Synchronous Analysis of Speech Production and Lips Movement to Detect Parkinson’s Disease Using Deep Learning Methods
by: Cristian David Ríos-Urrego, et al.
Published: (2024-12-01)

Urdu Lip Reading Systems for Digits in Controlled and Uncontrolled Environment
by: Amanullah Baloch, et al.
Published: (2025-01-01)

A Helium Speech Correction Method Based on Generative Adversarial Networks
by: Hongjun Li, et al.
Published: (2024-11-01)

End-to-end speech enhancement based on ultra-lightweight channel attention
by: Yi HONG, et al.
Published: (2021-09-01)

Speech etiquette situation of apology in classes of Russian as a foreign language (on the example of Asya Petrova's story «Sorry, Fool»)
by: Zhanna K. Gaponova, et al.
Published: (2023-08-01)

Objective assessment of communication speech interference effect based on feature fusion
by: Yun LIN, et al.
Published: (2023-03-01)

Deep Learning Based Automatic Speech Recognition for Turkish
by: Hamit Erdem, et al.
Published: (2020-08-01)

CNN Based Automatic Speech Recognition: A Comparative Study
by: Hilal Ilgaz, et al.
Published: (2024-08-01)

Long-term evaluation of Elmelegy’s technique of local muscle transposition for the functional restoration of large upper or lower lip defects
by: Nader G. Elmelegy
Published: (2024-12-01)

EDUCATIONAL TRAINING SIMULATOR FOR MONITORING READING TECHNIQUE AND SPEED BASED ON SPEECH-TO-TEXT (STT) METHODS
by: Olesia Barkovska, et al.
Published: (2024-10-01)

Examining the Neural Markers of Speech Rhythm in Silent Reading Using Mass Univariate Statistics of EEG Single Trials
by: Stephanie J. Powell, et al.
Published: (2024-11-01)

TCN-GRU Based on Attention Mechanism for Solar Irradiance Prediction
by: Zhi Rao, et al.
Published: (2024-11-01)

Evaluation of two surgical methods for maxillary hypoplasia in patients with cleft lip and palate
by: GUO Songsong, ZHANG Zhenxing, ZHANG Ping, JIANG Chenghui, CHENG Jie, JIANG Hongbing, LI Sheng
Published: (2024-12-01)

End-to-End Mandarin Speech Reconstruction Based on Ultrasound Tongue Images Using Deep Learning
by: Fengji Li, et al.
Published: (2025-01-01)

Research development and forecast of automatic speech recognition technologies
by: Haikun WANG, et al.
Published: (2018-02-01)

Speech Recognition System Based on Machine Learning in Persian Language
by: Shahed Mohammadi, et al.
Published: (2022-06-01)

Self-supervised speech representation learning based on positive sample comparison and masking reconstruction
by: Wenlin ZHANG, et al.
Published: (2022-07-01)

Perception of vocoded speech in domestic dogs
by: Amritha Mallikarjun, et al.
Published: (2024-04-01)

DPSNN: spiking neural network for low-latency streaming speech enhancement
by: Tao Sun, et al.
Published: (2024-01-01)

Automatic Speech Recognition: A survey of deep learning techniques and approaches
by: Harsh Ahlawat, et al.
Published: (2025-12-01)

Synthetic speech detection method using texture feature based on circumferential local ternary pattern
by: Honghui JIN, et al.
Published: (2023-06-01)

ZeST: A Zero-Resourced Speech-to-Speech Translation Approach for Unknown, Unpaired, and Untranscribed Languages
by: Luan Thanh Nguyen, et al.
Published: (2025-01-01)

Ethnic influence of the lips and incisor in the diagnosis and orthodontic planning
by: Ana Thais BAGATINI, et al.
Published: (2024-12-01)

A part of speech tagger for Yoruba language text using deep neural network
by: Chukwuemeka Christian Ugwu, et al.
Published: (2024-12-01)

Incisors inclination in relation to lip parameters: a CBCT study
by: Majd ELIAS, et al.
Published: (2024-12-01)

Lithium Battery Life Prediction for Electric Vehicles Using Enhanced TCN and SVN Quantile Regression
by: Xinyue Li, et al.
Published: (2025-01-01)

A study of morphological patterns of lip prints in relation to gender of North Indian population
by: Shalini Gupta, et al.
Published: (2011-10-01)

Methodology in Research on Speech Patterns
by: Nicole Wong
Published: (2017-07-01)

Methodology in Research on Speech Patterns
by: Nicole Wong
Published: (2017-07-01)

Methodology in Research on Speech Patterns
by: Nicole Wong
Published: (2017-07-01)

Methodology in Research on Speech Patterns
by: Nicole Wong
Published: (2017-07-01)

Methodology in Research on Speech Patterns
by: Nicole Wong
Published: (2017-07-01)

Methodology in Research on Speech Patterns
by: Nicole Wong
Published: (2017-07-01)

Methodology in Research on Speech Patterns
by: Nicole Wong
Published: (2017-07-01)