Feature-level fusion of face and speech based multimodal biometric attendance system with liveness detection

The rapid growth of deep learning and the Internet of Things has spurred the need for touchless biometric systems in areas where cleanliness and non-intrusive user interaction are critical. In order to authenticate seamlessly, traditional biometric methods such as fingerprint, hand, etc. recognition...

Full description

Saved in:

Bibliographic Details
Main Authors:	Khushboo Jha, Aruna Jain, Sumit Srivastava
Format:	Article
Language:	English
Published:	AIP Publishing LLC 2024-11-01
Series:	AIP Advances
Online Access:	http://dx.doi.org/10.1063/5.0234430
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1846141168511877120
author	Khushboo Jha Aruna Jain Sumit Srivastava
author_facet	Khushboo Jha Aruna Jain Sumit Srivastava
author_sort	Khushboo Jha
collection	DOAJ
description	The rapid growth of deep learning and the Internet of Things has spurred the need for touchless biometric systems in areas where cleanliness and non-intrusive user interaction are critical. In order to authenticate seamlessly, traditional biometric methods such as fingerprint, hand, etc. recognition require physical contact and, therefore, risk hygiene issues, hence making face and speaker verification more viable alternatives. A robust Multimodal Biometric Attendance System (MBAS) is needed due to the vulnerabilities and limitations of single modality systems. In this research, we introduce MBAS using feature-level fusion of speech data with face data, combining the best of both worlds. The textural features based on a person’s facial appearance are integrated with dynamic speech information for liveness detection, followed by dimensionality reduction using linear discriminant analysis, and then incorporated into a Bi-LSTM classifier. Therefore, for better security, accuracy, and anti-spoofing attacks, this approach is proposed in addition to increasing accuracy as well as enhancing security against spoofing attacks. Two publicly available datasets, DeepfakeTIMIT and AVSpeech, are extensively explored to evaluate different fusion strategies, classifier types, and standard performance metrics. The proposed system outperformed other cutting-edge biometric based systems by exhibiting a 97.51% high accuracy rate with a precision of 99.10% and an equal error rate of 2.48%. These findings affirm the effectiveness and possible real-world applications of the MBAS concept, along with its enhancement ensuring safety. Furthermore, this study underscores the importance of incorporating advanced liveness detection into secure contactless biometrics solutions for modern attendance management in various industries that encompass both face and voice modalities.
format	Article
id	doaj-art-16cc963f22ad4e9a81e75e64cb455c46
institution	Kabale University
issn	2158-3226
language	English
publishDate	2024-11-01
publisher	AIP Publishing LLC
record_format	Article
series	AIP Advances
spelling	doaj-art-16cc963f22ad4e9a81e75e64cb455c462024-12-04T16:59:16ZengAIP Publishing LLCAIP Advances2158-32262024-11-011411115007115007-1010.1063/5.0234430Feature-level fusion of face and speech based multimodal biometric attendance system with liveness detectionKhushboo Jha0Aruna Jain1Sumit Srivastava2Department of Computer Science and Engineering, Birla Institute of Technology, Ranchi 835215, IndiaDepartment of Computer Science and Engineering, Birla Institute of Technology, Ranchi 835215, IndiaDepartment of Computer Science and Engineering, Birla Institute of Technology, Ranchi 835215, IndiaThe rapid growth of deep learning and the Internet of Things has spurred the need for touchless biometric systems in areas where cleanliness and non-intrusive user interaction are critical. In order to authenticate seamlessly, traditional biometric methods such as fingerprint, hand, etc. recognition require physical contact and, therefore, risk hygiene issues, hence making face and speaker verification more viable alternatives. A robust Multimodal Biometric Attendance System (MBAS) is needed due to the vulnerabilities and limitations of single modality systems. In this research, we introduce MBAS using feature-level fusion of speech data with face data, combining the best of both worlds. The textural features based on a person’s facial appearance are integrated with dynamic speech information for liveness detection, followed by dimensionality reduction using linear discriminant analysis, and then incorporated into a Bi-LSTM classifier. Therefore, for better security, accuracy, and anti-spoofing attacks, this approach is proposed in addition to increasing accuracy as well as enhancing security against spoofing attacks. Two publicly available datasets, DeepfakeTIMIT and AVSpeech, are extensively explored to evaluate different fusion strategies, classifier types, and standard performance metrics. The proposed system outperformed other cutting-edge biometric based systems by exhibiting a 97.51% high accuracy rate with a precision of 99.10% and an equal error rate of 2.48%. These findings affirm the effectiveness and possible real-world applications of the MBAS concept, along with its enhancement ensuring safety. Furthermore, this study underscores the importance of incorporating advanced liveness detection into secure contactless biometrics solutions for modern attendance management in various industries that encompass both face and voice modalities.http://dx.doi.org/10.1063/5.0234430
spellingShingle	Khushboo Jha Aruna Jain Sumit Srivastava Feature-level fusion of face and speech based multimodal biometric attendance system with liveness detection AIP Advances
title	Feature-level fusion of face and speech based multimodal biometric attendance system with liveness detection
title_full	Feature-level fusion of face and speech based multimodal biometric attendance system with liveness detection
title_fullStr	Feature-level fusion of face and speech based multimodal biometric attendance system with liveness detection
title_full_unstemmed	Feature-level fusion of face and speech based multimodal biometric attendance system with liveness detection
title_short	Feature-level fusion of face and speech based multimodal biometric attendance system with liveness detection
title_sort	feature level fusion of face and speech based multimodal biometric attendance system with liveness detection
url	http://dx.doi.org/10.1063/5.0234430
work_keys_str_mv	AT khushboojha featurelevelfusionoffaceandspeechbasedmultimodalbiometricattendancesystemwithlivenessdetection AT arunajain featurelevelfusionoffaceandspeechbasedmultimodalbiometricattendancesystemwithlivenessdetection AT sumitsrivastava featurelevelfusionoffaceandspeechbasedmultimodalbiometricattendancesystemwithlivenessdetection

Feature-level fusion of face and speech based multimodal biometric attendance system with liveness detection

Similar Items