Deep Learning Approach for Detecting Audio Deepfakes in Urdu

The application of Deep Learning algorithms for speech synthesis has led to the widespread generation of Audio Deepfakes, which are becoming a real threat to voice interfaces. Audio Deepfakes are fake audio recordings that are difficult to differentiate from real recordings because they use AI-gene...

Full description

Saved in:

Bibliographic Details
Main Author:	Marium Mateen
Format:	Article
Language:	English
Published:	National University of Modern Languages (NUML), Islamabad 2023-07-01
Series:	NUML International Journal of Engineering and Computing
Subjects:	Audio DeepFakes Machine Learning Deep Learning LSTM RNN
Online Access:	https://nijec.numl.edu.pk/index.php/nijec/article/view/37
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1846129739259969536
author	Marium Mateen
author_facet	Marium Mateen
author_sort	Marium Mateen
collection	DOAJ
description	The application of Deep Learning algorithms for speech synthesis has led to the widespread generation of Audio Deepfakes, which are becoming a real threat to voice interfaces. Audio Deepfakes are fake audio recordings that are difficult to differentiate from real recordings because they use AI-generated techniques to clone human voices. When prominent speakers, celebrities, and politicians are the target of Audio Deepfakes, this technology can potentially undermine public confidence and trustworthiness. Therefore, it is essential to create efficient methods and technologies to identify and stop the creation and spread of Audio Deepfakes. To address the critical issue of the widespread circulation of fake audio and to detect Audio Deepfakes, several Machine Learning and Deep Learning techniques have been developed recently. However, most such solutions have been trained using datasets in English, raising concerns about their accuracy and trustworthiness for other languages. The primary objective of this research is to develop a Deep Learning model for detecting Audio Deepfakes in Urdu. For this purpose, the deep learning model is trained using an Urdu language audio dataset. The dataset was prepared using both real and fake audio. The real Urdu audio clips were initially collected from which Deep fakes were generated with the help of the Real-Time Voice Cloning tool. Our Deep Learning-based model is built to detect Audio Deep fakes produced using imitation and synthesis techniques. According to the findings of our study, when tested and evaluated, our model obtained an accuracy of 91 percent.
format	Article
id	doaj-art-382f8debbb554eb0a56dd33f876e04cf
institution	Kabale University
issn	2788-9629 2791-3465
language	English
publishDate	2023-07-01
publisher	National University of Modern Languages (NUML), Islamabad
record_format	Article
series	NUML International Journal of Engineering and Computing
spelling	doaj-art-382f8debbb554eb0a56dd33f876e04cf2024-12-09T20:53:55ZengNational University of Modern Languages (NUML), IslamabadNUML International Journal of Engineering and Computing2788-96292791-34652023-07-012110.52015/nijec.v2i1.37Deep Learning Approach for Detecting Audio Deepfakes in Urdu Marium Mateen0Jinnah University for Women - Karachi, Pakistan The application of Deep Learning algorithms for speech synthesis has led to the widespread generation of Audio Deepfakes, which are becoming a real threat to voice interfaces. Audio Deepfakes are fake audio recordings that are difficult to differentiate from real recordings because they use AI-generated techniques to clone human voices. When prominent speakers, celebrities, and politicians are the target of Audio Deepfakes, this technology can potentially undermine public confidence and trustworthiness. Therefore, it is essential to create efficient methods and technologies to identify and stop the creation and spread of Audio Deepfakes. To address the critical issue of the widespread circulation of fake audio and to detect Audio Deepfakes, several Machine Learning and Deep Learning techniques have been developed recently. However, most such solutions have been trained using datasets in English, raising concerns about their accuracy and trustworthiness for other languages. The primary objective of this research is to develop a Deep Learning model for detecting Audio Deepfakes in Urdu. For this purpose, the deep learning model is trained using an Urdu language audio dataset. The dataset was prepared using both real and fake audio. The real Urdu audio clips were initially collected from which Deep fakes were generated with the help of the Real-Time Voice Cloning tool. Our Deep Learning-based model is built to detect Audio Deep fakes produced using imitation and synthesis techniques. According to the findings of our study, when tested and evaluated, our model obtained an accuracy of 91 percent. https://nijec.numl.edu.pk/index.php/nijec/article/view/37Audio DeepFakesMachine LearningDeep LearningLSTMRNN
spellingShingle	Marium Mateen Deep Learning Approach for Detecting Audio Deepfakes in Urdu NUML International Journal of Engineering and Computing Audio DeepFakes Machine Learning Deep Learning LSTM RNN
title	Deep Learning Approach for Detecting Audio Deepfakes in Urdu
title_full	Deep Learning Approach for Detecting Audio Deepfakes in Urdu
title_fullStr	Deep Learning Approach for Detecting Audio Deepfakes in Urdu
title_full_unstemmed	Deep Learning Approach for Detecting Audio Deepfakes in Urdu
title_short	Deep Learning Approach for Detecting Audio Deepfakes in Urdu
title_sort	deep learning approach for detecting audio deepfakes in urdu
topic	Audio DeepFakes Machine Learning Deep Learning LSTM RNN
url	https://nijec.numl.edu.pk/index.php/nijec/article/view/37
work_keys_str_mv	AT mariummateen deeplearningapproachfordetectingaudiodeepfakesinurdu

Deep Learning Approach for Detecting Audio Deepfakes in Urdu

Similar Items