Deep Learning Approach for Detecting Audio Deepfakes in Urdu

The application of Deep Learning algorithms for speech synthesis has led to the widespread generation of Audio Deepfakes, which are becoming a real threat to voice interfaces. Audio Deepfakes are fake audio recordings that are difficult to differentiate from real recordings because they use AI-gene...

Full description

Saved in:
Bibliographic Details
Main Author: Marium Mateen
Format: Article
Language:English
Published: National University of Modern Languages (NUML), Islamabad 2023-07-01
Series:NUML International Journal of Engineering and Computing
Subjects:
Online Access:https://nijec.numl.edu.pk/index.php/nijec/article/view/37
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846129739259969536
author Marium Mateen
author_facet Marium Mateen
author_sort Marium Mateen
collection DOAJ
description The application of Deep Learning algorithms for speech synthesis has led to the widespread generation of Audio Deepfakes, which are becoming a real threat to voice interfaces. Audio Deepfakes are fake audio recordings that are difficult to differentiate from real recordings because they use AI-generated techniques to clone human voices. When prominent speakers, celebrities, and politicians are the target of Audio Deepfakes, this technology can potentially undermine public confidence and trustworthiness. Therefore, it is essential to create efficient methods and technologies to identify and stop the creation and spread of Audio Deepfakes. To address the critical issue of the widespread circulation of fake audio and to detect Audio Deepfakes, several Machine Learning and Deep Learning techniques have been developed recently. However, most such solutions have been trained using datasets in English, raising concerns about their accuracy and trustworthiness for other languages. The primary objective of this research is to develop a Deep Learning model for detecting Audio Deepfakes in Urdu. For this purpose, the deep learning model is trained using an Urdu language audio dataset. The dataset was prepared using both real and fake audio. The real Urdu audio clips were initially collected from which Deep fakes were generated with the help of the Real-Time Voice Cloning tool. Our Deep Learning-based model is built to detect Audio Deep fakes produced using imitation and synthesis techniques. According to the findings of our study, when tested and evaluated, our model obtained an accuracy of 91 percent.
format Article
id doaj-art-382f8debbb554eb0a56dd33f876e04cf
institution Kabale University
issn 2788-9629
2791-3465
language English
publishDate 2023-07-01
publisher National University of Modern Languages (NUML), Islamabad
record_format Article
series NUML International Journal of Engineering and Computing
spelling doaj-art-382f8debbb554eb0a56dd33f876e04cf2024-12-09T20:53:55ZengNational University of Modern Languages (NUML), IslamabadNUML International Journal of Engineering and Computing2788-96292791-34652023-07-012110.52015/nijec.v2i1.37Deep Learning Approach for Detecting Audio Deepfakes in Urdu Marium Mateen0Jinnah University for Women - Karachi, Pakistan The application of Deep Learning algorithms for speech synthesis has led to the widespread generation of Audio Deepfakes, which are becoming a real threat to voice interfaces. Audio Deepfakes are fake audio recordings that are difficult to differentiate from real recordings because they use AI-generated techniques to clone human voices. When prominent speakers, celebrities, and politicians are the target of Audio Deepfakes, this technology can potentially undermine public confidence and trustworthiness. Therefore, it is essential to create efficient methods and technologies to identify and stop the creation and spread of Audio Deepfakes. To address the critical issue of the widespread circulation of fake audio and to detect Audio Deepfakes, several Machine Learning and Deep Learning techniques have been developed recently. However, most such solutions have been trained using datasets in English, raising concerns about their accuracy and trustworthiness for other languages. The primary objective of this research is to develop a Deep Learning model for detecting Audio Deepfakes in Urdu. For this purpose, the deep learning model is trained using an Urdu language audio dataset. The dataset was prepared using both real and fake audio. The real Urdu audio clips were initially collected from which Deep fakes were generated with the help of the Real-Time Voice Cloning tool. Our Deep Learning-based model is built to detect Audio Deep fakes produced using imitation and synthesis techniques. According to the findings of our study, when tested and evaluated, our model obtained an accuracy of 91 percent. https://nijec.numl.edu.pk/index.php/nijec/article/view/37Audio DeepFakesMachine LearningDeep LearningLSTMRNN
spellingShingle Marium Mateen
Deep Learning Approach for Detecting Audio Deepfakes in Urdu
NUML International Journal of Engineering and Computing
Audio DeepFakes
Machine Learning
Deep Learning
LSTM
RNN
title Deep Learning Approach for Detecting Audio Deepfakes in Urdu
title_full Deep Learning Approach for Detecting Audio Deepfakes in Urdu
title_fullStr Deep Learning Approach for Detecting Audio Deepfakes in Urdu
title_full_unstemmed Deep Learning Approach for Detecting Audio Deepfakes in Urdu
title_short Deep Learning Approach for Detecting Audio Deepfakes in Urdu
title_sort deep learning approach for detecting audio deepfakes in urdu
topic Audio DeepFakes
Machine Learning
Deep Learning
LSTM
RNN
url https://nijec.numl.edu.pk/index.php/nijec/article/view/37
work_keys_str_mv AT mariummateen deeplearningapproachfordetectingaudiodeepfakesinurdu