Deep Learning Approach for Detecting Audio Deepfakes in Urdu
The application of Deep Learning algorithms for speech synthesis has led to the widespread generation of Audio Deepfakes, which are becoming a real threat to voice interfaces. Audio Deepfakes are fake audio recordings that are difficult to differentiate from real recordings because they use AI-gene...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
National University of Modern Languages (NUML), Islamabad
2023-07-01
|
Series: | NUML International Journal of Engineering and Computing |
Subjects: | |
Online Access: | https://nijec.numl.edu.pk/index.php/nijec/article/view/37 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1846129739259969536 |
---|---|
author | Marium Mateen |
author_facet | Marium Mateen |
author_sort | Marium Mateen |
collection | DOAJ |
description |
The application of Deep Learning algorithms for speech synthesis has led to the widespread generation of Audio Deepfakes, which are becoming a real threat to voice interfaces. Audio Deepfakes are fake audio recordings that are difficult to differentiate from real recordings because they use AI-generated techniques to clone human voices. When prominent speakers, celebrities, and politicians are the target of Audio Deepfakes, this technology can potentially undermine public confidence and trustworthiness. Therefore, it is essential to create efficient methods and technologies to identify and stop the creation and spread of Audio Deepfakes. To address the critical issue of the widespread circulation of fake audio and to detect Audio Deepfakes, several Machine Learning and Deep Learning techniques have been developed recently. However, most such solutions have been trained using datasets in English, raising concerns about their accuracy and trustworthiness for other languages. The primary objective of this research is to develop a Deep Learning model for detecting Audio Deepfakes in Urdu. For this purpose, the deep learning model is trained using an Urdu language audio dataset. The dataset was prepared using both real and fake audio. The real Urdu audio clips were initially collected from which Deep fakes were generated with the help of the Real-Time Voice Cloning tool. Our Deep Learning-based model is built to detect Audio Deep fakes produced using imitation and synthesis techniques. According to the findings of our study, when tested and evaluated, our model obtained an accuracy of 91 percent.
|
format | Article |
id | doaj-art-382f8debbb554eb0a56dd33f876e04cf |
institution | Kabale University |
issn | 2788-9629 2791-3465 |
language | English |
publishDate | 2023-07-01 |
publisher | National University of Modern Languages (NUML), Islamabad |
record_format | Article |
series | NUML International Journal of Engineering and Computing |
spelling | doaj-art-382f8debbb554eb0a56dd33f876e04cf2024-12-09T20:53:55ZengNational University of Modern Languages (NUML), IslamabadNUML International Journal of Engineering and Computing2788-96292791-34652023-07-012110.52015/nijec.v2i1.37Deep Learning Approach for Detecting Audio Deepfakes in Urdu Marium Mateen0Jinnah University for Women - Karachi, Pakistan The application of Deep Learning algorithms for speech synthesis has led to the widespread generation of Audio Deepfakes, which are becoming a real threat to voice interfaces. Audio Deepfakes are fake audio recordings that are difficult to differentiate from real recordings because they use AI-generated techniques to clone human voices. When prominent speakers, celebrities, and politicians are the target of Audio Deepfakes, this technology can potentially undermine public confidence and trustworthiness. Therefore, it is essential to create efficient methods and technologies to identify and stop the creation and spread of Audio Deepfakes. To address the critical issue of the widespread circulation of fake audio and to detect Audio Deepfakes, several Machine Learning and Deep Learning techniques have been developed recently. However, most such solutions have been trained using datasets in English, raising concerns about their accuracy and trustworthiness for other languages. The primary objective of this research is to develop a Deep Learning model for detecting Audio Deepfakes in Urdu. For this purpose, the deep learning model is trained using an Urdu language audio dataset. The dataset was prepared using both real and fake audio. The real Urdu audio clips were initially collected from which Deep fakes were generated with the help of the Real-Time Voice Cloning tool. Our Deep Learning-based model is built to detect Audio Deep fakes produced using imitation and synthesis techniques. According to the findings of our study, when tested and evaluated, our model obtained an accuracy of 91 percent. https://nijec.numl.edu.pk/index.php/nijec/article/view/37Audio DeepFakesMachine LearningDeep LearningLSTMRNN |
spellingShingle | Marium Mateen Deep Learning Approach for Detecting Audio Deepfakes in Urdu NUML International Journal of Engineering and Computing Audio DeepFakes Machine Learning Deep Learning LSTM RNN |
title | Deep Learning Approach for Detecting Audio Deepfakes in Urdu |
title_full | Deep Learning Approach for Detecting Audio Deepfakes in Urdu |
title_fullStr | Deep Learning Approach for Detecting Audio Deepfakes in Urdu |
title_full_unstemmed | Deep Learning Approach for Detecting Audio Deepfakes in Urdu |
title_short | Deep Learning Approach for Detecting Audio Deepfakes in Urdu |
title_sort | deep learning approach for detecting audio deepfakes in urdu |
topic | Audio DeepFakes Machine Learning Deep Learning LSTM RNN |
url | https://nijec.numl.edu.pk/index.php/nijec/article/view/37 |
work_keys_str_mv | AT mariummateen deeplearningapproachfordetectingaudiodeepfakesinurdu |