A method of synthetic speech spoofing detection using constant Q modulation envelope

In response to the low accuracy of synthetic speech spoofing detection based on traditional acoustic feature parameters, poor detection performance for unknown types of synthetic speech, and performance degradation in noisy environments, a method for detecting spoofing synthetic speech was proposed...

Full description

Saved in:
Bibliographic Details
Main Authors: Jia XU, Zhihua JIAN, Honghui JIN, Chao WU
Format: Article
Language:zho
Published: Beijing Xintong Media Co., Ltd 2023-11-01
Series:Dianxin kexue
Subjects:
Online Access:http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2023187/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841530889102688256
author Jia XU
Zhihua JIAN
Honghui JIN
Chao WU
author_facet Jia XU
Zhihua JIAN
Honghui JIN
Chao WU
author_sort Jia XU
collection DOAJ
description In response to the low accuracy of synthetic speech spoofing detection based on traditional acoustic feature parameters, poor detection performance for unknown types of synthetic speech, and performance degradation in noisy environments, a method for detecting spoofing synthetic speech was proposed using constant Q modulation envelope (CQME) .The motivation of the method was from the fact that the temporal envelope of speech contained abundant information and there was a big difference in detail between the envelope of synthetic speech and genuine speech.The modulation envelope spectrum of speech was obtained by employing constant Q transform (CQT), and the root mean square of each frequency component was calculated to derive the CQME feature vector.And then the CQME feature vector was used to train the random forest classifier for discriminating genuine speech from spoofing synthetic speech.Experimental results demonstrate that the random forest trained with CQME features achieves high detection performance on the ASVspoof 2019 dataset and exhibites good detection efficacy for unknown types of synthetic speech.Furthermore, the proposed method shows high detection performance even under various noise conditions, having excellent noise robustness.
format Article
id doaj-art-14b3b1db7f894e29ad7782d7fad53920
institution Kabale University
issn 1000-0801
language zho
publishDate 2023-11-01
publisher Beijing Xintong Media Co., Ltd
record_format Article
series Dianxin kexue
spelling doaj-art-14b3b1db7f894e29ad7782d7fad539202025-01-15T02:57:57ZzhoBeijing Xintong Media Co., LtdDianxin kexue1000-08012023-11-013910711559559389A method of synthetic speech spoofing detection using constant Q modulation envelopeJia XUZhihua JIANHonghui JINChao WUIn response to the low accuracy of synthetic speech spoofing detection based on traditional acoustic feature parameters, poor detection performance for unknown types of synthetic speech, and performance degradation in noisy environments, a method for detecting spoofing synthetic speech was proposed using constant Q modulation envelope (CQME) .The motivation of the method was from the fact that the temporal envelope of speech contained abundant information and there was a big difference in detail between the envelope of synthetic speech and genuine speech.The modulation envelope spectrum of speech was obtained by employing constant Q transform (CQT), and the root mean square of each frequency component was calculated to derive the CQME feature vector.And then the CQME feature vector was used to train the random forest classifier for discriminating genuine speech from spoofing synthetic speech.Experimental results demonstrate that the random forest trained with CQME features achieves high detection performance on the ASVspoof 2019 dataset and exhibites good detection efficacy for unknown types of synthetic speech.Furthermore, the proposed method shows high detection performance even under various noise conditions, having excellent noise robustness.http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2023187/synthetic speechspoofing speech detectionconstant Q modulation enveloperandom forest
spellingShingle Jia XU
Zhihua JIAN
Honghui JIN
Chao WU
A method of synthetic speech spoofing detection using constant Q modulation envelope
Dianxin kexue
synthetic speech
spoofing speech detection
constant Q modulation envelope
random forest
title A method of synthetic speech spoofing detection using constant Q modulation envelope
title_full A method of synthetic speech spoofing detection using constant Q modulation envelope
title_fullStr A method of synthetic speech spoofing detection using constant Q modulation envelope
title_full_unstemmed A method of synthetic speech spoofing detection using constant Q modulation envelope
title_short A method of synthetic speech spoofing detection using constant Q modulation envelope
title_sort method of synthetic speech spoofing detection using constant q modulation envelope
topic synthetic speech
spoofing speech detection
constant Q modulation envelope
random forest
url http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2023187/
work_keys_str_mv AT jiaxu amethodofsyntheticspeechspoofingdetectionusingconstantqmodulationenvelope
AT zhihuajian amethodofsyntheticspeechspoofingdetectionusingconstantqmodulationenvelope
AT honghuijin amethodofsyntheticspeechspoofingdetectionusingconstantqmodulationenvelope
AT chaowu amethodofsyntheticspeechspoofingdetectionusingconstantqmodulationenvelope
AT jiaxu methodofsyntheticspeechspoofingdetectionusingconstantqmodulationenvelope
AT zhihuajian methodofsyntheticspeechspoofingdetectionusingconstantqmodulationenvelope
AT honghuijin methodofsyntheticspeechspoofingdetectionusingconstantqmodulationenvelope
AT chaowu methodofsyntheticspeechspoofingdetectionusingconstantqmodulationenvelope