Synthetic speech detection method using texture feature based on circumferential local ternary pattern

In order to further improve the accuracy of synthetic speech detection, a synthetic speech detection method using texture feature based on circumferential local ternary pattern (CLTP) was proposed.The method extracted the texture information from the speech spectrogram using the CLTP and applied it...

Full description

Saved in:

Bibliographic Details
Main Authors:	Honghui JIN, Zhihua JIAN, Man YANG, Chao WU
Format:	Article
Language:	zho
Published:	Beijing Xintong Media Co., Ltd 2023-06-01
Series:	Dianxin kexue
Subjects:	speaker verification synthetic speech detection CLTP deep residual network
Online Access:	http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2023121/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841530832564518912
author	Honghui JIN Zhihua JIAN Man YANG Chao WU
author_facet	Honghui JIN Zhihua JIAN Man YANG Chao WU
author_sort	Honghui JIN
collection	DOAJ
description	In order to further improve the accuracy of synthetic speech detection, a synthetic speech detection method using texture feature based on circumferential local ternary pattern (CLTP) was proposed.The method extracted the texture information from the speech spectrogram using the CLTP and applied it as the feature representation of speech.The deep residual network was employed as the back-end classifier to determine the real or spoofing speech.The experimental results demonstrate that, on the ASVspoof 2019 dataset, the proposed method reduces the equal error rate (EER) by 54.29% and 2.15% respectively, compared with the traditional constant Q cepstral coefficient (CQCC) and linear predictive cepstral coefficient (LPCC), and reduced the EER by 17.14% compared with the local ternary pattern(LTP) texture features.The CLTP comprehensively takes into account the differences between the central and peripheral pixels in the neighborhood and between each peripheral pixel.Then it can acquire more texture information from the speech spectrogram, and improve the accuracy of synthetic speech detection.
format	Article
id	doaj-art-71259d0f51754d50be82dd3c01317747
institution	Kabale University
issn	1000-0801
language	zho
publishDate	2023-06-01
publisher	Beijing Xintong Media Co., Ltd
record_format	Article
series	Dianxin kexue
spelling	doaj-art-71259d0f51754d50be82dd3c013177472025-01-15T02:58:33ZzhoBeijing Xintong Media Co., LtdDianxin kexue1000-08012023-06-0139859559565886Synthetic speech detection method using texture feature based on circumferential local ternary patternHonghui JINZhihua JIANMan YANGChao WUIn order to further improve the accuracy of synthetic speech detection, a synthetic speech detection method using texture feature based on circumferential local ternary pattern (CLTP) was proposed.The method extracted the texture information from the speech spectrogram using the CLTP and applied it as the feature representation of speech.The deep residual network was employed as the back-end classifier to determine the real or spoofing speech.The experimental results demonstrate that, on the ASVspoof 2019 dataset, the proposed method reduces the equal error rate (EER) by 54.29% and 2.15% respectively, compared with the traditional constant Q cepstral coefficient (CQCC) and linear predictive cepstral coefficient (LPCC), and reduced the EER by 17.14% compared with the local ternary pattern(LTP) texture features.The CLTP comprehensively takes into account the differences between the central and peripheral pixels in the neighborhood and between each peripheral pixel.Then it can acquire more texture information from the speech spectrogram, and improve the accuracy of synthetic speech detection.http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2023121/speaker verificationsynthetic speech detectionCLTPdeep residual network
spellingShingle	Honghui JIN Zhihua JIAN Man YANG Chao WU Synthetic speech detection method using texture feature based on circumferential local ternary pattern Dianxin kexue speaker verification synthetic speech detection CLTP deep residual network
title	Synthetic speech detection method using texture feature based on circumferential local ternary pattern
title_full	Synthetic speech detection method using texture feature based on circumferential local ternary pattern
title_fullStr	Synthetic speech detection method using texture feature based on circumferential local ternary pattern
title_full_unstemmed	Synthetic speech detection method using texture feature based on circumferential local ternary pattern
title_short	Synthetic speech detection method using texture feature based on circumferential local ternary pattern
title_sort	synthetic speech detection method using texture feature based on circumferential local ternary pattern
topic	speaker verification synthetic speech detection CLTP deep residual network
url	http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2023121/
work_keys_str_mv	AT honghuijin syntheticspeechdetectionmethodusingtexturefeaturebasedoncircumferentiallocalternarypattern AT zhihuajian syntheticspeechdetectionmethodusingtexturefeaturebasedoncircumferentiallocalternarypattern AT manyang syntheticspeechdetectionmethodusingtexturefeaturebasedoncircumferentiallocalternarypattern AT chaowu syntheticspeechdetectionmethodusingtexturefeaturebasedoncircumferentiallocalternarypattern

Synthetic speech detection method using texture feature based on circumferential local ternary pattern

Similar Items