A playback speech detection algorithm based on log inverse Mel-frequency spectral coefficient

The popularity and portability of high-fidelity audio recording equipment and playback equipment poses a serious challenge for speaker recognition systems against playback attacks.Based on the differences between the original speech and the playback speech in high frequency region,the algorithm reve...

Full description

Saved in:
Bibliographic Details
Main Authors: Lang LIN, Rangding WANG, Diqun YAN, Can LI
Format: Article
Language:zho
Published: Beijing Xintong Media Co., Ltd 2018-05-01
Series:Dianxin kexue
Subjects:
Online Access:http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2018020/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:The popularity and portability of high-fidelity audio recording equipment and playback equipment poses a serious challenge for speaker recognition systems against playback attacks.Based on the differences between the original speech and the playback speech in high frequency region,the algorithm reversed the Mel-filter bank in Mel-frequency cepstral coefficient (MFCC) calculation,and the coefficients before the DCT were used as the features of the algorithm.SVM was utilized as the classifier.Experimental results show that this algorithm can effectively detect the playback speech.In addition,the algorithm is integrated into the GMM-UBM speaker recognition system,which significantly improves the systems’ capability of resisting the playback attack.
ISSN:1000-0801