Continuous speech speaker recognition based on CNN

In the last few years, with the constant improvement of the social life level, the requirement for speech recognition is getting higher and higher. GMM-HMM (Gaussian mixture-hidden Markov model) have been the main method for speaker recognition. Because of the bad modeling capability of big data and...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhendong WU, Shucheng PAN, Jianwu ZHANG
Format: Article
Language:zho
Published: Beijing Xintong Media Co., Ltd 2017-03-01
Series:Dianxin kexue
Subjects:
Online Access:http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017046/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841529124267491328
author Zhendong WU
Shucheng PAN
Jianwu ZHANG
author_facet Zhendong WU
Shucheng PAN
Jianwu ZHANG
author_sort Zhendong WU
collection DOAJ
description In the last few years, with the constant improvement of the social life level, the requirement for speech recognition is getting higher and higher. GMM-HMM (Gaussian mixture-hidden Markov model) have been the main method for speaker recognition. Because of the bad modeling capability of big data and the bad performance of robustness, the development of this model meets the bottleneck.In order to solve this question, researchers began to focus on deep learning technologies. CNN deep learning model for continuous speech speaker recognition was introduced and CSR-CNN model was put forward. The model extracts fixed-length and right-order phonetic fraction to form an ordered sound spectrograph. Then input the voiceprint extract from CNN model to a reward-penalty function to continuous measurement. Experimental results show that CSR-CNN model has very good recognition effectin continuous speech speaker recognition field.
format Article
id doaj-art-c990320a00ba4186bd66056b9d4a5e55
institution Kabale University
issn 1000-0801
language zho
publishDate 2017-03-01
publisher Beijing Xintong Media Co., Ltd
record_format Article
series Dianxin kexue
spelling doaj-art-c990320a00ba4186bd66056b9d4a5e552025-01-15T03:25:30ZzhoBeijing Xintong Media Co., LtdDianxin kexue1000-08012017-03-0133596659804391Continuous speech speaker recognition based on CNNZhendong WUShucheng PANJianwu ZHANGIn the last few years, with the constant improvement of the social life level, the requirement for speech recognition is getting higher and higher. GMM-HMM (Gaussian mixture-hidden Markov model) have been the main method for speaker recognition. Because of the bad modeling capability of big data and the bad performance of robustness, the development of this model meets the bottleneck.In order to solve this question, researchers began to focus on deep learning technologies. CNN deep learning model for continuous speech speaker recognition was introduced and CSR-CNN model was put forward. The model extracts fixed-length and right-order phonetic fraction to form an ordered sound spectrograph. Then input the voiceprint extract from CNN model to a reward-penalty function to continuous measurement. Experimental results show that CSR-CNN model has very good recognition effectin continuous speech speaker recognition field.http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017046/continuous speechsound spectrographGMM-HMMdeep learning
spellingShingle Zhendong WU
Shucheng PAN
Jianwu ZHANG
Continuous speech speaker recognition based on CNN
Dianxin kexue
continuous speech
sound spectrograph
GMM-HMM
deep learning
title Continuous speech speaker recognition based on CNN
title_full Continuous speech speaker recognition based on CNN
title_fullStr Continuous speech speaker recognition based on CNN
title_full_unstemmed Continuous speech speaker recognition based on CNN
title_short Continuous speech speaker recognition based on CNN
title_sort continuous speech speaker recognition based on cnn
topic continuous speech
sound spectrograph
GMM-HMM
deep learning
url http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2017046/
work_keys_str_mv AT zhendongwu continuousspeechspeakerrecognitionbasedoncnn
AT shuchengpan continuousspeechspeakerrecognitionbasedoncnn
AT jianwuzhang continuousspeechspeakerrecognitionbasedoncnn