Speech emotion recognition algorithm based on spectrogram feature extraction of deep space attention feature

Starts from the extraction and classification modeling of speech emotion features, based on the hybrid convolutional neural network model, the Itti model in feature extraction was improved, including increasing the extraction by local binary mode. The strong correlation features were extracted combi...

Full description

Saved in:
Bibliographic Details
Main Authors: Jinhua WANG, Na YING, Chendu ZHU, Zhaosen LIU, Zhedong CAI
Format: Article
Language:zho
Published: Beijing Xintong Media Co., Ltd 2019-07-01
Series:Dianxin kexue
Subjects:
Online Access:http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2019052/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841530531708141568
author Jinhua WANG
Na YING
Chendu ZHU
Zhaosen LIU
Zhedong CAI
author_facet Jinhua WANG
Na YING
Chendu ZHU
Zhaosen LIU
Zhedong CAI
author_sort Jinhua WANG
collection DOAJ
description Starts from the extraction and classification modeling of speech emotion features, based on the hybrid convolutional neural network model, the Itti model in feature extraction was improved, including increasing the extraction by local binary mode. The strong correlation features were extracted combining with the sensitivity of the auditory sensitivity. Then, the constrained extrusion and excitation network structure of the calibration weights were extracted by feature constraints. Finally, a fine-tuning model based on VGGnet and long-short-time memory network hybrid network was formed, further enhancing the ability to express emotions. By validating on the natural sentiment database and the German-German database, the model had a significant increase in the rate of sentiment recognition, which is 8. 43% higher than the benchmark model. At the same time, the recognition effect of the model on the natural database (FAU-AEC) and the Berlin database (EMO-DB) were compared. The experimental results show that the model has a good generalization.
format Article
id doaj-art-8d2cdbc488a545f9ae6682e1d37d85bd
institution Kabale University
issn 1000-0801
language zho
publishDate 2019-07-01
publisher Beijing Xintong Media Co., Ltd
record_format Article
series Dianxin kexue
spelling doaj-art-8d2cdbc488a545f9ae6682e1d37d85bd2025-01-15T03:02:37ZzhoBeijing Xintong Media Co., LtdDianxin kexue1000-08012019-07-013510010859589011Speech emotion recognition algorithm based on spectrogram feature extraction of deep space attention featureJinhua WANGNa YINGChendu ZHUZhaosen LIUZhedong CAIStarts from the extraction and classification modeling of speech emotion features, based on the hybrid convolutional neural network model, the Itti model in feature extraction was improved, including increasing the extraction by local binary mode. The strong correlation features were extracted combining with the sensitivity of the auditory sensitivity. Then, the constrained extrusion and excitation network structure of the calibration weights were extracted by feature constraints. Finally, a fine-tuning model based on VGGnet and long-short-time memory network hybrid network was formed, further enhancing the ability to express emotions. By validating on the natural sentiment database and the German-German database, the model had a significant increase in the rate of sentiment recognition, which is 8. 43% higher than the benchmark model. At the same time, the recognition effect of the model on the natural database (FAU-AEC) and the Berlin database (EMO-DB) were compared. The experimental results show that the model has a good generalization.http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2019052/emotion recognitiondeep hybrid neural network modelvisual attention mechanism
spellingShingle Jinhua WANG
Na YING
Chendu ZHU
Zhaosen LIU
Zhedong CAI
Speech emotion recognition algorithm based on spectrogram feature extraction of deep space attention feature
Dianxin kexue
emotion recognition
deep hybrid neural network model
visual attention mechanism
title Speech emotion recognition algorithm based on spectrogram feature extraction of deep space attention feature
title_full Speech emotion recognition algorithm based on spectrogram feature extraction of deep space attention feature
title_fullStr Speech emotion recognition algorithm based on spectrogram feature extraction of deep space attention feature
title_full_unstemmed Speech emotion recognition algorithm based on spectrogram feature extraction of deep space attention feature
title_short Speech emotion recognition algorithm based on spectrogram feature extraction of deep space attention feature
title_sort speech emotion recognition algorithm based on spectrogram feature extraction of deep space attention feature
topic emotion recognition
deep hybrid neural network model
visual attention mechanism
url http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2019052/
work_keys_str_mv AT jinhuawang speechemotionrecognitionalgorithmbasedonspectrogramfeatureextractionofdeepspaceattentionfeature
AT naying speechemotionrecognitionalgorithmbasedonspectrogramfeatureextractionofdeepspaceattentionfeature
AT chenduzhu speechemotionrecognitionalgorithmbasedonspectrogramfeatureextractionofdeepspaceattentionfeature
AT zhaosenliu speechemotionrecognitionalgorithmbasedonspectrogramfeatureextractionofdeepspaceattentionfeature
AT zhedongcai speechemotionrecognitionalgorithmbasedonspectrogramfeatureextractionofdeepspaceattentionfeature