Multimodal fusion: A study on speech-text emotion recognition with the integration of deep learning
Recognition of various human emotions holds significant value in numerous real-world scenarios. This paper focuses on the multimodal fusion of speech and text for emotion recognition. A 39-dimensional Mel-frequency cepstral coefficient (MFCC) vector was used as the speech emotion feature. A 300-dimensional...
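The 39-dimensional MFCC feature mentioned in the abstract is conventionally the 13 static coefficients stacked with their delta and delta-delta derivatives (13 × 3 = 39). Below is a minimal sketch of that extraction using `librosa`; the function name `extract_mfcc_39`, the 13-coefficient split, and all framing parameters are illustrative assumptions, not the paper's exact configuration.

```python
import librosa
import numpy as np

def extract_mfcc_39(wav_path: str) -> np.ndarray:
    """Sketch: 39-dim MFCC features (13 static + 13 delta + 13 delta-delta).

    Assumes the common 13-coefficient convention; the paper's actual
    frame length, hop size, and sample rate are not specified here.
    """
    y, sr = librosa.load(wav_path, sr=None)             # keep native sample rate
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13)  # 13 static coefficients
    delta = librosa.feature.delta(mfcc)                 # first-order derivatives
    delta2 = librosa.feature.delta(mfcc, order=2)       # second-order derivatives
    # Stack along the feature axis -> shape (39, num_frames)
    return np.concatenate([mfcc, delta, delta2], axis=0)
```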
| Main Authors: | Yanan Shang, Tianqi Fu |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Elsevier, 2024-12-01 |
| Series: | Intelligent Systems with Applications |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2667305324001108 |
Similar Items

- Chinese Mathematical Knowledge Entity Recognition Based on Linguistically Motivated Bidirectional Encoder Representation from Transformers
  by: Wei Song, et al.
  Published: (2025-01-01)
- DropBlock based bimodal hybrid neural network for wireless communication modulation recognition
  by: Yan GAO, et al.
  Published: (2022-05-01)
- Polish Speech and Text Emotion Recognition in a Multimodal Emotion Analysis System
  by: Kamil Skowroński, et al.
  Published: (2024-11-01)
- Research on Medical Text Parsing Method Based on BiGRU-BiLSTM Multi-Task Learning
  by: Yunli Fan, et al.
  Published: (2024-11-01)
- A hybrid deep learning framework for short-term load forecasting with improved data cleansing and preprocessing techniques
  by: Muhammad Sajid Iqbal, et al.
  Published: (2024-12-01)