Video description method based on multidimensional and multimodal information
In order to solve the problem of complex information representation in automatic video description tasks,a multi-dimensional and multi-modal visual feature extraction and fusion method was proposed.Firstly,multi-dimensional features such as static and dynamic attributes of the video sequence were ex...
Saved in:
Main Authors: | Enjie DING, Zhongyu LIU, Yafeng LIU, Wanli YU |
---|---|
Format: | Article |
Language: | zho |
Published: |
Editorial Department of Journal on Communications
2020-02-01
|
Series: | Tongxin xuebao |
Subjects: | |
Online Access: | http://www.joconline.com.cn/zh/article/doi/10.11959/j.issn.1000-436x.2020037/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Overseas short video recommendations: A multimodal graph convolutional network approach incorporating cultural preferences
by: Xishi Liu, et al.
Published: (2025-03-01) -
Deep Learning-Based Damage Assessment in Cherry Leaves
by: Burakhan Cubukcu, et al.
Published: (2024-12-01) -
Deep learning Chinese input method with incremental vocabulary selection
by: Huajian REN, et al.
Published: (2022-12-01) -
Rice Price Prediction with Long Short-Term Memory (LSTM) Neural Network
by: Rahmat Hidayat, et al.
Published: (2024-10-01) -
Two-Step Deep Learning Approach for Estimating Vegetation Backscatter: A Case Study of Soybean Fields
by: Dong Zhu, et al.
Published: (2024-12-01)