Video description method based on multidimensional and multimodal information
In order to solve the problem of complex information representation in automatic video description tasks,a multi-dimensional and multi-modal visual feature extraction and fusion method was proposed.Firstly,multi-dimensional features such as static and dynamic attributes of the video sequence were ex...
Saved in:
| Main Authors: | Enjie DING, Zhongyu LIU, Yafeng LIU, Wanli YU |
|---|---|
| Format: | Article |
| Language: | zho |
| Published: |
Editorial Department of Journal on Communications
2020-02-01
|
| Series: | Tongxin xuebao |
| Subjects: | |
| Online Access: | http://www.joconline.com.cn/thesisDetails#10.11959/j.issn.1000-436x.2020037 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Camscribe: Enhanced Dashcam Video Descriptions Through Multimodal Spatiotemporal and Object Detection for Autonomous Vehicles
by: Muhammad Rafiq, et al.
Published: (2025-01-01) -
Fusion-Optimized Multimodal Entity Alignment with Textual Descriptions
by: Chenchen Wang, et al.
Published: (2025-06-01) -
High Perplexity Mountain Flood Level Forecasting in Small Watersheds Based on Compound Long Short-Term Memory Model and Multimodal Short Disaster-Causing Factors
by: Songsong Wang, et al.
Published: (2025-01-01) -
Deep Memory Fusion Model for Long Video Question Answering
by: SUN Guanglu, et al.
Published: (2021-02-01) -
TRAINING OF THE FUTURE INTERPRETERS’ WORKING MEMORY
by: Antonina V. Prokopenko, et al.
Published: (2021-12-01)