A Social Media Dataset and H-GNN-Based Contrastive Learning Scheme for Multimodal Sentiment Analysis

Multimodal sentiment analysis faces a number of challenges, including modality missing, modality heterogeneity gap, incomplete datasets, etc. Previous studies usually adopt schemes like meta-learning or multi-layer structures. Nevertheless, these methods lack interpretability for the interaction bet...

Full description

Saved in:
Bibliographic Details
Main Authors: Jiao Peng, Yue He, Yongjuan Chang, Yanyan Lu, Pengfei Zhang, Zhonghong Ou, Qingzhi Yu
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/2/636
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832589255126286336
author Jiao Peng
Yue He
Yongjuan Chang
Yanyan Lu
Pengfei Zhang
Zhonghong Ou
Qingzhi Yu
author_facet Jiao Peng
Yue He
Yongjuan Chang
Yanyan Lu
Pengfei Zhang
Zhonghong Ou
Qingzhi Yu
author_sort Jiao Peng
collection DOAJ
description Multimodal sentiment analysis faces a number of challenges, including modality missing, modality heterogeneity gap, incomplete datasets, etc. Previous studies usually adopt schemes like meta-learning or multi-layer structures. Nevertheless, these methods lack interpretability for the interaction between modalities. In this paper, we constructed a new dataset, SM-MSD, for sentiment analysis in social media (SAS) that differs significantly from conventional corpora, comprising 10K instances of diverse data from Twitter, encompassing text, emoticons, emojis, and text embedded in images. This dataset aims to reflect authentic social scenarios and various emotional expressions, and provides a meaningful and challenging evaluation benchmark for multimodal sentiment analysis in specific contexts. Furthermore, we propose a multi-task framework based on heterogeneous graph neural networks (H-GNNs) and contrastive learning. For the first time, heterogeneous graph neural networks are applied to multimodal sentiment analysis tasks. In the case of additional labeling data, it guides the emotion prediction of the missing mode. We conduct extensive experiments on multiple datasets to verify the effectiveness of the proposed scheme. Experimental results demonstrate that our proposed scheme surpasses state-of-the-art methods by 1.7% and 0 in accuracy and 1.54% and 4.9% in F1-score on the MOSI and MOSEI datasets, respectively, and exhibits robustness to modality missing scenarios.
format Article
id doaj-art-4aeee671dda648d7a005e60058ff3bb8
institution Kabale University
issn 2076-3417
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-4aeee671dda648d7a005e60058ff3bb82025-01-24T13:20:13ZengMDPI AGApplied Sciences2076-34172025-01-0115263610.3390/app15020636A Social Media Dataset and H-GNN-Based Contrastive Learning Scheme for Multimodal Sentiment AnalysisJiao Peng0Yue He1Yongjuan Chang2Yanyan Lu3Pengfei Zhang4Zhonghong Ou5Qingzhi Yu6State Grid Hebei Information and Telecommunication Branch, Shijiazhuang 050051, ChinaState Grid Hebei Information and Telecommunication Branch, Shijiazhuang 050051, ChinaState Grid Hebei Information and Telecommunication Branch, Shijiazhuang 050051, ChinaState Grid Hebei Information and Telecommunication Branch, Shijiazhuang 050051, ChinaState Grid Hebei Information and Telecommunication Branch, Shijiazhuang 050051, ChinaState Key Laboratory of Networking and Switching Technology, Beijing University of Posts and Telecommunications, Beijing 100876, ChinaSchool of Computer Science (National Pilot Software Engineering School), Beijing University of Posts and Telecommunications, Beijing 100876, ChinaMultimodal sentiment analysis faces a number of challenges, including modality missing, modality heterogeneity gap, incomplete datasets, etc. Previous studies usually adopt schemes like meta-learning or multi-layer structures. Nevertheless, these methods lack interpretability for the interaction between modalities. In this paper, we constructed a new dataset, SM-MSD, for sentiment analysis in social media (SAS) that differs significantly from conventional corpora, comprising 10K instances of diverse data from Twitter, encompassing text, emoticons, emojis, and text embedded in images. This dataset aims to reflect authentic social scenarios and various emotional expressions, and provides a meaningful and challenging evaluation benchmark for multimodal sentiment analysis in specific contexts. Furthermore, we propose a multi-task framework based on heterogeneous graph neural networks (H-GNNs) and contrastive learning. For the first time, heterogeneous graph neural networks are applied to multimodal sentiment analysis tasks. In the case of additional labeling data, it guides the emotion prediction of the missing mode. We conduct extensive experiments on multiple datasets to verify the effectiveness of the proposed scheme. Experimental results demonstrate that our proposed scheme surpasses state-of-the-art methods by 1.7% and 0 in accuracy and 1.54% and 4.9% in F1-score on the MOSI and MOSEI datasets, respectively, and exhibits robustness to modality missing scenarios.https://www.mdpi.com/2076-3417/15/2/636multimodal sentiment analysisdatasetscontrastive learningheterogeneous graph neural networks
spellingShingle Jiao Peng
Yue He
Yongjuan Chang
Yanyan Lu
Pengfei Zhang
Zhonghong Ou
Qingzhi Yu
A Social Media Dataset and H-GNN-Based Contrastive Learning Scheme for Multimodal Sentiment Analysis
Applied Sciences
multimodal sentiment analysis
datasets
contrastive learning
heterogeneous graph neural networks
title A Social Media Dataset and H-GNN-Based Contrastive Learning Scheme for Multimodal Sentiment Analysis
title_full A Social Media Dataset and H-GNN-Based Contrastive Learning Scheme for Multimodal Sentiment Analysis
title_fullStr A Social Media Dataset and H-GNN-Based Contrastive Learning Scheme for Multimodal Sentiment Analysis
title_full_unstemmed A Social Media Dataset and H-GNN-Based Contrastive Learning Scheme for Multimodal Sentiment Analysis
title_short A Social Media Dataset and H-GNN-Based Contrastive Learning Scheme for Multimodal Sentiment Analysis
title_sort social media dataset and h gnn based contrastive learning scheme for multimodal sentiment analysis
topic multimodal sentiment analysis
datasets
contrastive learning
heterogeneous graph neural networks
url https://www.mdpi.com/2076-3417/15/2/636
work_keys_str_mv AT jiaopeng asocialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT yuehe asocialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT yongjuanchang asocialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT yanyanlu asocialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT pengfeizhang asocialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT zhonghongou asocialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT qingzhiyu asocialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT jiaopeng socialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT yuehe socialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT yongjuanchang socialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT yanyanlu socialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT pengfeizhang socialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT zhonghongou socialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis
AT qingzhiyu socialmediadatasetandhgnnbasedcontrastivelearningschemeformultimodalsentimentanalysis