Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis Method

Smart campus builds on characteristic learning and feedback evaluation of diverse students and aims to enable intelligent, accurate, and customized education. Mining social media data, especially topic modeling, from students, provides a non-intrusive method to know the instantaneous thoughts and wi...

Full description

Saved in:
Bibliographic Details
Main Authors: Jun Peng, Yiyi Zhou, Xiaoshuai Sun, Jinsong Su, Rongrong Ji
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8594651/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850153592049106944
author Jun Peng
Yiyi Zhou
Xiaoshuai Sun
Jinsong Su
Rongrong Ji
author_facet Jun Peng
Yiyi Zhou
Xiaoshuai Sun
Jinsong Su
Rongrong Ji
author_sort Jun Peng
collection DOAJ
description Smart campus builds on characteristic learning and feedback evaluation of diverse students and aims to enable intelligent, accurate, and customized education. Mining social media data, especially topic modeling, from students, provides a non-intrusive method to know the instantaneous thoughts and willings of them. However, it is challenging to deal with multi-modal data (i.e., text, images, and videos contained in the social media data) as well as the modality dependence and missing modality. In this paper, we present a novel deep topical correlation analysis (DTCA) approach, which achieves robust and accurate topic detection for microblogs and simultaneously handles the two challenges aforementioned. In particular, bidirectional recurrent neural networks and convolutional neural networks are used to learn deep textual and visual features, respectively. Then, a canonical correlation analysis-based fusion scheme is proposed, which has two innovations to deal with both modality independence and modality missing, <italic>i.e.,</italic> a filter gate to capture the modality dependency and a matrix-projection based component to handle the missing modality. DTCA is trained in an end-to-end manner, in which the parameters of visual, textual, and cross-modal prediction parts are trained jointly. We further release a large-scale cross-modal twitter dataset for topic detection, denoted as TM-Twitter. On this dataset, extensive and quantitative evaluations are conducted with comparisons to several state-of-the-art and alternative approaches. Significant performance gains are reported to demonstrate the merits of the proposed DTCA.
format Article
id doaj-art-a6e2b7a0de344b10b0aa04897c9ecf74
institution OA Journals
issn 2169-3536
language English
publishDate 2019-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-a6e2b7a0de344b10b0aa04897c9ecf742025-08-20T02:25:40ZengIEEEIEEE Access2169-35362019-01-0177555756410.1109/ACCESS.2018.28900918594651Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis MethodJun Peng0https://orcid.org/0000-0003-0655-1594Yiyi Zhou1Xiaoshuai Sun2Jinsong Su3Rongrong Ji4School of Information Science and Engineering, Xiamen University, Xiamen, ChinaSchool of Information Science and Engineering, Xiamen University, Xiamen, ChinaSchool of Computer Science and Technology, Harbin Institute of Technology, Harbin, ChinaSchool of Software, Xiamen University, Xiamen, ChinaSchool of Information Science and Engineering, Xiamen University, Xiamen, ChinaSmart campus builds on characteristic learning and feedback evaluation of diverse students and aims to enable intelligent, accurate, and customized education. Mining social media data, especially topic modeling, from students, provides a non-intrusive method to know the instantaneous thoughts and willings of them. However, it is challenging to deal with multi-modal data (i.e., text, images, and videos contained in the social media data) as well as the modality dependence and missing modality. In this paper, we present a novel deep topical correlation analysis (DTCA) approach, which achieves robust and accurate topic detection for microblogs and simultaneously handles the two challenges aforementioned. In particular, bidirectional recurrent neural networks and convolutional neural networks are used to learn deep textual and visual features, respectively. Then, a canonical correlation analysis-based fusion scheme is proposed, which has two innovations to deal with both modality independence and modality missing, <italic>i.e.,</italic> a filter gate to capture the modality dependency and a matrix-projection based component to handle the missing modality. DTCA is trained in an end-to-end manner, in which the parameters of visual, textual, and cross-modal prediction parts are trained jointly. We further release a large-scale cross-modal twitter dataset for topic detection, denoted as TM-Twitter. On this dataset, extensive and quantitative evaluations are conducted with comparisons to several state-of-the-art and alternative approaches. Significant performance gains are reported to demonstrate the merits of the proposed DTCA.https://ieeexplore.ieee.org/document/8594651/Topic modelingdeep neural networkscorrelation analysis
spellingShingle Jun Peng
Yiyi Zhou
Xiaoshuai Sun
Jinsong Su
Rongrong Ji
Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis Method
IEEE Access
Topic modeling
deep neural networks
correlation analysis
title Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis Method
title_full Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis Method
title_fullStr Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis Method
title_full_unstemmed Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis Method
title_short Social Media Based Topic Modeling for Smart Campus: A Deep Topical Correlation Analysis Method
title_sort social media based topic modeling for smart campus a deep topical correlation analysis method
topic Topic modeling
deep neural networks
correlation analysis
url https://ieeexplore.ieee.org/document/8594651/
work_keys_str_mv AT junpeng socialmediabasedtopicmodelingforsmartcampusadeeptopicalcorrelationanalysismethod
AT yiyizhou socialmediabasedtopicmodelingforsmartcampusadeeptopicalcorrelationanalysismethod
AT xiaoshuaisun socialmediabasedtopicmodelingforsmartcampusadeeptopicalcorrelationanalysismethod
AT jinsongsu socialmediabasedtopicmodelingforsmartcampusadeeptopicalcorrelationanalysismethod
AT rongrongji socialmediabasedtopicmodelingforsmartcampusadeeptopicalcorrelationanalysismethod