Micro-blog topic detection algorithm based on topic model

Micro-blog data is real-time, high-volume, short-text, and noise-rich, which makes it a challenge for traditional topic detection technology. A novel micro-blog topic detection algorithm based on a topic model is proposed. First, the micro-blog data is expressed as a text word matrix and wo...

Bibliographic Details
Main Authors: Hua-jun HUANG, Jun-shan TAN, Jiao-hua QIN
Format: Article
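Language: English
Published: POSTS&TELECOM PRESS Co., LTD 2016-05-01
Series: 网络与信息安全学报 (Chinese Journal of Network and Information Security)
Subjects: topic detection; topic model; text word matrix; word relation matrix
Online Access: http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2016.00049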
author Hua-jun HUANG
Jun-shan TAN
Jiao-hua QIN
collection DOAJ
description Micro-blog data is real-time, high-volume, short-text, and noise-rich, which makes it a challenge for traditional topic detection technology. A novel micro-blog topic detection algorithm based on a topic model is proposed. First, the micro-blog data is expressed as a text word matrix and a word relation matrix, and the topic words are extracted from these two matrices. Second, the topic model is obtained by clustering. Finally, micro-blog topics are detected by clustering the text with the topic model. Experimental results show that the proposed algorithm detects text topics effectively; with the best parameter group for precision, recall, and F, the F value is about 95%.
format Article
id doaj-art-734cbd2bbd68402dbb5f863cf7e9261c
institution DOAJ
issn 2096-109X
language English
publishDate 2016-05-01
publisher POSTS&TELECOM PRESS Co., LTD
record_format Article
series 网络与信息安全学报
title Micro-blog topic detection algorithm based on topic model
topic topic detection
topic model
text word matrix
word relation matrix
url http://www.cjnis.com.cn/thesisDetails#10.11959/j.issn.2096-109x.2016.00049
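
The description field above names two data representations (a text word matrix and a word relation matrix) and an F value, without defining them. The following is a minimal Python sketch of what such structures could look like, not the authors' implementation. It assumes that the text word matrix holds per-post term counts, that the word relation matrix counts posts in which two words co-occur, and that F is the usual harmonic mean of precision and recall. All function names and the sample posts are hypothetical.

```python
from collections import Counter
from itertools import combinations


def text_word_matrix(docs, vocab):
    """Rows are posts, columns are vocabulary words, entries are term counts."""
    index = {word: j for j, word in enumerate(vocab)}
    matrix = [[0] * len(vocab) for _ in docs]
    for i, doc in enumerate(docs):
        for word, count in Counter(doc).items():
            matrix[i][index[word]] = count
    return matrix


def word_relation_matrix(docs, vocab):
    """Symmetric matrix: entry (i, j) counts posts containing both words.

    Co-occurrence is an assumption; the record does not say how the
    word relation is measured.
    """
    index = {word: j for j, word in enumerate(vocab)}
    size = len(vocab)
    relation = [[0] * size for _ in range(size)]
    for doc in docs:
        for a, b in combinations(sorted(set(doc)), 2):
            i, j = index[a], index[b]
            relation[i][j] += 1
            relation[j][i] += 1
    return relation


def f_measure(precision, recall):
    """Harmonic mean of precision and recall (assumed meaning of F)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical, already-tokenized posts.
docs = [
    ["quake", "city", "rescue"],
    ["quake", "rescue"],
    ["match", "goal"],
]
vocab = sorted({word for doc in docs for word in doc})

twm = text_word_matrix(docs, vocab)
wrm = word_relation_matrix(docs, vocab)
```

Under these assumptions, words that appear together often ("quake" and "rescue" here) receive larger entries in the relation matrix, which is the kind of signal a clustering step could use to group topic words.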