Tweetluenza: Predicting Flu Trends from Twitter Data
Health authorities worldwide strive to detect Influenza prevalence as early as possible in order to prepare for it and minimize its impacts. To this end, we address the Influenza prevalence surveillance and prediction problem. In this paper, we develop a new Influenza prevalence prediction model, ca...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Tsinghua University Press
2019-12-01
|
Series: | Big Data Mining and Analytics |
Subjects: | |
Online Access: | https://www.sciopen.com/article/10.26599/BDMA.2019.9020012 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832572845255819264 |
---|---|
author | Balsam Alkouz Zaher Al Aghbari Jemal Hussien Abawajy |
author_facet | Balsam Alkouz Zaher Al Aghbari Jemal Hussien Abawajy |
author_sort | Balsam Alkouz |
collection | DOAJ |
description | Health authorities worldwide strive to detect Influenza prevalence as early as possible in order to prepare for it and minimize its impacts. To this end, we address the Influenza prevalence surveillance and prediction problem. In this paper, we develop a new Influenza prevalence prediction model, called Tweetluenza, to predict the spread of the Influenza in real time using cross-lingual data harvested from Twitter data streams with emphases on the United Arab Emirates (UAE). Based on the features of tweets, Tweetluenza filters the Influenza tweets and classifies them into two classes, reporting and non-reporting. To monitor the growth of Influenza, the reporting tweets were employed. Furthermore, a linear regression model leverages the reporting tweets to predict the Influenza-related hospital visits in the future. We evaluated Tweetluenza empirically to study its feasibility and compared the results with the actual hospital visits recorded by the UAE Ministry of Health. The results of our experiments demonstrate the practicality of Tweetluenza, which was verified by the high correlation between the Influenza-related Twitter data and hospital visits due to Influenza. Furthermore, the evaluation of the analysis and prediction of Influenza shows that combining English and Arabic tweets improves the correlation results. |
format | Article |
id | doaj-art-e4e70d18c2f2438bb817b7717b2a21c5 |
institution | Kabale University |
issn | 2096-0654 |
language | English |
publishDate | 2019-12-01 |
publisher | Tsinghua University Press |
record_format | Article |
series | Big Data Mining and Analytics |
spelling | doaj-art-e4e70d18c2f2438bb817b7717b2a21c52025-02-02T06:50:16ZengTsinghua University PressBig Data Mining and Analytics2096-06542019-12-012427328710.26599/BDMA.2019.9020012Tweetluenza: Predicting Flu Trends from Twitter DataBalsam Alkouz0Zaher Al Aghbari1Jemal Hussien Abawajy2<institution content-type="dept">Department of Computer Science</institution>, <institution>University of Sharjah</institution>, <city>Sharjah</city> <postal-code>27272</postal-code>, <country>UAE</country>.<institution content-type="dept">Department of Computer Science</institution>, <institution>University of Sharjah</institution>, <city>Sharjah</city> <postal-code>27272</postal-code>, <country>UAE</country>.<institution content-type="dept">Department of Science, Engineering and Built Environment</institution>, <institution>Deakin University</institution>, <city>Melbourne</city> <postal-code>3125</postal-code>, <country>Australia</country>.Health authorities worldwide strive to detect Influenza prevalence as early as possible in order to prepare for it and minimize its impacts. To this end, we address the Influenza prevalence surveillance and prediction problem. In this paper, we develop a new Influenza prevalence prediction model, called Tweetluenza, to predict the spread of the Influenza in real time using cross-lingual data harvested from Twitter data streams with emphases on the United Arab Emirates (UAE). Based on the features of tweets, Tweetluenza filters the Influenza tweets and classifies them into two classes, reporting and non-reporting. To monitor the growth of Influenza, the reporting tweets were employed. Furthermore, a linear regression model leverages the reporting tweets to predict the Influenza-related hospital visits in the future. We evaluated Tweetluenza empirically to study its feasibility and compared the results with the actual hospital visits recorded by the UAE Ministry of Health. The results of our experiments demonstrate the practicality of Tweetluenza, which was verified by the high correlation between the Influenza-related Twitter data and hospital visits due to Influenza. Furthermore, the evaluation of the analysis and prediction of Influenza shows that combining English and Arabic tweets improves the correlation results.https://www.sciopen.com/article/10.26599/BDMA.2019.9020012twitter data analysisinfluenza forecastingprediction using social mediasocial media mining |
spellingShingle | Balsam Alkouz Zaher Al Aghbari Jemal Hussien Abawajy Tweetluenza: Predicting Flu Trends from Twitter Data Big Data Mining and Analytics twitter data analysis influenza forecasting prediction using social media social media mining |
title | Tweetluenza: Predicting Flu Trends from Twitter Data |
title_full | Tweetluenza: Predicting Flu Trends from Twitter Data |
title_fullStr | Tweetluenza: Predicting Flu Trends from Twitter Data |
title_full_unstemmed | Tweetluenza: Predicting Flu Trends from Twitter Data |
title_short | Tweetluenza: Predicting Flu Trends from Twitter Data |
title_sort | tweetluenza predicting flu trends from twitter data |
topic | twitter data analysis influenza forecasting prediction using social media social media mining |
url | https://www.sciopen.com/article/10.26599/BDMA.2019.9020012 |
work_keys_str_mv | AT balsamalkouz tweetluenzapredictingflutrendsfromtwitterdata AT zaheralaghbari tweetluenzapredictingflutrendsfromtwitterdata AT jemalhussienabawajy tweetluenzapredictingflutrendsfromtwitterdata |