Analysis and Classification of Fake News Using Sequential Pattern Mining

Disinformation, often known as fake news, is a major issue that has received a lot of attention lately. Many researchers have proposed effective means of detecting and addressing it. Current machine and deep learning based methodologies for classification/detection of fake news are content-based, ne...

Full description

Saved in:
Bibliographic Details
Main Authors: M. Zohaib Nawaz, M. Saqib Nawaz, Philippe Fournier-Viger, Yulin He
Format: Article
Language:English
Published: Tsinghua University Press 2024-09-01
Series:Big Data Mining and Analytics
Subjects:
Online Access:https://www.sciopen.com/article/10.26599/BDMA.2024.9020015
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832544400957243392
author M. Zohaib Nawaz
M. Saqib Nawaz
Philippe Fournier-Viger
Yulin He
author_facet M. Zohaib Nawaz
M. Saqib Nawaz
Philippe Fournier-Viger
Yulin He
author_sort M. Zohaib Nawaz
collection DOAJ
description Disinformation, often known as fake news, is a major issue that has received a lot of attention lately. Many researchers have proposed effective means of detecting and addressing it. Current machine and deep learning based methodologies for classification/detection of fake news are content-based, network (propagation) based, or multimodal methods that combine both textual and visual information. We introduce here a framework, called FNACSPM, based on sequential pattern mining (SPM), for fake news analysis and classification. In this framework, six publicly available datasets, containing a diverse range of fake and real news, and their combination, are first transformed into a proper format. Then, algorithms for SPM are applied to the transformed datasets to extract frequent patterns (and rules) of words, phrases, or linguistic features. The obtained patterns capture distinctive characteristics associated with fake or real news content, providing valuable insights into the underlying structures and commonalities of misinformation. Subsequently, the discovered frequent patterns are used as features for fake news classification. This framework is evaluated with eight classifiers, and their performance is assessed with various metrics. Extensive experiments were performed and obtained results show that FNACSPM outperformed other state-of-the-art approaches for fake news classification, and that it expedites the classification task with high accuracy.
format Article
id doaj-art-533e3f62e8a04698a0254c095ae51e7e
institution Kabale University
issn 2096-0654
language English
publishDate 2024-09-01
publisher Tsinghua University Press
record_format Article
series Big Data Mining and Analytics
spelling doaj-art-533e3f62e8a04698a0254c095ae51e7e2025-02-03T10:19:59ZengTsinghua University PressBig Data Mining and Analytics2096-06542024-09-017394296310.26599/BDMA.2024.9020015Analysis and Classification of Fake News Using Sequential Pattern MiningM. Zohaib Nawaz0M. Saqib Nawaz1Philippe Fournier-Viger2Yulin He3College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060Guangdong Laboratory of Artificial Intelligence and Digital Economy (SZ), Shenzhen 518107, ChinaDisinformation, often known as fake news, is a major issue that has received a lot of attention lately. Many researchers have proposed effective means of detecting and addressing it. Current machine and deep learning based methodologies for classification/detection of fake news are content-based, network (propagation) based, or multimodal methods that combine both textual and visual information. We introduce here a framework, called FNACSPM, based on sequential pattern mining (SPM), for fake news analysis and classification. In this framework, six publicly available datasets, containing a diverse range of fake and real news, and their combination, are first transformed into a proper format. Then, algorithms for SPM are applied to the transformed datasets to extract frequent patterns (and rules) of words, phrases, or linguistic features. The obtained patterns capture distinctive characteristics associated with fake or real news content, providing valuable insights into the underlying structures and commonalities of misinformation. Subsequently, the discovered frequent patterns are used as features for fake news classification. This framework is evaluated with eight classifiers, and their performance is assessed with various metrics. Extensive experiments were performed and obtained results show that FNACSPM outperformed other state-of-the-art approaches for fake news classification, and that it expedites the classification task with high accuracy.https://www.sciopen.com/article/10.26599/BDMA.2024.9020015disinformationfake newssequential pattern mining (spm)frequent patternsclassification
spellingShingle M. Zohaib Nawaz
M. Saqib Nawaz
Philippe Fournier-Viger
Yulin He
Analysis and Classification of Fake News Using Sequential Pattern Mining
Big Data Mining and Analytics
disinformation
fake news
sequential pattern mining (spm)
frequent patterns
classification
title Analysis and Classification of Fake News Using Sequential Pattern Mining
title_full Analysis and Classification of Fake News Using Sequential Pattern Mining
title_fullStr Analysis and Classification of Fake News Using Sequential Pattern Mining
title_full_unstemmed Analysis and Classification of Fake News Using Sequential Pattern Mining
title_short Analysis and Classification of Fake News Using Sequential Pattern Mining
title_sort analysis and classification of fake news using sequential pattern mining
topic disinformation
fake news
sequential pattern mining (spm)
frequent patterns
classification
url https://www.sciopen.com/article/10.26599/BDMA.2024.9020015
work_keys_str_mv AT mzohaibnawaz analysisandclassificationoffakenewsusingsequentialpatternmining
AT msaqibnawaz analysisandclassificationoffakenewsusingsequentialpatternmining
AT philippefournierviger analysisandclassificationoffakenewsusingsequentialpatternmining
AT yulinhe analysisandclassificationoffakenewsusingsequentialpatternmining