Conceptualizing Discussions on the Dark Web: An Empirical Topic Modeling Approach

Social networks on the Internet have become a home that attracts all types of human thinking to exchange knowledge and ideas and share businesses. On the other hand, it has also become a source for researchers to analyze this knowledge and frame it in patterns that define types of thoughts circulati...

Full description

Saved in:
Bibliographic Details
Main Authors: Randa Basheer, Bassel Alkhatib
Format: Article
Language:English
Published: Wiley 2024-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2024/2775236
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832559635115016192
author Randa Basheer
Bassel Alkhatib
author_facet Randa Basheer
Bassel Alkhatib
author_sort Randa Basheer
collection DOAJ
description Social networks on the Internet have become a home that attracts all types of human thinking to exchange knowledge and ideas and share businesses. On the other hand, it has also become a source for researchers to analyze this knowledge and frame it in patterns that define types of thoughts circulating on these networks and representing the communities around them. In particular, some social networks on the Dark Web attract a special kind of thinking centered around the malicious and illegal activities disseminated on websites and marketplaces on the Dark Web. These networks involve discussions to exchange and discourse information, tips, and advice on performing such business. Studying social networks on the Dark Web is still in its infancy. In this paper, we present a methodology for analyzing the content of social networks on the Dark Web using topic modeling methods. We demonstrate the needed stages for the topic modeling process, beginning with data preprocessing and feature extraction to topic modeling algorithms. We utilize and discuss the following four topic models: LDA, CTM, PAM, and PTM. We discuss the following four topic coherence measures as evaluation metrics: UMass, UCI, CNPMI, and CV, demonstrating the selection of the best number of topics for each model according to the most coherent produced topics. Furthermore, we discuss the limitations, challenges, and future work. Our proposed approach highlights the ability to discover the latent thematic patterns in conversations and messages in the common language used in social networks on the Dark Web, constructing topics as groups of terms and their associations. This paper provides researchers with a leading methodology for analyzing thought patterns on the Dark Web.
format Article
id doaj-art-858aac26cff34846b757e87ba27194be
institution Kabale University
issn 1099-0526
language English
publishDate 2024-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-858aac26cff34846b757e87ba27194be2025-02-03T01:29:32ZengWileyComplexity1099-05262024-01-01202410.1155/2024/2775236Conceptualizing Discussions on the Dark Web: An Empirical Topic Modeling ApproachRanda Basheer0Bassel Alkhatib1Faculty of Information Technology and CommunicationsFaculty of Informatics EngineeringSocial networks on the Internet have become a home that attracts all types of human thinking to exchange knowledge and ideas and share businesses. On the other hand, it has also become a source for researchers to analyze this knowledge and frame it in patterns that define types of thoughts circulating on these networks and representing the communities around them. In particular, some social networks on the Dark Web attract a special kind of thinking centered around the malicious and illegal activities disseminated on websites and marketplaces on the Dark Web. These networks involve discussions to exchange and discourse information, tips, and advice on performing such business. Studying social networks on the Dark Web is still in its infancy. In this paper, we present a methodology for analyzing the content of social networks on the Dark Web using topic modeling methods. We demonstrate the needed stages for the topic modeling process, beginning with data preprocessing and feature extraction to topic modeling algorithms. We utilize and discuss the following four topic models: LDA, CTM, PAM, and PTM. We discuss the following four topic coherence measures as evaluation metrics: UMass, UCI, CNPMI, and CV, demonstrating the selection of the best number of topics for each model according to the most coherent produced topics. Furthermore, we discuss the limitations, challenges, and future work. Our proposed approach highlights the ability to discover the latent thematic patterns in conversations and messages in the common language used in social networks on the Dark Web, constructing topics as groups of terms and their associations. This paper provides researchers with a leading methodology for analyzing thought patterns on the Dark Web.http://dx.doi.org/10.1155/2024/2775236
spellingShingle Randa Basheer
Bassel Alkhatib
Conceptualizing Discussions on the Dark Web: An Empirical Topic Modeling Approach
Complexity
title Conceptualizing Discussions on the Dark Web: An Empirical Topic Modeling Approach
title_full Conceptualizing Discussions on the Dark Web: An Empirical Topic Modeling Approach
title_fullStr Conceptualizing Discussions on the Dark Web: An Empirical Topic Modeling Approach
title_full_unstemmed Conceptualizing Discussions on the Dark Web: An Empirical Topic Modeling Approach
title_short Conceptualizing Discussions on the Dark Web: An Empirical Topic Modeling Approach
title_sort conceptualizing discussions on the dark web an empirical topic modeling approach
url http://dx.doi.org/10.1155/2024/2775236
work_keys_str_mv AT randabasheer conceptualizingdiscussionsonthedarkwebanempiricaltopicmodelingapproach
AT basselalkhatib conceptualizingdiscussionsonthedarkwebanempiricaltopicmodelingapproach