Natural language processing reveals network structure of pain communication in social media using discrete mathematical analysis

Abstract Pain-related discussions on social media provide valuable insights into how people naturally express and communicate their pain experiences. However, the network structure of these discussions remains poorly understood. This study analyzed 57,000 Reddit comments from the GoEmotions dataset...

Full description

Saved in:
Bibliographic Details
Main Authors: Nobuo Okui, Shigeo Horie
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-14680-y
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849235411009798144
author Nobuo Okui
Shigeo Horie
author_facet Nobuo Okui
Shigeo Horie
author_sort Nobuo Okui
collection DOAJ
description Abstract Pain-related discussions on social media provide valuable insights into how people naturally express and communicate their pain experiences. However, the network structure of these discussions remains poorly understood. This study analyzed 57,000 Reddit comments from the GoEmotions dataset (2005–2019) using natural language processing and network analysis techniques grounded in discrete mathematical principles. The constructed network, comprising 5,630 nodes and 86,972 edges, revealed complex patterns of pain-related language use. The network exhibited a sparse overall density (0.0055) but a high clustering coefficient (0.7700), indicating the presence of distinct thematic communities. At the center of the network was the term pain, which showed the highest degree centrality (0.821429), reflecting its semantic anchoring function in pain discourse. Other terms, such as headache, served as context-sensitive bridge nodes that connected different semantic subdomains. In contrast, terms like burning, despite moderate centrality values, were found to co-occur predominantly with metaphorical or decorative expressions rather than emotion- or symptom-related descriptors. Community detection revealed 12 distinct clusters, with the largest containing 1,021 nodes, capturing diverse aspects of pain communication. Stability analysis demonstrated that core pain-related terms maintained consistent centrality, while peripheral or metaphorical terms showed greater variability. These findings offer novel insights into the semantic structure of pain discourse and suggest that network analysis of social media discussions can inform improved clinical communication and symptom assessment.
format Article
id doaj-art-d94b0f0d7f77425bb6970192c68c20c7
institution Kabale University
issn 2045-2322
language English
publishDate 2025-08-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-d94b0f0d7f77425bb6970192c68c20c72025-08-20T04:02:46ZengNature PortfolioScientific Reports2045-23222025-08-0115111210.1038/s41598-025-14680-yNatural language processing reveals network structure of pain communication in social media using discrete mathematical analysisNobuo Okui0Shigeo Horie1Data Science and Informatics for Genetic Disorders, Juntendo UniversityData Science and Informatics for Genetic Disorders, Juntendo UniversityAbstract Pain-related discussions on social media provide valuable insights into how people naturally express and communicate their pain experiences. However, the network structure of these discussions remains poorly understood. This study analyzed 57,000 Reddit comments from the GoEmotions dataset (2005–2019) using natural language processing and network analysis techniques grounded in discrete mathematical principles. The constructed network, comprising 5,630 nodes and 86,972 edges, revealed complex patterns of pain-related language use. The network exhibited a sparse overall density (0.0055) but a high clustering coefficient (0.7700), indicating the presence of distinct thematic communities. At the center of the network was the term pain, which showed the highest degree centrality (0.821429), reflecting its semantic anchoring function in pain discourse. Other terms, such as headache, served as context-sensitive bridge nodes that connected different semantic subdomains. In contrast, terms like burning, despite moderate centrality values, were found to co-occur predominantly with metaphorical or decorative expressions rather than emotion- or symptom-related descriptors. Community detection revealed 12 distinct clusters, with the largest containing 1,021 nodes, capturing diverse aspects of pain communication. Stability analysis demonstrated that core pain-related terms maintained consistent centrality, while peripheral or metaphorical terms showed greater variability. These findings offer novel insights into the semantic structure of pain discourse and suggest that network analysis of social media discussions can inform improved clinical communication and symptom assessment.https://doi.org/10.1038/s41598-025-14680-yPain perceptionDigital health communicationDiscrete mathematics in health communicationNatural languagePain assessment in social mediaSymptom network analysis
spellingShingle Nobuo Okui
Shigeo Horie
Natural language processing reveals network structure of pain communication in social media using discrete mathematical analysis
Scientific Reports
Pain perception
Digital health communication
Discrete mathematics in health communication
Natural language
Pain assessment in social media
Symptom network analysis
title Natural language processing reveals network structure of pain communication in social media using discrete mathematical analysis
title_full Natural language processing reveals network structure of pain communication in social media using discrete mathematical analysis
title_fullStr Natural language processing reveals network structure of pain communication in social media using discrete mathematical analysis
title_full_unstemmed Natural language processing reveals network structure of pain communication in social media using discrete mathematical analysis
title_short Natural language processing reveals network structure of pain communication in social media using discrete mathematical analysis
title_sort natural language processing reveals network structure of pain communication in social media using discrete mathematical analysis
topic Pain perception
Digital health communication
Discrete mathematics in health communication
Natural language
Pain assessment in social media
Symptom network analysis
url https://doi.org/10.1038/s41598-025-14680-y
work_keys_str_mv AT nobuookui naturallanguageprocessingrevealsnetworkstructureofpaincommunicationinsocialmediausingdiscretemathematicalanalysis
AT shigeohorie naturallanguageprocessingrevealsnetworkstructureofpaincommunicationinsocialmediausingdiscretemathematicalanalysis