Patient Voices in Dialysis Care: Sentiment Analysis and Topic Modeling Study of Social Media Discourse

BackgroundPatients with end-stage kidney disease undergoing dialysis face significant physical, psychological, and social challenges that impact their quality of life. Social media platforms such as X (formerly known as Twitter) have become important outlets for these patient...

Full description

Saved in:
Bibliographic Details
Main Authors: Ravi Shankar, Qian Xu, Anjali Bundele
Format: Article
Language:English
Published: JMIR Publications 2025-05-01
Series:Journal of Medical Internet Research
Online Access:https://www.jmir.org/2025/1/e70128
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849322095778988032
author Ravi Shankar
Qian Xu
Anjali Bundele
author_facet Ravi Shankar
Qian Xu
Anjali Bundele
author_sort Ravi Shankar
collection DOAJ
description BackgroundPatients with end-stage kidney disease undergoing dialysis face significant physical, psychological, and social challenges that impact their quality of life. Social media platforms such as X (formerly known as Twitter) have become important outlets for these patients to share experiences and exchange information. ObjectiveThis study aimed to uncover key themes, emotions, and challenges expressed by the dialysis community on X from April 2006 to August 2024 by leveraging natural language processing techniques, specifically sentiment analysis and topic modeling. MethodsWe collected 12,976 publicly available X posts related to dialysis using the platform’s application programming interface version 2 and Python’s Tweepy library. After rigorous preprocessing, 58.13% (7543/12,976) of the posts were retained for analysis. Sentiment analysis using the Valence Aware Dictionary and Sentiment Reasoner (VADER) model, which is a rule-based sentiment analyzer specifically attuned to social media content, classified the emotional tone of posts. VADER uses a human-curated lexicon that maps lexical features to sentiment scores, considering punctuation, capitalization, and modifiers. For topic modeling, posts with <50 tokens were removed, leaving 53.81% (4059/7543) of the posts, which were analyzed using latent Dirichlet allocation with coherence score optimization to identify the optimal number of topics (k=8). The analysis pipeline was implemented using Python’s Natural Language Toolkit, Gensim, and scikit-learn libraries, with hyperparameter tuning to maximize model performance. ResultsSentiment analysis revealed 49.2% (3711/7543) positive, 26.2% (1976/7543) negative, and 24.7% (1863/7543) neutral sentiment posts. Latent Dirichlet allocation topic modeling identified 8 key thematic clusters: medical procedures and outcomes (722/4059, 17.8% prevalence), daily life impact (666/4059, 16.4%), risks and complications (621/4059, 15.3%), patient education and support (544/4059, 13.4%), health care access and costs (499/4059, 12.3%), symptoms and side effects (442/4059, 10.9%), patient experiences and socioeconomic challenges (406/4059, 10%), and diet and fluid management (162/4059, 4%). Cross-analysis of topics and sentiment revealed that negative sentiment was highest for daily life impact (580/666, 87.1%) and socioeconomic challenges (145/406, 35.8%), whereas the education and support topic exhibited more positive sentiment (250/544, 46%). Topic coherence scores ranged from 0.38 to 0.52, with the medical procedures topic showing the highest semantic coherence. Intertopic distance mapping via multidimensional scaling revealed conceptual relationships between identified themes, with lifestyle impact and socioeconomic challenges clustering closely. Our longitudinal analysis demonstrated evolving discourse patterns, with technology-related discussions increasing by 24% in recent years, whereas financial concerns remained consistently prominent. ConclusionsThis study provides a comprehensive, data-driven understanding of the complex lived experiences of patients undergoing dialysis shared on social media. The findings underscore the need for more holistic, patient-centered care models and policies that address the multidimensional challenges illuminated by patients’ voices.
format Article
id doaj-art-f0892025b3ba4f4ca0fcee70e981a6c0
institution Kabale University
issn 1438-8871
language English
publishDate 2025-05-01
publisher JMIR Publications
record_format Article
series Journal of Medical Internet Research
spelling doaj-art-f0892025b3ba4f4ca0fcee70e981a6c02025-08-20T03:49:32ZengJMIR PublicationsJournal of Medical Internet Research1438-88712025-05-0127e7012810.2196/70128Patient Voices in Dialysis Care: Sentiment Analysis and Topic Modeling Study of Social Media DiscourseRavi Shankarhttps://orcid.org/0009-0005-5578-3481Qian Xuhttps://orcid.org/0009-0007-6845-9618Anjali Bundelehttps://orcid.org/0000-0003-1450-5467 BackgroundPatients with end-stage kidney disease undergoing dialysis face significant physical, psychological, and social challenges that impact their quality of life. Social media platforms such as X (formerly known as Twitter) have become important outlets for these patients to share experiences and exchange information. ObjectiveThis study aimed to uncover key themes, emotions, and challenges expressed by the dialysis community on X from April 2006 to August 2024 by leveraging natural language processing techniques, specifically sentiment analysis and topic modeling. MethodsWe collected 12,976 publicly available X posts related to dialysis using the platform’s application programming interface version 2 and Python’s Tweepy library. After rigorous preprocessing, 58.13% (7543/12,976) of the posts were retained for analysis. Sentiment analysis using the Valence Aware Dictionary and Sentiment Reasoner (VADER) model, which is a rule-based sentiment analyzer specifically attuned to social media content, classified the emotional tone of posts. VADER uses a human-curated lexicon that maps lexical features to sentiment scores, considering punctuation, capitalization, and modifiers. For topic modeling, posts with <50 tokens were removed, leaving 53.81% (4059/7543) of the posts, which were analyzed using latent Dirichlet allocation with coherence score optimization to identify the optimal number of topics (k=8). The analysis pipeline was implemented using Python’s Natural Language Toolkit, Gensim, and scikit-learn libraries, with hyperparameter tuning to maximize model performance. ResultsSentiment analysis revealed 49.2% (3711/7543) positive, 26.2% (1976/7543) negative, and 24.7% (1863/7543) neutral sentiment posts. Latent Dirichlet allocation topic modeling identified 8 key thematic clusters: medical procedures and outcomes (722/4059, 17.8% prevalence), daily life impact (666/4059, 16.4%), risks and complications (621/4059, 15.3%), patient education and support (544/4059, 13.4%), health care access and costs (499/4059, 12.3%), symptoms and side effects (442/4059, 10.9%), patient experiences and socioeconomic challenges (406/4059, 10%), and diet and fluid management (162/4059, 4%). Cross-analysis of topics and sentiment revealed that negative sentiment was highest for daily life impact (580/666, 87.1%) and socioeconomic challenges (145/406, 35.8%), whereas the education and support topic exhibited more positive sentiment (250/544, 46%). Topic coherence scores ranged from 0.38 to 0.52, with the medical procedures topic showing the highest semantic coherence. Intertopic distance mapping via multidimensional scaling revealed conceptual relationships between identified themes, with lifestyle impact and socioeconomic challenges clustering closely. Our longitudinal analysis demonstrated evolving discourse patterns, with technology-related discussions increasing by 24% in recent years, whereas financial concerns remained consistently prominent. ConclusionsThis study provides a comprehensive, data-driven understanding of the complex lived experiences of patients undergoing dialysis shared on social media. The findings underscore the need for more holistic, patient-centered care models and policies that address the multidimensional challenges illuminated by patients’ voices.https://www.jmir.org/2025/1/e70128
spellingShingle Ravi Shankar
Qian Xu
Anjali Bundele
Patient Voices in Dialysis Care: Sentiment Analysis and Topic Modeling Study of Social Media Discourse
Journal of Medical Internet Research
title Patient Voices in Dialysis Care: Sentiment Analysis and Topic Modeling Study of Social Media Discourse
title_full Patient Voices in Dialysis Care: Sentiment Analysis and Topic Modeling Study of Social Media Discourse
title_fullStr Patient Voices in Dialysis Care: Sentiment Analysis and Topic Modeling Study of Social Media Discourse
title_full_unstemmed Patient Voices in Dialysis Care: Sentiment Analysis and Topic Modeling Study of Social Media Discourse
title_short Patient Voices in Dialysis Care: Sentiment Analysis and Topic Modeling Study of Social Media Discourse
title_sort patient voices in dialysis care sentiment analysis and topic modeling study of social media discourse
url https://www.jmir.org/2025/1/e70128
work_keys_str_mv AT ravishankar patientvoicesindialysiscaresentimentanalysisandtopicmodelingstudyofsocialmediadiscourse
AT qianxu patientvoicesindialysiscaresentimentanalysisandtopicmodelingstudyofsocialmediadiscourse
AT anjalibundele patientvoicesindialysiscaresentimentanalysisandtopicmodelingstudyofsocialmediadiscourse