CoWIN twitter dataset: A comprehensive collection of public discourse on India's COVID-19 vaccination platform (Mendeley Data)

The CoWIN Twitter Dataset offers a wide-ranging collection of public opinions on India's COVID-19 vaccination platform CoWIN. The raw dataset has 635,000 tweets that mention “cowin,” collected over the period of January to December 2021. The dataset was extracted by employing the Twitter Academ...

Bibliographic Details
Main Authors: Shubham Mittal, Swarnalakshmi Umamaheswaran
Format: Article
Language: English
Published: Elsevier 2025-02-01
Series: Data in Brief
Subjects: CoWIN; COVID-19; Social media analytics; Digital health; Sentiment analysis; Health informatics
Online Access: http://www.sciencedirect.com/science/article/pii/S2352340924012149
author Shubham Mittal
Swarnalakshmi Umamaheswaran
collection DOAJ
description The CoWIN Twitter Dataset offers a wide-ranging collection of public opinions on India's COVID-19 vaccination platform, CoWIN. The raw dataset has 635,000 tweets that mention “cowin,” collected from January to December 2021. The dataset was extracted using the Twitter Academic API. In addition to the raw data, it also includes a cleaned and processed set of 419,409 English tweets and a labelled subset with sentiment analysis. The raw data file has tweet details such as ID, text, timestamp, user ID, and language. The processed dataset is free of URLs, hashtags, and other noise, and adds month and category groupings. Finally, the labelled dataset gives positive or negative sentiment classifications for the relevant tweets. This dataset enables researchers to analyse themes and sentiments related to India's vaccination administration. It can help policymakers gain insights into issues related to large-scale health initiatives and digital health systems. The mix of languages in the data also makes it useful for language processing research.
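The cleaning steps named in the description (English-only filtering, removal of URLs and hashtags, month grouping) can be sketched as below. This is a minimal illustration, not the authors' pipeline: the field names `id`, `text`, `created_at`, and `lang`, the ISO 8601 timestamp format, the decision to drop whole hashtags and @mentions, and the function names are all assumptions made for the example.

```python
import re
from datetime import datetime

# Patterns for the noise named in the description. Treating @mentions as
# "other noise" and removing hashtags entirely are assumptions.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
HASHTAG_RE = re.compile(r"#\w+")
MENTION_RE = re.compile(r"@\w+")


def clean_tweet(text):
    """Strip URLs, hashtags, and mentions, then collapse whitespace."""
    for pattern in (URL_RE, HASHTAG_RE, MENTION_RE):
        text = pattern.sub(" ", text)
    return " ".join(text.split())


def month_label(timestamp):
    """Return a YYYY-MM label from an ISO 8601 timestamp (assumed format)."""
    parsed = datetime.fromisoformat(timestamp.replace("Z", "+00:00"))
    return parsed.strftime("%Y-%m")


def process(rows):
    """Keep English tweets and attach cleaned text and a month label."""
    processed = []
    for row in rows:
        if row.get("lang") != "en":  # hypothetical language field
            continue
        processed.append({
            "id": row["id"],
            "text": clean_tweet(row["text"]),
            "month": month_label(row["created_at"]),
        })
    return processed


if __name__ == "__main__":
    sample = [
        {"id": "1", "text": "Booked my slot on #CoWIN https://t.co/abc finally!",
         "created_at": "2021-05-01T10:15:00Z", "lang": "en"},
        {"id": "2", "text": "cowin", "created_at": "2021-06-02T08:00:00Z", "lang": "hi"},
    ]
    for item in process(sample):
        print(item)
```

The category groupings and sentiment labels mentioned in the description are not reproduced here, since the record does not say how they were assigned.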
format Article
id doaj-art-554547fb6da84c8091a523ccf7e4240b
institution Kabale University
issn 2352-3409
language English
publishDate 2025-02-01
publisher Elsevier
record_format Article
series Data in Brief
spelling doaj-art-554547fb6da84c8091a523ccf7e4240b; 2025-01-31T05:11:41Z; eng; Elsevier; Data in Brief; 2352-3409; 2025-02-01; 58; 111252
CoWIN twitter dataset: A comprehensive collection of public discourse on India's COVID-19 vaccination platform (Mendeley Data)
Shubham Mittal (0); Swarnalakshmi Umamaheswaran (1)
(0) Symbiosis Institute of Business Management, Symbiosis International (Deemed University), Bengaluru 560100, India; Mckinsey, Gurugram 122018, India
(1) Symbiosis Institute of Business Management, Symbiosis International (Deemed University), Bengaluru 560100, India; Mckinsey, Gurugram 122018, India; Corresponding author at: Symbiosis Institute of Business Management, Symbiosis International (Deemed University), Bengaluru 560100, India.
http://www.sciencedirect.com/science/article/pii/S2352340924012149
CoWIN; COVID-19; Social media analytics; Digital health; Sentiment analysis; Health informatics
title CoWIN twitter dataset: A comprehensive collection of public discourse on India's COVID-19 vaccination platform (Mendeley Data)
topic CoWIN
COVID-19
Social media analytics
Digital health
Sentiment analysis
Health informatics
url http://www.sciencedirect.com/science/article/pii/S2352340924012149