CoWIN twitter dataset: A comprehensive collection of public discourse on India's COVID-19 vaccination platformMendeley Data
The CoWIN Twitter Dataset offers a wide-ranging collection of public opinions on India's COVID-19 vaccination platform CoWIN. The raw dataset has 635,000 tweets that mention “cowin,” collected over the period of January to December 2021. The dataset was extracted by employing the Twitter Academ...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2025-02-01
|
Series: | Data in Brief |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2352340924012149 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832576471761158144 |
---|---|
author | Shubham Mittal Swarnalakshmi Umamaheswaran |
author_facet | Shubham Mittal Swarnalakshmi Umamaheswaran |
author_sort | Shubham Mittal |
collection | DOAJ |
description | The CoWIN Twitter Dataset offers a wide-ranging collection of public opinions on India's COVID-19 vaccination platform CoWIN. The raw dataset has 635,000 tweets that mention “cowin,” collected over the period of January to December 2021. The dataset was extracted by employing the Twitter Academic API. It addition to the raw data, it also included a cleaned and processed set of 419,409 English tweets, and a labeled subset with sentiment analysis. The raw data file has tweet details like ID, text, timestamp, user ID, and language. The processed dataset is devoid of URLs and hashtags and other noise, and also adds month and category groupings. Finally,the labelled dataset gives sentiment classifications of positive or negative the relevant tweets. This dataset enables researchers to analyse themes and sentiments related to India's vaccination administration. It can help policymakers gain insights around issues related to large-scale health initiatives and digital health systems. The mix of languages in the data also makes it useful for language processing research. |
format | Article |
id | doaj-art-554547fb6da84c8091a523ccf7e4240b |
institution | Kabale University |
issn | 2352-3409 |
language | English |
publishDate | 2025-02-01 |
publisher | Elsevier |
record_format | Article |
series | Data in Brief |
spelling | doaj-art-554547fb6da84c8091a523ccf7e4240b2025-01-31T05:11:41ZengElsevierData in Brief2352-34092025-02-0158111252CoWIN twitter dataset: A comprehensive collection of public discourse on India's COVID-19 vaccination platformMendeley DataShubham Mittal0Swarnalakshmi Umamaheswaran1Symbiosis Institute of Business Management, Symbiosis International (Deemed University), Bengaluru 560100, India; Mckinsey, Gurugram 122018, IndiaSymbiosis Institute of Business Management, Symbiosis International (Deemed University), Bengaluru 560100, India; Mckinsey, Gurugram 122018, India; Corresponding author at: Symbiosis Institute of Business Management, Symbiosis International (Deemed University), Bengaluru 560100, India.The CoWIN Twitter Dataset offers a wide-ranging collection of public opinions on India's COVID-19 vaccination platform CoWIN. The raw dataset has 635,000 tweets that mention “cowin,” collected over the period of January to December 2021. The dataset was extracted by employing the Twitter Academic API. It addition to the raw data, it also included a cleaned and processed set of 419,409 English tweets, and a labeled subset with sentiment analysis. The raw data file has tweet details like ID, text, timestamp, user ID, and language. The processed dataset is devoid of URLs and hashtags and other noise, and also adds month and category groupings. Finally,the labelled dataset gives sentiment classifications of positive or negative the relevant tweets. This dataset enables researchers to analyse themes and sentiments related to India's vaccination administration. It can help policymakers gain insights around issues related to large-scale health initiatives and digital health systems. The mix of languages in the data also makes it useful for language processing research.http://www.sciencedirect.com/science/article/pii/S2352340924012149CoWINCOVID-19Social media analyticsDigital healthSentiment analysisHealth informatics |
spellingShingle | Shubham Mittal Swarnalakshmi Umamaheswaran CoWIN twitter dataset: A comprehensive collection of public discourse on India's COVID-19 vaccination platformMendeley Data Data in Brief CoWIN COVID-19 Social media analytics Digital health Sentiment analysis Health informatics |
title | CoWIN twitter dataset: A comprehensive collection of public discourse on India's COVID-19 vaccination platformMendeley Data |
title_full | CoWIN twitter dataset: A comprehensive collection of public discourse on India's COVID-19 vaccination platformMendeley Data |
title_fullStr | CoWIN twitter dataset: A comprehensive collection of public discourse on India's COVID-19 vaccination platformMendeley Data |
title_full_unstemmed | CoWIN twitter dataset: A comprehensive collection of public discourse on India's COVID-19 vaccination platformMendeley Data |
title_short | CoWIN twitter dataset: A comprehensive collection of public discourse on India's COVID-19 vaccination platformMendeley Data |
title_sort | cowin twitter dataset a comprehensive collection of public discourse on india s covid 19 vaccination platformmendeley data |
topic | CoWIN COVID-19 Social media analytics Digital health Sentiment analysis Health informatics |
url | http://www.sciencedirect.com/science/article/pii/S2352340924012149 |
work_keys_str_mv | AT shubhammittal cowintwitterdatasetacomprehensivecollectionofpublicdiscourseonindiascovid19vaccinationplatformmendeleydata AT swarnalakshmiumamaheswaran cowintwitterdatasetacomprehensivecollectionofpublicdiscourseonindiascovid19vaccinationplatformmendeleydata |