Harnessing Big Heterogeneous Data to Evaluate the Potential Impact of HIV Responses Among Key Populations in Sub-Saharan Africa: Protocol for the Boloka Data Repository Initiative

BackgroundIn South Africa, there is no centralized HIV surveillance system where key populations (KPs) data, including gay men and other men who have sex with men, female sex workers, transgender persons, people who use drugs, and incarcerated persons, are stored in South Afr...

Full description

Saved in:
Bibliographic Details
Main Authors: Refilwe Nancy Phaswana Mafuya, Edith Phalane, Amrita Rao, Kalai Willis, Katherine Rucinski, K Alida Voet, Amal Abdulrahman, Claris Siyamayambo, Betty Sebati, Mohlago Seloka, Musa Jaiteh, Lerato Lucia Olifant, Katharine Journeay, Haley Sisel, Xiaoming Li, Bankole Olatosi, Neset Hikmet, Prashant Duhoon, Francois Wolmarans, Yegnanew A Shiferaw, Lifutso Motsieloa, Mashudu Rampilo, Stefan Baral
Format: Article
Language:English
Published: JMIR Publications 2025-01-01
Series:JMIR Research Protocols
Online Access:https://www.researchprotocols.org/2025/1/e63583
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832591176507588608
author Refilwe Nancy Phaswana Mafuya
Edith Phalane
Amrita Rao
Kalai Willis
Katherine Rucinski
K Alida Voet
Amal Abdulrahman
Claris Siyamayambo
Betty Sebati
Mohlago Seloka
Musa Jaiteh
Lerato Lucia Olifant
Katharine Journeay
Haley Sisel
Xiaoming Li
Bankole Olatosi
Neset Hikmet
Prashant Duhoon
Francois Wolmarans
Yegnanew A Shiferaw
Lifutso Motsieloa
Mashudu Rampilo
Stefan Baral
author_facet Refilwe Nancy Phaswana Mafuya
Edith Phalane
Amrita Rao
Kalai Willis
Katherine Rucinski
K Alida Voet
Amal Abdulrahman
Claris Siyamayambo
Betty Sebati
Mohlago Seloka
Musa Jaiteh
Lerato Lucia Olifant
Katharine Journeay
Haley Sisel
Xiaoming Li
Bankole Olatosi
Neset Hikmet
Prashant Duhoon
Francois Wolmarans
Yegnanew A Shiferaw
Lifutso Motsieloa
Mashudu Rampilo
Stefan Baral
author_sort Refilwe Nancy Phaswana Mafuya
collection DOAJ
description BackgroundIn South Africa, there is no centralized HIV surveillance system where key populations (KPs) data, including gay men and other men who have sex with men, female sex workers, transgender persons, people who use drugs, and incarcerated persons, are stored in South Africa despite being on higher risk of HIV acquisition and transmission than the general population. Data on KPs are being collected on a smaller scale by numerous stakeholders and managed in silos. There exists an opportunity to harness a variety of data, such as empirical, contextual, observational, and programmatic data, for evaluating the potential impact of HIV responses among KPs in South Africa. ObjectiveThis study aimed to leverage and harness big heterogeneous data on HIV among KPs and harmonize and analyze it to inform a targeted HIV response for greater impact in Sub-Saharan Africa. MethodsThe Boloka data repository initiative has 5 stages. There will be engagement of a wide range of stakeholders to facilitate the acquisition of data (stage 1). Through these engagements, different data types will be collated (stage 2). The data will be filtered and screened to enable high-quality analyses (stage 3). The collated data will be stored in the Boloka data repository (stage 4). The Boloka data repository will be made accessible to stakeholders and authorized users (stage 5). ResultsThe protocol was funded by the South African Medical Research Council following external peer reviews (December 2022). The study received initial ethics approval (May 2022), renewal (June 2023), and amendment (July 2024) from the University of Johannesburg (UJ) Research Ethics Committee. The research team has been recruited, onboarded, and received non–web-based internet ethics training (January 2023). A list of current and potential data partners has been compiled (January 2023 to date). Data sharing or user agreements have been signed with several data partners (August 2023 to date). Survey and routine data have been and are being secured (January 5, 2023). In (September 2024) we received Ghana Men Study data. The data transfer agreement between the Pan African Centre for Epidemics Research and the Perinatal HIV Research Unit was finalized (October 2024), and we are anticipating receiving data by (December 2024). In total, 7 abstracts are underway, with 1 abstract completed the analysis and expected to submit the full article to the peer-reviewed journal in early January 2024. As of March 2025, we expect to submit the remaining 6 full articles. ConclusionsA truly “complete” data infrastructure that systematically and rigorously integrates diverse data for KPs will not only improve our understanding of local epidemics but will also improve HIV interventions and policies. Furthermore, it will inform future research directions and become an incredible institutional mechanism for epidemiological and public health training in South Africa and Sub-Saharan Africa. International Registered Report Identifier (IRRID)DERR1-10.2196/63583
format Article
id doaj-art-905d10b509e14bf9af572495b15dcbeb
institution Kabale University
issn 1929-0748
language English
publishDate 2025-01-01
publisher JMIR Publications
record_format Article
series JMIR Research Protocols
spelling doaj-art-905d10b509e14bf9af572495b15dcbeb2025-01-22T21:00:59ZengJMIR PublicationsJMIR Research Protocols1929-07482025-01-0114e6358310.2196/63583Harnessing Big Heterogeneous Data to Evaluate the Potential Impact of HIV Responses Among Key Populations in Sub-Saharan Africa: Protocol for the Boloka Data Repository InitiativeRefilwe Nancy Phaswana Mafuyahttps://orcid.org/0000-0001-9387-0432Edith Phalanehttps://orcid.org/0000-0001-6128-2337Amrita Raohttps://orcid.org/0000-0002-9596-2418Kalai Willishttps://orcid.org/0000-0003-0157-0593Katherine Rucinskihttps://orcid.org/0000-0002-9858-5953K Alida Voethttps://orcid.org/0009-0003-2278-2314Amal Abdulrahmanhttps://orcid.org/0009-0000-3468-0970Claris Siyamayambohttps://orcid.org/0000-0002-4884-681XBetty Sebatihttps://orcid.org/0000-0001-9236-9443Mohlago Selokahttps://orcid.org/0000-0001-5614-0078Musa Jaitehhttps://orcid.org/0000-0001-6920-9919Lerato Lucia Olifanthttps://orcid.org/0000-0002-4564-0760Katharine Journeayhttps://orcid.org/0009-0008-7597-966XHaley Siselhttps://orcid.org/0000-0002-0660-0889Xiaoming Lihttps://orcid.org/0000-0002-5555-9034Bankole Olatosihttps://orcid.org/0000-0002-8295-8735Neset Hikmethttps://orcid.org/0000-0002-0777-3132Prashant Duhoonhttps://orcid.org/0009-0003-2970-5565Francois Wolmaranshttps://orcid.org/0009-0008-5565-3670Yegnanew A Shiferawhttps://orcid.org/0000-0002-2422-4768Lifutso Motsieloahttps://orcid.org/0000-0001-6741-9760Mashudu Rampilohttps://orcid.org/0009-0004-3570-0602Stefan Baralhttps://orcid.org/0000-0002-5482-2419 BackgroundIn South Africa, there is no centralized HIV surveillance system where key populations (KPs) data, including gay men and other men who have sex with men, female sex workers, transgender persons, people who use drugs, and incarcerated persons, are stored in South Africa despite being on higher risk of HIV acquisition and transmission than the general population. Data on KPs are being collected on a smaller scale by numerous stakeholders and managed in silos. There exists an opportunity to harness a variety of data, such as empirical, contextual, observational, and programmatic data, for evaluating the potential impact of HIV responses among KPs in South Africa. ObjectiveThis study aimed to leverage and harness big heterogeneous data on HIV among KPs and harmonize and analyze it to inform a targeted HIV response for greater impact in Sub-Saharan Africa. MethodsThe Boloka data repository initiative has 5 stages. There will be engagement of a wide range of stakeholders to facilitate the acquisition of data (stage 1). Through these engagements, different data types will be collated (stage 2). The data will be filtered and screened to enable high-quality analyses (stage 3). The collated data will be stored in the Boloka data repository (stage 4). The Boloka data repository will be made accessible to stakeholders and authorized users (stage 5). ResultsThe protocol was funded by the South African Medical Research Council following external peer reviews (December 2022). The study received initial ethics approval (May 2022), renewal (June 2023), and amendment (July 2024) from the University of Johannesburg (UJ) Research Ethics Committee. The research team has been recruited, onboarded, and received non–web-based internet ethics training (January 2023). A list of current and potential data partners has been compiled (January 2023 to date). Data sharing or user agreements have been signed with several data partners (August 2023 to date). Survey and routine data have been and are being secured (January 5, 2023). In (September 2024) we received Ghana Men Study data. The data transfer agreement between the Pan African Centre for Epidemics Research and the Perinatal HIV Research Unit was finalized (October 2024), and we are anticipating receiving data by (December 2024). In total, 7 abstracts are underway, with 1 abstract completed the analysis and expected to submit the full article to the peer-reviewed journal in early January 2024. As of March 2025, we expect to submit the remaining 6 full articles. ConclusionsA truly “complete” data infrastructure that systematically and rigorously integrates diverse data for KPs will not only improve our understanding of local epidemics but will also improve HIV interventions and policies. Furthermore, it will inform future research directions and become an incredible institutional mechanism for epidemiological and public health training in South Africa and Sub-Saharan Africa. International Registered Report Identifier (IRRID)DERR1-10.2196/63583https://www.researchprotocols.org/2025/1/e63583
spellingShingle Refilwe Nancy Phaswana Mafuya
Edith Phalane
Amrita Rao
Kalai Willis
Katherine Rucinski
K Alida Voet
Amal Abdulrahman
Claris Siyamayambo
Betty Sebati
Mohlago Seloka
Musa Jaiteh
Lerato Lucia Olifant
Katharine Journeay
Haley Sisel
Xiaoming Li
Bankole Olatosi
Neset Hikmet
Prashant Duhoon
Francois Wolmarans
Yegnanew A Shiferaw
Lifutso Motsieloa
Mashudu Rampilo
Stefan Baral
Harnessing Big Heterogeneous Data to Evaluate the Potential Impact of HIV Responses Among Key Populations in Sub-Saharan Africa: Protocol for the Boloka Data Repository Initiative
JMIR Research Protocols
title Harnessing Big Heterogeneous Data to Evaluate the Potential Impact of HIV Responses Among Key Populations in Sub-Saharan Africa: Protocol for the Boloka Data Repository Initiative
title_full Harnessing Big Heterogeneous Data to Evaluate the Potential Impact of HIV Responses Among Key Populations in Sub-Saharan Africa: Protocol for the Boloka Data Repository Initiative
title_fullStr Harnessing Big Heterogeneous Data to Evaluate the Potential Impact of HIV Responses Among Key Populations in Sub-Saharan Africa: Protocol for the Boloka Data Repository Initiative
title_full_unstemmed Harnessing Big Heterogeneous Data to Evaluate the Potential Impact of HIV Responses Among Key Populations in Sub-Saharan Africa: Protocol for the Boloka Data Repository Initiative
title_short Harnessing Big Heterogeneous Data to Evaluate the Potential Impact of HIV Responses Among Key Populations in Sub-Saharan Africa: Protocol for the Boloka Data Repository Initiative
title_sort harnessing big heterogeneous data to evaluate the potential impact of hiv responses among key populations in sub saharan africa protocol for the boloka data repository initiative
url https://www.researchprotocols.org/2025/1/e63583
work_keys_str_mv AT refilwenancyphaswanamafuya harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT edithphalane harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT amritarao harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT kalaiwillis harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT katherinerucinski harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT kalidavoet harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT amalabdulrahman harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT clarissiyamayambo harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT bettysebati harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT mohlagoseloka harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT musajaiteh harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT leratoluciaolifant harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT katharinejourneay harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT haleysisel harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT xiaomingli harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT bankoleolatosi harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT nesethikmet harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT prashantduhoon harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT francoiswolmarans harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT yegnanewashiferaw harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT lifutsomotsieloa harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT mashudurampilo harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative
AT stefanbaral harnessingbigheterogeneousdatatoevaluatethepotentialimpactofhivresponsesamongkeypopulationsinsubsaharanafricaprotocolforthebolokadatarepositoryinitiative