Estimating rare disease prevalence and costs in the USA: a cohort study approach using the Healthcare Cost Institute claims data

Objective The study capitalised on national insurance claims data to gather information on patient characteristics and associated costs to better understand the diagnosis and treatment of rare diseases (RDs).Materials and methods Data from the Healthcare Cost Institute (HCCI) data enclave were analy...

Full description

Saved in:
Bibliographic Details
Main Authors: Keith A Crandall, Christine M Cutillo, Ainslie Tisdale, Mahdi Baghbanzadeh, Reva L Stidd, Manpreet S Khural, Laurie J Hartman, Jeff Greenberg, Kevin B Zhang, Ali Rahnavard
Format: Article
Language:English
Published: BMJ Publishing Group 2024-04-01
Series:BMJ Public Health
Online Access:https://bmjpublichealth.bmj.com/content/2/1/e000346.full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832583182991491072
author Keith A Crandall
Christine M Cutillo
Ainslie Tisdale
Mahdi Baghbanzadeh
Reva L Stidd
Manpreet S Khural
Laurie J Hartman
Jeff Greenberg
Kevin B Zhang
Ali Rahnavard
author_facet Keith A Crandall
Christine M Cutillo
Ainslie Tisdale
Mahdi Baghbanzadeh
Reva L Stidd
Manpreet S Khural
Laurie J Hartman
Jeff Greenberg
Kevin B Zhang
Ali Rahnavard
author_sort Keith A Crandall
collection DOAJ
description Objective The study capitalised on national insurance claims data to gather information on patient characteristics and associated costs to better understand the diagnosis and treatment of rare diseases (RDs).Materials and methods Data from the Healthcare Cost Institute (HCCI) data enclave were analysed using R statistical software and filtered by the International Classification of Diseases, 10th edition (ICD-10), current procedural terminology codes and the National Drug Code associated with 14 RDs and disease-modifying therapy options. Data were aggregated by prevalence, costs, patient characteristics and effects of treatment modification.Results The prevalence and costs of RDs in the HCCI commercial claims database varied significantly across the USA and between urban and rural areas. Pharmacy costs increased when a new treatment was initiated, while non-pharmacy costs decreased.Discussion Prevalence and cost estimations are highly variable due to the small number of patients with RDs, and the lack of a national healthcare database limits inferences for such patient populations. Accurate assessments require a diverse population, which can likely be achieved by analysing multiple databases. RDs face challenges in prevalence estimation due to a lack of specific disease coding and a small patient population, compounded by issues like data standardisation and privacy concerns. Addressing these through improved data management in healthcare systems, increased research and education will lead to better diagnosis, care management and quality of life for patients with RD.Conclusion Data on patients with RD in the HCCI database were analysed for prevalence, costs, patient characteristics and treatment modification effects. Significant heterogeneity in each of these factors was found across RDs, geography and locality (eg, urban and rural). Building capabilities to use machine learning to accelerate the diagnosis of RDs would vastly improve with changes to healthcare data, such as standardising data input, linking databases, addressing privacy issues and assigning ICD-10 codes for all RDs, resulting in more robust data for RD analytics.
format Article
id doaj-art-a8d9db16d5af415998d3185910c758e9
institution Kabale University
issn 2753-4294
language English
publishDate 2024-04-01
publisher BMJ Publishing Group
record_format Article
series BMJ Public Health
spelling doaj-art-a8d9db16d5af415998d3185910c758e92025-01-29T00:35:10ZengBMJ Publishing GroupBMJ Public Health2753-42942024-04-012110.1136/bmjph-2023-000346Estimating rare disease prevalence and costs in the USA: a cohort study approach using the Healthcare Cost Institute claims dataKeith A Crandall0Christine M Cutillo1Ainslie Tisdale2Mahdi Baghbanzadeh3Reva L Stidd4Manpreet S Khural5Laurie J Hartman6Jeff Greenberg7Kevin B Zhang8Ali Rahnavard9Department of Biostatistics and Bioinformatics, The George Washington University, Washington, District of Columbia, USANational Center for Advancing Translational Sciences Office of Rare Diseases Research, Bethesda, Maryland, USANational Center for Advancing Translational Sciences Office of Rare Diseases Research, Bethesda, Maryland, USADepartment of Biostatistics and Bioinformatics, The George Washington University, Washington, District of Columbia, USACustomer Value Partners, Fairfax, Virginia, USACustomer Value Partners, Fairfax, Virginia, USACustomer Value Partners, Fairfax, Virginia, USACustomer Value Partners, Fairfax, Virginia, USACustomer Value Partners, Fairfax, Virginia, USADepartment of Biostatistics and Bioinformatics, The George Washington University, Washington, District of Columbia, USAObjective The study capitalised on national insurance claims data to gather information on patient characteristics and associated costs to better understand the diagnosis and treatment of rare diseases (RDs).Materials and methods Data from the Healthcare Cost Institute (HCCI) data enclave were analysed using R statistical software and filtered by the International Classification of Diseases, 10th edition (ICD-10), current procedural terminology codes and the National Drug Code associated with 14 RDs and disease-modifying therapy options. Data were aggregated by prevalence, costs, patient characteristics and effects of treatment modification.Results The prevalence and costs of RDs in the HCCI commercial claims database varied significantly across the USA and between urban and rural areas. Pharmacy costs increased when a new treatment was initiated, while non-pharmacy costs decreased.Discussion Prevalence and cost estimations are highly variable due to the small number of patients with RDs, and the lack of a national healthcare database limits inferences for such patient populations. Accurate assessments require a diverse population, which can likely be achieved by analysing multiple databases. RDs face challenges in prevalence estimation due to a lack of specific disease coding and a small patient population, compounded by issues like data standardisation and privacy concerns. Addressing these through improved data management in healthcare systems, increased research and education will lead to better diagnosis, care management and quality of life for patients with RD.Conclusion Data on patients with RD in the HCCI database were analysed for prevalence, costs, patient characteristics and treatment modification effects. Significant heterogeneity in each of these factors was found across RDs, geography and locality (eg, urban and rural). Building capabilities to use machine learning to accelerate the diagnosis of RDs would vastly improve with changes to healthcare data, such as standardising data input, linking databases, addressing privacy issues and assigning ICD-10 codes for all RDs, resulting in more robust data for RD analytics.https://bmjpublichealth.bmj.com/content/2/1/e000346.full
spellingShingle Keith A Crandall
Christine M Cutillo
Ainslie Tisdale
Mahdi Baghbanzadeh
Reva L Stidd
Manpreet S Khural
Laurie J Hartman
Jeff Greenberg
Kevin B Zhang
Ali Rahnavard
Estimating rare disease prevalence and costs in the USA: a cohort study approach using the Healthcare Cost Institute claims data
BMJ Public Health
title Estimating rare disease prevalence and costs in the USA: a cohort study approach using the Healthcare Cost Institute claims data
title_full Estimating rare disease prevalence and costs in the USA: a cohort study approach using the Healthcare Cost Institute claims data
title_fullStr Estimating rare disease prevalence and costs in the USA: a cohort study approach using the Healthcare Cost Institute claims data
title_full_unstemmed Estimating rare disease prevalence and costs in the USA: a cohort study approach using the Healthcare Cost Institute claims data
title_short Estimating rare disease prevalence and costs in the USA: a cohort study approach using the Healthcare Cost Institute claims data
title_sort estimating rare disease prevalence and costs in the usa a cohort study approach using the healthcare cost institute claims data
url https://bmjpublichealth.bmj.com/content/2/1/e000346.full
work_keys_str_mv AT keithacrandall estimatingrarediseaseprevalenceandcostsintheusaacohortstudyapproachusingthehealthcarecostinstituteclaimsdata
AT christinemcutillo estimatingrarediseaseprevalenceandcostsintheusaacohortstudyapproachusingthehealthcarecostinstituteclaimsdata
AT ainslietisdale estimatingrarediseaseprevalenceandcostsintheusaacohortstudyapproachusingthehealthcarecostinstituteclaimsdata
AT mahdibaghbanzadeh estimatingrarediseaseprevalenceandcostsintheusaacohortstudyapproachusingthehealthcarecostinstituteclaimsdata
AT revalstidd estimatingrarediseaseprevalenceandcostsintheusaacohortstudyapproachusingthehealthcarecostinstituteclaimsdata
AT manpreetskhural estimatingrarediseaseprevalenceandcostsintheusaacohortstudyapproachusingthehealthcarecostinstituteclaimsdata
AT lauriejhartman estimatingrarediseaseprevalenceandcostsintheusaacohortstudyapproachusingthehealthcarecostinstituteclaimsdata
AT jeffgreenberg estimatingrarediseaseprevalenceandcostsintheusaacohortstudyapproachusingthehealthcarecostinstituteclaimsdata
AT kevinbzhang estimatingrarediseaseprevalenceandcostsintheusaacohortstudyapproachusingthehealthcarecostinstituteclaimsdata
AT alirahnavard estimatingrarediseaseprevalenceandcostsintheusaacohortstudyapproachusingthehealthcarecostinstituteclaimsdata