Rapid identification and phenotyping of nonalcoholic fatty liver disease patients using a machine‐based approach in diverse healthcare systems

Abstract Nonalcoholic fatty liver disease (NAFLD) is the most common global cause of chronic liver disease and remains under‐recognized within healthcare systems. Therapeutic interventions are rapidly advancing for its inflammatory phenotype, nonalcoholic steatohepatitis (NASH) at all stages of dise...

Full description

Saved in:
Bibliographic Details
Main Authors: Anna O. Basile, Anurag Verma, Leigh Anne Tang, Marina Serper, Andrew Scanga, Ava Farrell, Brittney Destin, Rotonya M. Carr, Anuli Anyanwu‐Ofili, Gunaretnam Rajagopal, Abraham Krikhely, Marc Bessler, Muredach P. Reilly, Marylyn D. Ritchie, Nicholas P. Tatonetti, Julia Wattacheril
Format: Article
Language:English
Published: Wiley 2025-01-01
Series:Clinical and Translational Science
Online Access:https://doi.org/10.1111/cts.70105
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832589801510928384
author Anna O. Basile
Anurag Verma
Leigh Anne Tang
Marina Serper
Andrew Scanga
Ava Farrell
Brittney Destin
Rotonya M. Carr
Anuli Anyanwu‐Ofili
Gunaretnam Rajagopal
Abraham Krikhely
Marc Bessler
Muredach P. Reilly
Marylyn D. Ritchie
Nicholas P. Tatonetti
Julia Wattacheril
author_facet Anna O. Basile
Anurag Verma
Leigh Anne Tang
Marina Serper
Andrew Scanga
Ava Farrell
Brittney Destin
Rotonya M. Carr
Anuli Anyanwu‐Ofili
Gunaretnam Rajagopal
Abraham Krikhely
Marc Bessler
Muredach P. Reilly
Marylyn D. Ritchie
Nicholas P. Tatonetti
Julia Wattacheril
author_sort Anna O. Basile
collection DOAJ
description Abstract Nonalcoholic fatty liver disease (NAFLD) is the most common global cause of chronic liver disease and remains under‐recognized within healthcare systems. Therapeutic interventions are rapidly advancing for its inflammatory phenotype, nonalcoholic steatohepatitis (NASH) at all stages of disease. Diagnosis codes alone fail to recognize and stratify at‐risk patients accurately. Our work aims to rapidly identify NAFLD patients within large electronic health record (EHR) databases for automated stratification and targeted intervention based on clinically relevant phenotypes. We present a rule‐based phenotyping algorithm for efficient identification of NAFLD patients developed using EHRs from 6.4 million patients at Columbia University Irving Medical Center (CUIMC) and validated at two independent healthcare centers. The algorithm uses the Observational Medical Outcomes Partnership (OMOP) Common Data Model and queries structured and unstructured data elements, including diagnosis codes, laboratory measurements, and radiology and pathology modalities. Our approach identified 16,006 CUIMC NAFLD patients, 10,753 (67%) previously unidentifiable by NAFLD diagnosis codes. Fibrosis scoring on patients without histology identified 943 subjects with scores indicative of advanced fibrosis (FIB‐4, APRI, NAFLD–FS). The algorithm was validated at two independent healthcare systems, University of Pennsylvania Health System (UPHS) and Vanderbilt Medical Center (VUMC), where 20,779 and 19,575 NAFLD patients were identified, respectively. Clinical chart review identified a high positive predictive value (PPV) across all healthcare systems: 91% at CUIMC, 75% at UPHS, and 85% at VUMC, and a sensitivity of 79.6%. Our rule‐based algorithm provides an accurate, automated approach for rapidly identifying, stratifying, and sub‐phenotyping NAFLD patients within a large EHR system.
format Article
id doaj-art-db47e59a8ad34622bbc94a05869d9747
institution Kabale University
issn 1752-8054
1752-8062
language English
publishDate 2025-01-01
publisher Wiley
record_format Article
series Clinical and Translational Science
spelling doaj-art-db47e59a8ad34622bbc94a05869d97472025-01-24T08:17:46ZengWileyClinical and Translational Science1752-80541752-80622025-01-01181n/an/a10.1111/cts.70105Rapid identification and phenotyping of nonalcoholic fatty liver disease patients using a machine‐based approach in diverse healthcare systemsAnna O. Basile0Anurag Verma1Leigh Anne Tang2Marina Serper3Andrew Scanga4Ava Farrell5Brittney Destin6Rotonya M. Carr7Anuli Anyanwu‐Ofili8Gunaretnam Rajagopal9Abraham Krikhely10Marc Bessler11Muredach P. Reilly12Marylyn D. Ritchie13Nicholas P. Tatonetti14Julia Wattacheril15Department of Biomedical Informatics Columbia University New York New York USADivision of Translational Medicine and Human Genetics, Department of Medicine, Institute for Biomedical Informatics University of Pennsylvania Perelman School of Medicine Philadelphia Pennsylvania USADepartment of Biomedical Informatics Vanderbilt University Medical Center Nashville Tennessee USADivision of Gastroenterology and Hepatology University of Pennsylvania Perelman School of Medicine Philadelphia Pennsylvania USADepartment of Medicine Vanderbilt University Medical Center Nashville Tennessee USADivision of Critical Care, Department of Pediatrics New York Presbyterian Morgan Stanley Children's Hospital of New York New York New York USADivision of Metabolic and Bariatric Surgery, Department of Surgery Columbia University Irving Medical Center New York New York USADivision of Gastroenterology, Department of Medicine University of Washington Seattle Washington USAJohnson & Johnson Innovative Medicine Spring House Pennsylvania USAJohnson & Johnson Innovative Medicine Spring House Pennsylvania USADivision of Metabolic and Bariatric Surgery, Department of Surgery Columbia University Irving Medical Center New York New York USADivision of Metabolic and Bariatric Surgery, Department of Surgery Columbia University Irving Medical Center New York New York USAIrving Institute for Clinical and Translational Research Columbia University New York New York USADepartment of Genetics, Institute for Biomedical Informatics University of Pennsylvania Perelman School of Medicine Philadelphia Pennsylvania USADepartment of Biomedical Informatics Columbia University New York New York USADivision of Digestive and Liver Diseases, Department of Medicine, Center for Liver Disease and Transplantation Columbia University Irving Medical Center New York New York USAAbstract Nonalcoholic fatty liver disease (NAFLD) is the most common global cause of chronic liver disease and remains under‐recognized within healthcare systems. Therapeutic interventions are rapidly advancing for its inflammatory phenotype, nonalcoholic steatohepatitis (NASH) at all stages of disease. Diagnosis codes alone fail to recognize and stratify at‐risk patients accurately. Our work aims to rapidly identify NAFLD patients within large electronic health record (EHR) databases for automated stratification and targeted intervention based on clinically relevant phenotypes. We present a rule‐based phenotyping algorithm for efficient identification of NAFLD patients developed using EHRs from 6.4 million patients at Columbia University Irving Medical Center (CUIMC) and validated at two independent healthcare centers. The algorithm uses the Observational Medical Outcomes Partnership (OMOP) Common Data Model and queries structured and unstructured data elements, including diagnosis codes, laboratory measurements, and radiology and pathology modalities. Our approach identified 16,006 CUIMC NAFLD patients, 10,753 (67%) previously unidentifiable by NAFLD diagnosis codes. Fibrosis scoring on patients without histology identified 943 subjects with scores indicative of advanced fibrosis (FIB‐4, APRI, NAFLD–FS). The algorithm was validated at two independent healthcare systems, University of Pennsylvania Health System (UPHS) and Vanderbilt Medical Center (VUMC), where 20,779 and 19,575 NAFLD patients were identified, respectively. Clinical chart review identified a high positive predictive value (PPV) across all healthcare systems: 91% at CUIMC, 75% at UPHS, and 85% at VUMC, and a sensitivity of 79.6%. Our rule‐based algorithm provides an accurate, automated approach for rapidly identifying, stratifying, and sub‐phenotyping NAFLD patients within a large EHR system.https://doi.org/10.1111/cts.70105
spellingShingle Anna O. Basile
Anurag Verma
Leigh Anne Tang
Marina Serper
Andrew Scanga
Ava Farrell
Brittney Destin
Rotonya M. Carr
Anuli Anyanwu‐Ofili
Gunaretnam Rajagopal
Abraham Krikhely
Marc Bessler
Muredach P. Reilly
Marylyn D. Ritchie
Nicholas P. Tatonetti
Julia Wattacheril
Rapid identification and phenotyping of nonalcoholic fatty liver disease patients using a machine‐based approach in diverse healthcare systems
Clinical and Translational Science
title Rapid identification and phenotyping of nonalcoholic fatty liver disease patients using a machine‐based approach in diverse healthcare systems
title_full Rapid identification and phenotyping of nonalcoholic fatty liver disease patients using a machine‐based approach in diverse healthcare systems
title_fullStr Rapid identification and phenotyping of nonalcoholic fatty liver disease patients using a machine‐based approach in diverse healthcare systems
title_full_unstemmed Rapid identification and phenotyping of nonalcoholic fatty liver disease patients using a machine‐based approach in diverse healthcare systems
title_short Rapid identification and phenotyping of nonalcoholic fatty liver disease patients using a machine‐based approach in diverse healthcare systems
title_sort rapid identification and phenotyping of nonalcoholic fatty liver disease patients using a machine based approach in diverse healthcare systems
url https://doi.org/10.1111/cts.70105
work_keys_str_mv AT annaobasile rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT anuragverma rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT leighannetang rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT marinaserper rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT andrewscanga rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT avafarrell rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT brittneydestin rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT rotonyamcarr rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT anulianyanwuofili rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT gunaretnamrajagopal rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT abrahamkrikhely rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT marcbessler rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT muredachpreilly rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT marylyndritchie rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT nicholasptatonetti rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems
AT juliawattacheril rapididentificationandphenotypingofnonalcoholicfattyliverdiseasepatientsusingamachinebasedapproachindiversehealthcaresystems