A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRs

Abstract Rare diseases, affecting ~350 million people worldwide, pose significant challenges in clinical diagnosis due to the lack of experienced physicians and the complexity of differentiating between numerous rare diseases. To address these challenges, we introduce PhenoBrain, a fully automated a...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiaohao Mao, Yu Huang, Ye Jin, Lun Wang, Xuanzhong Chen, Honghong Liu, Xinglin Yang, Haopeng Xu, Xiaodong Luan, Ying Xiao, Siqin Feng, Jiahao Zhu, Xuegong Zhang, Rui Jiang, Shuyang Zhang, Ting Chen
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:npj Digital Medicine
Online Access:https://doi.org/10.1038/s41746-025-01452-1
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832571316160430080
author Xiaohao Mao
Yu Huang
Ye Jin
Lun Wang
Xuanzhong Chen
Honghong Liu
Xinglin Yang
Haopeng Xu
Xiaodong Luan
Ying Xiao
Siqin Feng
Jiahao Zhu
Xuegong Zhang
Rui Jiang
Shuyang Zhang
Ting Chen
author_facet Xiaohao Mao
Yu Huang
Ye Jin
Lun Wang
Xuanzhong Chen
Honghong Liu
Xinglin Yang
Haopeng Xu
Xiaodong Luan
Ying Xiao
Siqin Feng
Jiahao Zhu
Xuegong Zhang
Rui Jiang
Shuyang Zhang
Ting Chen
author_sort Xiaohao Mao
collection DOAJ
description Abstract Rare diseases, affecting ~350 million people worldwide, pose significant challenges in clinical diagnosis due to the lack of experienced physicians and the complexity of differentiating between numerous rare diseases. To address these challenges, we introduce PhenoBrain, a fully automated artificial intelligence pipeline. PhenoBrain utilizes a BERT-based natural language processing model to extract phenotypes from clinical texts in EHRs and employs five new diagnostic models for differential diagnoses of rare diseases. The AI system was developed and evaluated on diverse, multi-country rare disease datasets, comprising 2271 cases with 431 rare diseases. In 1936 test cases, PhenoBrain achieved an average predicted top-3 recall of 0.513 and a top-10 recall of 0.654, surpassing 13 leading prediction methods. In a human-computer study with 75 cases, PhenoBrain exhibited exceptional performance with a top-3 recall of 0.613 and a top-10 recall of 0.813, surpassing the performance of 50 specialist physicians and large language models like ChatGPT and GPT-4. Combining PhenoBrain’s predictions with specialists increased the top-3 recall to 0.768, demonstrating its potential to enhance diagnostic accuracy in clinical workflows.
format Article
id doaj-art-928ab9111ce24cde9c384c5d71a14586
institution Kabale University
issn 2398-6352
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series npj Digital Medicine
spelling doaj-art-928ab9111ce24cde9c384c5d71a145862025-02-02T12:43:41ZengNature Portfolionpj Digital Medicine2398-63522025-01-018111510.1038/s41746-025-01452-1A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRsXiaohao Mao0Yu Huang1Ye Jin2Lun Wang3Xuanzhong Chen4Honghong Liu5Xinglin Yang6Haopeng Xu7Xiaodong Luan8Ying Xiao9Siqin Feng10Jiahao Zhu11Xuegong Zhang12Rui Jiang13Shuyang Zhang14Ting Chen15Department of Computer Science and Technology & Institute for Artificial Intelligence & BNRist, Tsinghua UniversityDepartment of Computer Science and Technology & Institute for Artificial Intelligence & BNRist, Tsinghua UniversityMedical Research Center, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical CollegeDepartment of Internal Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical CollegeDepartment of Computer Science and Technology & Institute for Artificial Intelligence & BNRist, Tsinghua UniversityDepartment of Internal Medicine, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical CollegeDepartment of Cardiology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical CollegeNuffield Department of Medicine, University of OxfordState Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical CollegeDepartment of Geriatrics, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical CollegeDepartment of Cardiology, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical CollegeDepartment of Computer Science and Technology & Institute for Artificial Intelligence & BNRist, Tsinghua UniversityDepartment of Automation & BNRist, Tsinghua UniversityDepartment of Automation & BNRist, Tsinghua UniversityDepartment of Cardiology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical CollegeDepartment of Computer Science and Technology & Institute for Artificial Intelligence & BNRist, Tsinghua UniversityAbstract Rare diseases, affecting ~350 million people worldwide, pose significant challenges in clinical diagnosis due to the lack of experienced physicians and the complexity of differentiating between numerous rare diseases. To address these challenges, we introduce PhenoBrain, a fully automated artificial intelligence pipeline. PhenoBrain utilizes a BERT-based natural language processing model to extract phenotypes from clinical texts in EHRs and employs five new diagnostic models for differential diagnoses of rare diseases. The AI system was developed and evaluated on diverse, multi-country rare disease datasets, comprising 2271 cases with 431 rare diseases. In 1936 test cases, PhenoBrain achieved an average predicted top-3 recall of 0.513 and a top-10 recall of 0.654, surpassing 13 leading prediction methods. In a human-computer study with 75 cases, PhenoBrain exhibited exceptional performance with a top-3 recall of 0.613 and a top-10 recall of 0.813, surpassing the performance of 50 specialist physicians and large language models like ChatGPT and GPT-4. Combining PhenoBrain’s predictions with specialists increased the top-3 recall to 0.768, demonstrating its potential to enhance diagnostic accuracy in clinical workflows.https://doi.org/10.1038/s41746-025-01452-1
spellingShingle Xiaohao Mao
Yu Huang
Ye Jin
Lun Wang
Xuanzhong Chen
Honghong Liu
Xinglin Yang
Haopeng Xu
Xiaodong Luan
Ying Xiao
Siqin Feng
Jiahao Zhu
Xuegong Zhang
Rui Jiang
Shuyang Zhang
Ting Chen
A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRs
npj Digital Medicine
title A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRs
title_full A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRs
title_fullStr A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRs
title_full_unstemmed A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRs
title_short A phenotype-based AI pipeline outperforms human experts in differentially diagnosing rare diseases using EHRs
title_sort phenotype based ai pipeline outperforms human experts in differentially diagnosing rare diseases using ehrs
url https://doi.org/10.1038/s41746-025-01452-1
work_keys_str_mv AT xiaohaomao aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT yuhuang aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT yejin aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT lunwang aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT xuanzhongchen aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT honghongliu aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT xinglinyang aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT haopengxu aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT xiaodongluan aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT yingxiao aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT siqinfeng aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT jiahaozhu aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT xuegongzhang aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT ruijiang aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT shuyangzhang aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT tingchen aphenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT xiaohaomao phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT yuhuang phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT yejin phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT lunwang phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT xuanzhongchen phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT honghongliu phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT xinglinyang phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT haopengxu phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT xiaodongluan phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT yingxiao phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT siqinfeng phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT jiahaozhu phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT xuegongzhang phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT ruijiang phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT shuyangzhang phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs
AT tingchen phenotypebasedaipipelineoutperformshumanexpertsindifferentiallydiagnosingrarediseasesusingehrs