A scoping review of robustness concepts for machine learning in healthcare

Abstract While machine learning (ML)-based solutions—often referred to as artificial intelligence (AI) solutions—have demonstrated comparable or superior performance to human experts across various healthcare applications, their vulnerability to perturbations and stability to variations due to new e...

Full description

Saved in:
Bibliographic Details
Main Authors: Alan Balendran, Céline Beji, Florie Bouvier, Ottavio Khalifa, Theodoros Evgeniou, Philippe Ravaud, Raphaël Porcher
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:npj Digital Medicine
Online Access:https://doi.org/10.1038/s41746-024-01420-1
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract While machine learning (ML)-based solutions—often referred to as artificial intelligence (AI) solutions—have demonstrated comparable or superior performance to human experts across various healthcare applications, their vulnerability to perturbations and stability to variations due to new environments—essentially, their robustness—remains ambiguous and often overlooked. In this review, we aimed to identify the types of robustness addressed in the literature for ML models in healthcare. A total of 274 eligible records were retrieved from PubMed, Web of Science, IEEE Xplore, and additional sources. Eight general concepts of robustness emerged. Furthermore, an analysis of those concepts across types of data and types of predictive models revealed that the concepts were differently addressed. Our findings offer valuable insights for stakeholders seeking to understand and navigate the robustness of machine learning models during their development, validation, and deployment in healthcare settings, where interpretation of robustness may vary.
ISSN:2398-6352