Clinicians must participate in the development of multimodal AI

Bibliographic Details
Main Authors: Christopher R.S. Banerji, Aroon Bhardwaj Shah, Ben Dabson, Tapabrata Chakraborti, Vicky Hellon, Chris Harbron, Ben D. MacArthur
Format: Article
Language: English
Published: Elsevier 2025-06-01
Series: EClinicalMedicine
Online Access: http://www.sciencedirect.com/science/article/pii/S2589537025001841
Description
Summary: Multimodal artificial intelligence (AI) is a powerful new technological advance, capable of simultaneously learning from diverse data types, such as text, images, video, and audio. Because clinical decisions are usually based on information from multiple sources, multimodal AI has the potential to significantly improve clinical practice. However, unlike most developed multimodal AI workflows, clinical medicine is both a dynamic and interventional process in which the clinician continually learns about the patient's health and acts accordingly as data is collected. In this article we argue that multimodal clinical AI must be fully attuned to the particular challenges and constraints of the clinic, and clinician involvement is needed throughout development, not just at clinical deployment. We propose ways that clinician involvement can add value at each stage of the multimodal AI development pipeline, and argue for the establishment of actively managed multidisciplinary communities to work collaboratively towards the shared goal of improving the health of all.
ISSN: 2589-5370