Comparative analysis of ChatGPT and Gemini (Bard) in medical inquiry: a scoping review

IntroductionArtificial intelligence and machine learning are popular interconnected technologies. AI chatbots like ChatGPT and Gemini show considerable promise in medical inquiries. This scoping review aims to assess the accuracy and response length (in characters) of ChatGPT and Gemini in medical a...

Full description

Saved in:
Bibliographic Details
Main Authors: Fattah H. Fattah, Abdulwahid M. Salih, Ameer M. Salih, Saywan K. Asaad, Abdullah K. Ghafour, Rawa Bapir, Berun A. Abdalla, Snur Othman, Sasan M. Ahmed, Sabah Jalal Hasan, Yousif M. Mahmood, Fahmi H. Kakamad
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-02-01
Series:Frontiers in Digital Health
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fdgth.2025.1482712/full
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:IntroductionArtificial intelligence and machine learning are popular interconnected technologies. AI chatbots like ChatGPT and Gemini show considerable promise in medical inquiries. This scoping review aims to assess the accuracy and response length (in characters) of ChatGPT and Gemini in medical applications.MethodsThe eligible databases were searched to find studies published in English from January 1 to October 20, 2023. The inclusion criteria consisted of studies that focused on using AI in medicine and assessed outcomes based on the accuracy and character count (length) of ChatGPT and Gemini. Data collected from the studies included the first author's name, the country where the study was conducted, the type of study design, publication year, sample size, medical speciality, and the accuracy and response length.ResultsThe initial search identified 64 papers, with 11 meeting the inclusion criteria, involving 1,177 samples. ChatGPT showed higher accuracy in radiology (87.43% vs. Gemini's 71%) and shorter responses (907 vs. 1,428 characters). Similar trends were noted in other specialties. However, Gemini outperformed ChatGPT in emergency scenarios (87% vs. 77%) and in renal diets with low potassium and high phosphorus (79% vs. 60% and 100% vs. 77%). Statistical analysis confirms that ChatGPT has greater accuracy and shorter responses than Gemini in medical studies, with a p-value of <.001 for both metrics.ConclusionThis Scoping review suggests that ChatGPT may demonstrate higher accuracy and provide shorter responses than Gemini in medical studies.
ISSN:2673-253X