Five advanced chatbots solving European Diploma in Radiology (EDiR) text-based questions: differences in performance and consistency

Abstract Background We compared the performance, confidence, and response consistency of five chatbots powered by large language models in solving European Diploma in Radiology (EDiR) text-based multiple-response questions. Methods ChatGPT-4o, ChatGPT-4o-mini, Copilot, Gemini, and Claude 3.5 Sonnet...

Full description

Saved in:
Bibliographic Details
Main Authors: Jakub Pristoupil, Laura Oleaga, Vanesa Junquero, Cristina Merino, Suha Sureyya Ozbek, Lukas Lambert, European Society of Radiology (ESR)
Format: Article
Language:English
Published: SpringerOpen 2025-08-01
Series:European Radiology Experimental
Subjects:
Online Access:https://doi.org/10.1186/s41747-025-00591-0
Tags: Add Tag
No Tags, Be the first to tag this record!