Evaluating the performance of GPT-3.5, GPT-4, and GPT-4o in the Chinese National Medical Licensing Examination

Abstract This study aims to compare and evaluate the performance of GPT-3.5, GPT-4, and GPT-4o in the 2020 and 2021 Chinese National Medical Licensing Examination (NMLE), exploring their potential value in medical education and clinical applications. Six hundred original test questions from the 2020...

Bibliographic Details
Main Authors:	Dingyuan Luo, Mengke Liu, Runyuan Yu, Yulian Liu, Wenjun Jiang, Qi Fan, Naifeng Kuang, Qiang Gao, Tao Yin, Zuncheng Zheng
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-04-01
Series:	Scientific Reports
Subjects:	ChatGPT Large Language models Artificial intelligence Medical licensing examination Medical education
Online Access:	https://doi.org/10.1038/s41598-025-98949-2

Similar Items

Performance of ChatGPT-3.5 and GPT-4 in national licensing examinations for medicine, pharmacy, dentistry, and nursing: a systematic review and meta-analysis
by: Hye Kyung Jin, et al.
Published: (2024-09-01)

The performance of ChatGPT on medical image-based assessments and implications for medical education
by: Xiang Yang, et al.
Published: (2025-08-01)

The performance of AI in medical examinations: an exploration of ChatGPT in ultrasound medical education
by: Dao-Rong Hong, et al.
Published: (2024-11-01)

Revolutionizing research training: ChatGPT as a catalyst in medical education
by: Peng Zhang, et al.
Published: (2025-05-01)

Analysis of ChatGPT-3.5’s Potential in Generating NBME-Standard Pharmacology Questions: What Can Be Improved?
by: Marwa Saad, et al.
Published: (2024-10-01)

Comparative analysis of ChatGPT 3.5 and ChatGPT 4 obstetric and gynecological knowledge
by: Franciszek Ługowski, et al.
Published: (2025-07-01)

ChatGPT-4 versus human generated multiple choice questions - A study from a medical college in Pakistan
by: Muhammad Ahsan Naseer, et al.
Published: (2024-12-01)

Navigating the integration of ChatGPT in UAE’s government sector: challenges and opportunities
by: Ghada Nabil Goher
Published: (2025-01-01)

Image Recognition Performance of GPT-4V(ision) and GPT-4o in Ophthalmology: Use of Images in Clinical Questions
by: Tomita K, et al.
Published: (2025-05-01)

Status and perceptions of ChatGPT utilization among medical students: a survey-based study
by: Na Hu, et al.
Published: (2025-06-01)

Assessing ChatGPT adoption in Jordanian medical education: a UTAUT model approach
by: Noura Alqaisi, et al.
Published: (2025-05-01)

The Role of ChatGPT in Dermatology Diagnostics
by: Ziad Khamaysi, et al.
Published: (2025-06-01)

Medical students and ChatGPT: analyzing attitudes, practices, and academic perceptions
by: Ahmed Samir Abdelhafiz, et al.
Published: (2025-02-01)

The application of problem-based learning (PBL) guided by ChatGPT in clinical education in the Department of Nephrology
by: Xiaoya Tong, et al.
Published: (2025-07-01)

Usefulness of the large language model ChatGPT (GPT‐4) as a diagnostic tool and information source in dermatology
by: Jacob P. S. Nielsen, et al.
Published: (2024-12-01)

Clinical Simulation with ChatGpt: A Revolution in Medical Education?
by: Aziel Alejandro Peralta Ramirez, et al.
Published: (2025-12-01)

The gendered nature of AI: Men and masculinities through the lens of ChatGPT and GPT4
by: Andreas Walther, et al.
Published: (2024-08-01)

ChatGPT nella valutazione dell’elaborato scritto
by: Nizzolino, Salvatore
Published: (2025-04-01)

Assessing medical students’ attitudes, performance, and usage of ChatGPT in Jeddah, Saudi Arabia
by: Dalia Alammari, et al.
Published: (2025-07-01)

The impact of ChatGPT on academic integrity in medical education: a developing nation perspective
by: Anila Jaleel, et al.
Published: (2025-05-01)

Unge og helseinformasjon: ChatGPT vs. fagpersoner
by: Marita Skjuve, et al.
Published: (2025-01-01)

Recherchieren mit ChatGPT?
by: Friedrich Quaasdorf
Published: (2024-12-01)

Generative Artificial Intelligence in Medicine: Has ChatGPT the Potential in Assisting Dermatologists?
by: Adela-Vasilica GUDIU, et al.
Published: (2025-05-01)

Comparative analysis of ChatGPT and Gemini (Bard) in medical inquiry: a scoping review
by: Fattah H. Fattah, et al.
Published: (2025-02-01)

Utilizing ChatGPT-3.5 to Assist Ophthalmologists in Clinical Decision-making
by: Samir Cayenne, et al.
Published: (2025-05-01)

ChatGPT-4 Vision: a promising tool for diagnosing thyroid nodules
by: Dao-Rong Hong, et al.
Published: (2025-07-01)

Implementation and evaluation of an optimized surgical clerkship teaching model utilizing ChatGPT
by: Yi Huang, et al.
Published: (2024-12-01)

Capabilities of ChatGPT-3.5 as a Urological Triage System
by: Christopher Hirtsiefer, et al.
Published: (2024-12-01)

Capturing pharmacists’ perspectives on the value, risks, and applications of ChatGPT in pharmacy practice: A qualitative study
by: Ammar Abdulrahman Jairoun, et al.
Published: (2024-12-01)

Assessing the clinical support capabilities of ChatGPT 4o and ChatGPT 4o mini in managing lumbar disc herniation
by: Suning Wang, et al.
Published: (2025-01-01)

ChatGPT in healthcare education: a double-edged sword of trends, challenges, and opportunities
by: Michael Agyemang Adarkwah, et al.
Published: (2025-01-01)

ChatGPT-4 Omni’s superiority in answering multiple-choice oral radiology questions
by: Melek Tassoker
Published: (2025-02-01)

Evaluating the performance of ChatGPT and GPT-4o in coding classroom discourse data: A study of synchronous online mathematics instruction
by: Simin Xu, et al.
Published: (2024-12-01)

Bridging consciousness and AI: ChatGPT-assisted phenomenological analysis
by: David Martínez-Pernía, et al.
Published: (2025-05-01)

Playing with words: Comparing the vocabulary and lexical diversity of ChatGPT and humans
by: Pedro Reviriego, et al.
Published: (2024-12-01)

The role of ChatGPT-4o in differential diagnosis and management of vertigo-related disorders
by: Xu Liu, et al.
Published: (2025-05-01)

ChatGPT perceptions, experiences, and uses with emphasis on academia
by: Haneen Ali, et al.
Published: (2025-07-01)

Teaching Arabic-Korean translation using ChatGPT
by: Esraa Hasan, et al.
Published: (2025-01-01)

Is artificial intelligence for everyone? Analyzing the role of ChatGPT as a writing assistant for medical students
by: Zahra Shahsavar, et al.
Published: (2024-12-01)

Chat GPT 4o vs residents: French language evaluation in ophthalmology
by: Leah Attal, et al.
Published: (2025-04-01)