Evaluating the performance of GPT-3.5, GPT-4, and GPT-4o in the Chinese National Medical Licensing Examination
Abstract: This study aims to compare and evaluate the performance of GPT-3.5, GPT-4, and GPT-4o in the 2020 and 2021 Chinese National Medical Licensing Examination (NMLE), exploring their potential value in medical education and clinical applications. Six hundred original test questions from the 2020...
| Main Authors: | Dingyuan Luo, Mengke Liu, Runyuan Yu, Yulian Liu, Wenjun Jiang, Qi Fan, Naifeng Kuang, Qiang Gao, Tao Yin, Zuncheng Zheng |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Nature Portfolio, 2025-04-01 |
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-98949-2 |
Similar Items
- Performance of ChatGPT-3.5 and GPT-4 in national licensing examinations for medicine, pharmacy, dentistry, and nursing: a systematic review and meta-analysis
  by: Hye Kyung Jin, et al.
  Published: (2024-09-01)
- The performance of ChatGPT on medical image-based assessments and implications for medical education
  by: Xiang Yang, et al.
  Published: (2025-08-01)
- The performance of AI in medical examinations: an exploration of ChatGPT in ultrasound medical education
  by: Dao-Rong Hong, et al.
  Published: (2024-11-01)
- Revolutionizing research training: ChatGPT as a catalyst in medical education
  by: Peng Zhang, et al.
  Published: (2025-05-01)
- Analysis of ChatGPT-3.5’s Potential in Generating NBME-Standard Pharmacology Questions: What Can Be Improved?
  by: Marwa Saad, et al.
  Published: (2024-10-01)