Towards evaluating and building versatile large language models for medicine
Abstract In this study, we present MedS-Bench, a comprehensive benchmark for evaluating large language models (LLMs) in clinical contexts, spanning 11 high-level clinical tasks. We evaluated nine leading LLMs, e.g., MEDITRON, Llama 3, Mistral, GPT-4, Claude-3.5, etc., and found that most mode...
Main Authors: | Chaoyi Wu, Pengcheng Qiu, Jinxin Liu, Hongfei Gu, Na Li, Ya Zhang, Yanfeng Wang, Weidi Xie |
---|---|
Format: | Article |
Language: | English |
Published: | Nature Portfolio, 2025-01-01 |
Series: | npj Digital Medicine |
Online Access: | https://doi.org/10.1038/s41746-024-01390-4 |
Similar Items
- Toward cultural interpretability: A linguistic anthropological framework for describing and evaluating large language models
  by: Graham M Jones, et al.
  Published: (2025-03-01)
- Application, Challenges, and Prospects of Large Language Model in the Field of Traditional Chinese Medicine
  by: CHEN Zijia, et al.
  Published: (2024-08-01)
- Toward the Development of Large-Scale Word Embedding for Low-Resourced Language
  by: Shahzad Nazir, et al.
  Published: (2022-01-01)
- Software-Defined Radio FPGA Cores: Building towards a Domain-Specific Language
  by: Lekhobola Tsoeunyane, et al.
  Published: (2017-01-01)
- Realization of DVCCTA Based Versatile Modulator
  by: Neeta Pandey, et al.
  Published: (2014-01-01)