Expert of Experts Verification and Alignment (EVAL) Framework for Large Language Models Safety in Gastroenterology

Abstract: Large language models generate plausible text responses to medical questions, but inaccurate responses pose significant risks in medical decision-making. Grading LLM outputs to determine the best model or answer is time-consuming and impractical in clinical settings; therefore, we introduce...

Bibliographic Details
Main Authors:	Mauro Giuffrè, Kisung You, Ziteng Pang, Simone Kresevic, Sunny Chung, Ryan Chen, Youngmin Ko, Colleen Chan, Theo Saarinen, Milos Ajcevic, Lory S. Crocè, Guadalupe Garcia-Tsao, Ian Gralnek, Joseph J. Y. Sung, Alan Barkun, Loren Laine, Jasjeet Sekhon, Bradly Stadie, Dennis L. Shung
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-05-01
Series:	npj Digital Medicine
Online Access:	https://doi.org/10.1038/s41746-025-01589-z

Similar Items

Usability and adoption in a randomized trial of GutGPT a GenAI tool for gastrointestinal bleeding
by: Sunny Chung, et al.
Published: (2025-08-01)

PolEval 2022/23 Challenge Tasks and Results
by: Łukasz Kobyliński, et al.
Published: (2023-09-01)

REVISITING THE CONTENT OF THE TERMS «EXPERT» AND «FORENSIC EXPERT»
by: Lada F. Paramonova
Published: (2018-03-01)

EvalRound+ Bootstrapping and Its Rigorous Analysis for CKKS Scheme
by: Hyewon Sung, et al.
Published: (2025-01-01)

Safety Design and Evaluation on Traction System of Mass Transit Vehicles
by: JIANG Yue-li, et al.
Published: (2013-01-01)

Assessment of focal liver lesions in non-cirrhotic liver – expert opinion statement by the Swiss Association for the Study of the Liver and the Swiss Society of Gastroenterology
by: Mikael Sawatzki, et al.
Published: (2023-09-01)

Expert Commentary
by: Marina V. Fedoseenko
Published: (2017-12-01)

The Expert Right to Submit Petitions as an Exercise Form of Expert Initiative
by: O. G. Dyakonova
Published: (2019-07-01)

Resolution of the Independent Expert Council of The Union of Experts in the Field of Immunoprophylaxis
by: article Editorial
Published: (2023-08-01)

Resolution of the independent expert council of the union of experts in the field of immunoprophylaxis
by: article Editorial
Published: (2023-08-01)

SmartSolos expert: An expert system for Brazilian soil classification
by: Glauber José Vaz, et al.
Published: (2025-03-01)

"ELECTRONIC EXPERT" EXPERT SYSTEM SUBSYSTEM'S "DESIGN" CONSTRUCTION FEATURES
by: L.V. BORISOVA
Published: (2009-06-01)

False expert report
by: Jakub Matis
Published: (2025-05-01)

FPGAs for Domain Experts
by: Wim Vanderbauwhede, et al.
Published: (2020-01-01)

Reflections on being an expert
by: Debbie Garratt
Published: (2018-12-01)

Expert systems in dentistry
by: Todorović Aleksandar, et al.
Published: (2025-01-01)

The Interrogation of Experts and Specialists in the Civil Proceeding (Recommendations to the Expert, the Court, the Parties' Representatives)
by: M. V. Zhizhina
Published: (2016-03-01)

Comparison of maintaining of body balance in combat sports between experts and non-experts
by: Artur Litwiniuk, et al.
Published: (2023-05-01)

Expert-innovator behaviour questionnaire as a new tool for selecting potential experts
by: Deptuła Anna M., et al.
Published: (2025-06-01)

Crowdsourcing Relative Rankings of Multi-Word Expressions: Experts versus Non-Experts
by: David Alfter, et al.
Published: (2022-07-01)

CryptoEval: Evaluating the risk of cryptographic misuses in Android apps with data‐flow analysis
by: Cong Sun, et al.
Published: (2023-07-01)

Arch-Eval benchmark for assessing chinese architectural domain knowledge in large language models
by: Jie Wu, et al.
Published: (2025-04-01)

Use of expert elicitation in the field of occupational hygiene: Comparison of expert and observed data distributions.
by: David Michael Lowry, et al.
Published: (2022-01-01)

Report on the Expert Forum on using Information Technology to Facilitate Uptake and Impact of Colorectal Cancer Screening Guidelines
by: Maida J Sewitch, et al.
Published: (2012-01-01)

Clinical practice recommendations on the use of neuromodulators in gastroenterology: AMG (Asociación Mexicana de Gastroenterología) - AMNM (Asociación Mexicana de Neurogastroenterología y Motilidad) expert joint review
by: O. Gómez-Escudero, et al.
Published: (2025-04-01)

Vem är egentligen expert?
by: Johan Söderman
Published: (2011-06-01)

ON EXPERT INFORMATION FUZZIFICATION METHOD
by: Valery P. Dimitrov, et al.
Published: (2012-03-01)

ON EXPERTS IN RADIATING HYGIENE TRAINING
by: T. B. Baltrukova
Published: (2017-02-01)

The Language of the Linguistic Expert’s Opinion
by: V. O. Kuznetsov
Published: (2023-05-01)

Patellofemoral arthroplasty: expert opinion
by: Paul Hoogervorst, et al.
Published: (2022-01-01)

FORMAL METHODS OF EXPERT ESTIMATIONS
by: Tea Ya. Danelyan
Published: (2016-08-01)

Expert opinion. EDITORIAL NOTE
Published: (2017-11-01)

Expert Concepts in Forensic Ecology
by: N. V. Mikhaleva
Published: (2021-04-01)

EXPERT SYSTEM FOR MACHINE MAINTENANCE
by: V.P. DIMITROV
Published: (2007-09-01)

RESOLUTION ON THE RESULTS OF THE EXPERT COUNCIL
by: article Editorial
Published: (2024-02-01)

Être chercheur, devenir expert ?
by: David Demortain
Published: (2021-03-01)

Remerciements aux expert•e•s
by: Alexandre Fetelian
Published: (2025-04-01)

Professional Communication and Language Experts
by: Oana Celia GHEORGHIU
Published: (2025-04-01)

On Fuzzy Soft Expert Sets
by: Hilal Donmez, et al.
Published: (2015-09-01)

Expert consensus on apical microsurgery
by: Hanguo Wang, et al.
Published: (2025-01-01)