Correctness Coverage Evaluation for Medical Multiple-Choice Question Answering Based on the Enhanced Conformal Prediction Framework

Large language models (LLMs) are increasingly adopted in medical question answering (QA) scenarios. However, LLMs have been proven to generate hallucinations and nonfactual information, undermining their trustworthiness in high-stakes medical tasks. Conformal Prediction (CP) is now recognized as a r...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yusong Ke, Hongru Lin, Yuting Ruan, Junya Tang, Li Li
Format:	Article
Language:	English
Published:	MDPI AG 2025-05-01
Series:	Mathematics
Subjects:	large language models conformal prediction medical multiple-choice question answering average prediction set size
Online Access:	https://www.mdpi.com/2227-7390/13/9/1538
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://www.mdpi.com/2227-7390/13/9/1538

Correctness Coverage Evaluation for Medical Multiple-Choice Question Answering Based on the Enhanced Conformal Prediction Framework

Internet

Similar Items