Enhancing QA System Evaluation: An In-Depth Analysis of Metrics and Model-Specific Behaviors

This study examines how evaluation metrics influence the perception and performance of question answering (QA) systems, focusing on their effectiveness in QA tasks. We compare four models: BERT, BioBERT, Bio-ClinicalBERT, and RoBERTa, using ten EPIC-QA...

Bibliographic Details
Main Authors: Heesop Kim, Aluko Ademola
Format: Article
Language: English
Published: Korea Institute of Science and Technology Information 2025-03-01
Series: Journal of Information Science Theory and Practice
Online Access: https://data.doi.or.kr/10.1633/JISTaP.2025.13.1.6