Enhancing QA System Evaluation: An In-Depth Analysis of Metrics and Model-Specific Behaviors
The purpose of this study is to examine how evaluation metrics influence the perception and performance of question answering (QA) systems, particularly focusing on their effectiveness in QA tasks. We compare four different models: BERT, BioBERT, Bio-ClinicalBERT, and RoBERTa, utilizing ten EPIC-QA...
Saved in:
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Korea Institute of Science and Technology Information
2025-03-01
|
| Series: | Journal of Information Science Theory and Practice |
| Subjects: | |
| Online Access: | https://data.doi.or.kr/10.1633/JISTaP.2025.13.1.6 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|