Benchmarking open-source large language models on Portuguese Revalida multiple-choice questions

Objective The study aimed to evaluate the top large language models (LLMs) in validated medical knowledge tests in Portuguese.Methods This study compared 31 LLMs in the context of solving the national Brazilian medical examination test. The research compared the performance of 23 open-source and 8 p...

Full description

Saved in:

Bibliographic Details
Main Authors:	João Victor Bruneti Severino, Pedro Angelo Basei de Paula, Matheus Nespolo Berger, Filipe Silveira Loures, Solano Amadori Todeschini, Eduardo Augusto Roeder, Maria Han Veiga, Murilo Guedes, Gustavo Lenci Marques
Format:	Article
Language:	English
Published:	BMJ Publishing Group 2025-02-01
Series:	BMJ Health & Care Informatics
Online Access:	https://informatics.bmj.com/content/32/1/e101195.full
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://informatics.bmj.com/content/32/1/e101195.full

Benchmarking open-source large language models on Portuguese Revalida multiple-choice questions

Internet

Similar Items