Synthetic data trained open-source language models are feasible alternatives to proprietary models for radiology reporting

Abstract The study assessed the feasibility of using synthetic data to fine-tune various open-source LLMs for free text to structured data conversation in radiology, comparing their performance with GPT models. A training set of 3000 synthetic thyroid nodule dictations was generated to train six ope...

Full description

Saved in:
Bibliographic Details
Main Authors: Aakriti Pandita, Angela Keniston, Nikhil Madhuripan
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:npj Digital Medicine
Online Access:https://doi.org/10.1038/s41746-025-01658-3
Tags: Add Tag
No Tags, Be the first to tag this record!