Evaluating language model embeddings for Parkinson’s disease cohort harmonization using a novel manually curated variable mapping schema

Abstract: Data harmonization is an important yet time-consuming process. Given the recent popularity of applications built on language models (LMs), driven by their strong text-understanding capabilities, we investigated whether LMs could facilitate data harmonization for clinical use cases. To evaluate thi...

Bibliographic Details
Main Authors: Yasamin Salimi, Tim Adams, Mehmet Can Ay, Helena Balabin, Marc Jacobs, Martin Hofmann-Apitius
Format: Article
Language: English
Published: Nature Portfolio 2025-06-01
Series: Scientific Reports
Online Access: https://doi.org/10.1038/s41598-025-06447-2