Evaluating language model embeddings for Parkinson’s disease cohort harmonization using a novel manually curated variable mapping schema
Abstract: Data harmonization is an important yet time-consuming process. Given the recent popularity of applications built on Language Models (LMs), driven by their strong text-understanding capabilities, we investigated whether LMs could facilitate data harmonization for clinical use cases. To evaluate thi...
| Main Authors: | Yasamin Salimi, Tim Adams, Mehmet Can Ay, Helena Balabin, Marc Jacobs, Martin Hofmann-Apitius |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Nature Portfolio, 2025-06-01 |
| Series: | Scientific Reports |
| Online Access: | https://doi.org/10.1038/s41598-025-06447-2 |
Similar Items
- Strategies for Embedding Research Data Management Through Effective Communication
  by: Fadwa Alshawaf
  Published: (2025-05-01)
- Engaging state geological surveys in implementing data stewardship practices: a pilot workshop at the Kentucky Geological Survey
  by: Elizabeth Adams, et al.
  Published: (2025-04-01)
- Assessing the harmonization of structured electronic health record data to reference terminologies and data completeness through data provenance
  by: Keith Marsolo, et al.
  Published: (2025-04-01)
- Overcoming the Barriers That Obscure the Interlinking and Analysis of Clinical Data Through Harmonization and Incremental Learning
  by: Vasileios C. Pezoulas, et al.
  Published: (2020-01-01)
- Data stewardship and curation practices in AI-based genomics and automated microscopy image analysis for high-throughput screening studies: promoting robust and ethical AI applications
  by: Asefa Adimasu Taddese, et al.
  Published: (2025-02-01)