Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models

<b>Background</b>: Metabolomics measurements are noisy, often characterized by a small sample size and missing entries. While data-driven methods have shown promise in terms of analyzing metabolomics data, e.g., revealing biomarkers of various phenotypes, metabolomics data analysis can s...

Full description

Saved in:
Bibliographic Details
Main Authors: Lu Li, Huub Hoefsloot, Barbara M. Bakker, David Horner, Morten A. Rasmussen, Age K. Smilde, Evrim Acar
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Metabolites
Subjects:
Online Access:https://www.mdpi.com/2218-1989/15/1/2
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832587991986470912
author Lu Li
Huub Hoefsloot
Barbara M. Bakker
David Horner
Morten A. Rasmussen
Age K. Smilde
Evrim Acar
author_facet Lu Li
Huub Hoefsloot
Barbara M. Bakker
David Horner
Morten A. Rasmussen
Age K. Smilde
Evrim Acar
author_sort Lu Li
collection DOAJ
description <b>Background</b>: Metabolomics measurements are noisy, often characterized by a small sample size and missing entries. While data-driven methods have shown promise in terms of analyzing metabolomics data, e.g., revealing biomarkers of various phenotypes, metabolomics data analysis can significantly benefit from incorporating prior information about metabolic mechanisms. This paper introduces a novel data analysis approach to incorporate mechanistic models in metabolomics data analysis. <b>Methods</b>: We arranged time-resolved metabolomics measurements of plasma samples collected during a meal challenge test from the COPSAC<sub>2000</sub> cohort as a third-order tensor: <i>subjects</i> by <i>metabolites</i> by <i>time samples</i>. Simulated challenge test data generated using a human whole-body metabolic model were also arranged as a third-order tensor: <i>virtual subjects</i> by <i>metabolites</i> by <i>time samples</i>. Real and simulated data sets were coupled in the <i>metabolites</i> mode and jointly analyzed using coupled tensor factorizations to reveal the underlying patterns. <b>Results</b>: Our experiments demonstrated that the joint analysis of simulated and real data had better performance in terms of pattern discovery, achieving higher correlations with a BMI (body mass index)-related phenotype compared to the analysis of only real data in males, while in females, the performance was comparable. We also demonstrated the advantages of such a joint analysis approach in the presence of incomplete measurements and its limitations in the presence of wrong prior information. <b>Conclusions</b>: The joint analysis of real measurements and simulated data (generated using a mechanistic model) through coupled tensor factorizations guides real data analysis with prior information encapsulated in mechanistic models and reveals interpretable patterns.
format Article
id doaj-art-5566100f9d46420aaf91312b5781b171
institution Kabale University
issn 2218-1989
language English
publishDate 2024-12-01
publisher MDPI AG
record_format Article
series Metabolites
spelling doaj-art-5566100f9d46420aaf91312b5781b1712025-01-24T13:41:07ZengMDPI AGMetabolites2218-19892024-12-01151210.3390/metabo15010002Longitudinal Metabolomics Data Analysis Informed by Mechanistic ModelsLu Li0Huub Hoefsloot1Barbara M. Bakker2David Horner3Morten A. Rasmussen4Age K. Smilde5Evrim Acar6School of Mathematics (Zhuhai), Sun Yat-sen University, Zhuhai 519000, ChinaSwammerdam Institute for Life Sciences, University of Amsterdam, 1090 GE Amsterdam, The NetherlandsLaboratory of Pediatrics, Systems Medicine of Metabolism and Signaling, University of Groningen, University Medical Center Groningen, 9700 AD Groningen, The NetherlandsCopenhagen Prospective Studies on Asthma in Childhood (COPSAC), Herlev and Gentofte Hospital, DK-2820 Gentofte, DenmarkCopenhagen Prospective Studies on Asthma in Childhood (COPSAC), Herlev and Gentofte Hospital, DK-2820 Gentofte, DenmarkDepartment of Data Science and Knowledge Discovery, Simula Metropolitan Center for Digital Engineering, 0130 Oslo, NorwayDepartment of Data Science and Knowledge Discovery, Simula Metropolitan Center for Digital Engineering, 0130 Oslo, Norway<b>Background</b>: Metabolomics measurements are noisy, often characterized by a small sample size and missing entries. While data-driven methods have shown promise in terms of analyzing metabolomics data, e.g., revealing biomarkers of various phenotypes, metabolomics data analysis can significantly benefit from incorporating prior information about metabolic mechanisms. This paper introduces a novel data analysis approach to incorporate mechanistic models in metabolomics data analysis. <b>Methods</b>: We arranged time-resolved metabolomics measurements of plasma samples collected during a meal challenge test from the COPSAC<sub>2000</sub> cohort as a third-order tensor: <i>subjects</i> by <i>metabolites</i> by <i>time samples</i>. Simulated challenge test data generated using a human whole-body metabolic model were also arranged as a third-order tensor: <i>virtual subjects</i> by <i>metabolites</i> by <i>time samples</i>. Real and simulated data sets were coupled in the <i>metabolites</i> mode and jointly analyzed using coupled tensor factorizations to reveal the underlying patterns. <b>Results</b>: Our experiments demonstrated that the joint analysis of simulated and real data had better performance in terms of pattern discovery, achieving higher correlations with a BMI (body mass index)-related phenotype compared to the analysis of only real data in males, while in females, the performance was comparable. We also demonstrated the advantages of such a joint analysis approach in the presence of incomplete measurements and its limitations in the presence of wrong prior information. <b>Conclusions</b>: The joint analysis of real measurements and simulated data (generated using a mechanistic model) through coupled tensor factorizations guides real data analysis with prior information encapsulated in mechanistic models and reveals interpretable patterns.https://www.mdpi.com/2218-1989/15/1/2challenge testsmetabolic model(coupled) tensor factorizationslongitudinal metabolomics dataknowledge-guided machine learning
spellingShingle Lu Li
Huub Hoefsloot
Barbara M. Bakker
David Horner
Morten A. Rasmussen
Age K. Smilde
Evrim Acar
Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models
Metabolites
challenge tests
metabolic model
(coupled) tensor factorizations
longitudinal metabolomics data
knowledge-guided machine learning
title Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models
title_full Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models
title_fullStr Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models
title_full_unstemmed Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models
title_short Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models
title_sort longitudinal metabolomics data analysis informed by mechanistic models
topic challenge tests
metabolic model
(coupled) tensor factorizations
longitudinal metabolomics data
knowledge-guided machine learning
url https://www.mdpi.com/2218-1989/15/1/2
work_keys_str_mv AT luli longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels
AT huubhoefsloot longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels
AT barbarambakker longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels
AT davidhorner longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels
AT mortenarasmussen longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels
AT ageksmilde longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels
AT evrimacar longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels