Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models
<b>Background</b>: Metabolomics measurements are noisy, often characterized by a small sample size and missing entries. While data-driven methods have shown promise in terms of analyzing metabolomics data, e.g., revealing biomarkers of various phenotypes, metabolomics data analysis can s...
Saved in:
Main Authors: | , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2024-12-01
|
Series: | Metabolites |
Subjects: | |
Online Access: | https://www.mdpi.com/2218-1989/15/1/2 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832587991986470912 |
---|---|
author | Lu Li Huub Hoefsloot Barbara M. Bakker David Horner Morten A. Rasmussen Age K. Smilde Evrim Acar |
author_facet | Lu Li Huub Hoefsloot Barbara M. Bakker David Horner Morten A. Rasmussen Age K. Smilde Evrim Acar |
author_sort | Lu Li |
collection | DOAJ |
description | <b>Background</b>: Metabolomics measurements are noisy, often characterized by a small sample size and missing entries. While data-driven methods have shown promise in terms of analyzing metabolomics data, e.g., revealing biomarkers of various phenotypes, metabolomics data analysis can significantly benefit from incorporating prior information about metabolic mechanisms. This paper introduces a novel data analysis approach to incorporate mechanistic models in metabolomics data analysis. <b>Methods</b>: We arranged time-resolved metabolomics measurements of plasma samples collected during a meal challenge test from the COPSAC<sub>2000</sub> cohort as a third-order tensor: <i>subjects</i> by <i>metabolites</i> by <i>time samples</i>. Simulated challenge test data generated using a human whole-body metabolic model were also arranged as a third-order tensor: <i>virtual subjects</i> by <i>metabolites</i> by <i>time samples</i>. Real and simulated data sets were coupled in the <i>metabolites</i> mode and jointly analyzed using coupled tensor factorizations to reveal the underlying patterns. <b>Results</b>: Our experiments demonstrated that the joint analysis of simulated and real data had better performance in terms of pattern discovery, achieving higher correlations with a BMI (body mass index)-related phenotype compared to the analysis of only real data in males, while in females, the performance was comparable. We also demonstrated the advantages of such a joint analysis approach in the presence of incomplete measurements and its limitations in the presence of wrong prior information. <b>Conclusions</b>: The joint analysis of real measurements and simulated data (generated using a mechanistic model) through coupled tensor factorizations guides real data analysis with prior information encapsulated in mechanistic models and reveals interpretable patterns. |
format | Article |
id | doaj-art-5566100f9d46420aaf91312b5781b171 |
institution | Kabale University |
issn | 2218-1989 |
language | English |
publishDate | 2024-12-01 |
publisher | MDPI AG |
record_format | Article |
series | Metabolites |
spelling | doaj-art-5566100f9d46420aaf91312b5781b1712025-01-24T13:41:07ZengMDPI AGMetabolites2218-19892024-12-01151210.3390/metabo15010002Longitudinal Metabolomics Data Analysis Informed by Mechanistic ModelsLu Li0Huub Hoefsloot1Barbara M. Bakker2David Horner3Morten A. Rasmussen4Age K. Smilde5Evrim Acar6School of Mathematics (Zhuhai), Sun Yat-sen University, Zhuhai 519000, ChinaSwammerdam Institute for Life Sciences, University of Amsterdam, 1090 GE Amsterdam, The NetherlandsLaboratory of Pediatrics, Systems Medicine of Metabolism and Signaling, University of Groningen, University Medical Center Groningen, 9700 AD Groningen, The NetherlandsCopenhagen Prospective Studies on Asthma in Childhood (COPSAC), Herlev and Gentofte Hospital, DK-2820 Gentofte, DenmarkCopenhagen Prospective Studies on Asthma in Childhood (COPSAC), Herlev and Gentofte Hospital, DK-2820 Gentofte, DenmarkDepartment of Data Science and Knowledge Discovery, Simula Metropolitan Center for Digital Engineering, 0130 Oslo, NorwayDepartment of Data Science and Knowledge Discovery, Simula Metropolitan Center for Digital Engineering, 0130 Oslo, Norway<b>Background</b>: Metabolomics measurements are noisy, often characterized by a small sample size and missing entries. While data-driven methods have shown promise in terms of analyzing metabolomics data, e.g., revealing biomarkers of various phenotypes, metabolomics data analysis can significantly benefit from incorporating prior information about metabolic mechanisms. This paper introduces a novel data analysis approach to incorporate mechanistic models in metabolomics data analysis. <b>Methods</b>: We arranged time-resolved metabolomics measurements of plasma samples collected during a meal challenge test from the COPSAC<sub>2000</sub> cohort as a third-order tensor: <i>subjects</i> by <i>metabolites</i> by <i>time samples</i>. Simulated challenge test data generated using a human whole-body metabolic model were also arranged as a third-order tensor: <i>virtual subjects</i> by <i>metabolites</i> by <i>time samples</i>. Real and simulated data sets were coupled in the <i>metabolites</i> mode and jointly analyzed using coupled tensor factorizations to reveal the underlying patterns. <b>Results</b>: Our experiments demonstrated that the joint analysis of simulated and real data had better performance in terms of pattern discovery, achieving higher correlations with a BMI (body mass index)-related phenotype compared to the analysis of only real data in males, while in females, the performance was comparable. We also demonstrated the advantages of such a joint analysis approach in the presence of incomplete measurements and its limitations in the presence of wrong prior information. <b>Conclusions</b>: The joint analysis of real measurements and simulated data (generated using a mechanistic model) through coupled tensor factorizations guides real data analysis with prior information encapsulated in mechanistic models and reveals interpretable patterns.https://www.mdpi.com/2218-1989/15/1/2challenge testsmetabolic model(coupled) tensor factorizationslongitudinal metabolomics dataknowledge-guided machine learning |
spellingShingle | Lu Li Huub Hoefsloot Barbara M. Bakker David Horner Morten A. Rasmussen Age K. Smilde Evrim Acar Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models Metabolites challenge tests metabolic model (coupled) tensor factorizations longitudinal metabolomics data knowledge-guided machine learning |
title | Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models |
title_full | Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models |
title_fullStr | Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models |
title_full_unstemmed | Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models |
title_short | Longitudinal Metabolomics Data Analysis Informed by Mechanistic Models |
title_sort | longitudinal metabolomics data analysis informed by mechanistic models |
topic | challenge tests metabolic model (coupled) tensor factorizations longitudinal metabolomics data knowledge-guided machine learning |
url | https://www.mdpi.com/2218-1989/15/1/2 |
work_keys_str_mv | AT luli longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels AT huubhoefsloot longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels AT barbarambakker longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels AT davidhorner longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels AT mortenarasmussen longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels AT ageksmilde longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels AT evrimacar longitudinalmetabolomicsdataanalysisinformedbymechanisticmodels |