A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures

Bibliographic Details
Main Authors: Ping Shi, J. Max Goodson
Format: Article
Language: English
Published: Wiley 2019-01-01
Series: Journal of Obesity
Online Access: http://dx.doi.org/10.1155/2019/9570218
collection DOAJ
description Background. A key mechanism of obesity involves dysregulation of metabolic and inflammatory markers. This study aimed to identify salivary biomarkers and other factors associated with obesity using an ensemble data mining approach. Methods. For a random cohort of over 700 subjects drawn from 8137 Kuwaiti children (10.00 ± 0.67 years), four data mining methods were applied to identify important variables associated with obesity: logistic regression with lasso regularization (Lasso), multivariate adaptive regression splines (MARS), random forests (RF), and boosted classification trees (BT). Each algorithm generated a variable importance rank list based on an internal cross-validation procedure. An aggregated importance ranking was constructed by averaging the rank order of variables across the individual lists, weighted by the classification performance of the respective models. Subsequently, the subset of top-ranking variables identified by at least three algorithms was evaluated for classification performance using receiver operating characteristic (ROC) analysis with bootstrap percentile resampling. Results. Obesity was defined either by waist circumference (OBW) or by body mass index (BMI) (OBWHO). We identified C-reactive protein (CRP), insulin, leptin, and adiponectin as salivary biomarkers associated with OBW, plus one clinical feature, fitness level. A similar set of biomarkers was identified for OBWHO, but it did not include leptin. Tree-based clustering analysis revealed patterns that differed significantly between the OBW and OBWHO subjects. Conclusion. A data mining approach based on multiple algorithms is useful for identifying factors associated with phenotypes, especially where relationships are not salient, and a consensus from multiple methods can help produce a more generalizable subset of features. In this case, we have demonstrated that evaluation using waist circumference includes an association with high levels of salivary leptin, which is not seen with evaluation by BMI.
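The aggregation step described in the abstract (a performance-weighted average of per-model ranks, followed by a "selected by at least three algorithms" filter) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the placeholder variable names `a` to `d`, the weights, the top-k cutoff, and the rule of giving unranked variables the worst rank plus one are all assumptions made for illustration, and the toy numbers are not study results.

```python
def aggregate_ranks(rank_lists, weights):
    """Weighted average rank per variable (lower is more important).

    rank_lists: model name -> variables ordered from most to least important.
    weights:    model name -> classification performance used as the weight
                (for example an AUC; the choice of metric is an assumption).
    """
    variables = set().union(*rank_lists.values())
    total_weight = sum(weights[m] for m in rank_lists)
    scores = {}
    for v in variables:
        weighted_sum = 0.0
        for model, ranking in rank_lists.items():
            # Assumption: a variable a model did not rank gets worst rank + 1.
            rank = ranking.index(v) + 1 if v in ranking else len(ranking) + 1
            weighted_sum += weights[model] * rank
        scores[v] = weighted_sum / total_weight
    # Ties are broken alphabetically so the ordering is deterministic.
    order = sorted(variables, key=lambda v: (scores[v], v))
    return order, scores


def consensus(rank_lists, top_k, min_models=3):
    """Variables appearing in the top_k of at least min_models rank lists."""
    counts = {}
    for ranking in rank_lists.values():
        for v in ranking[:top_k]:
            counts[v] = counts.get(v, 0) + 1
    return {v for v, c in counts.items() if c >= min_models}


# Hypothetical toy input: four models ranking four placeholder variables.
toy_ranks = {
    "lasso": ["a", "b", "c", "d"],
    "mars": ["b", "a", "d", "c"],
    "rf": ["a", "c", "b", "d"],
    "bt": ["a", "b", "d", "c"],
}
toy_weights = {"lasso": 0.80, "mars": 0.70, "rf": 0.90, "bt": 0.85}

order, scores = aggregate_ranks(toy_ranks, toy_weights)
selected = consensus(toy_ranks, top_k=2, min_models=3)
print(order, sorted(selected))
```

Weighting by model performance means that a ranking produced by a poorly classifying model moves the aggregate less than one from a well-performing model; the separate consensus filter guards against a variable being promoted by a single algorithm.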
id doaj-art-4bcbe1f5c366474c9aedf2908898bcbc
institution Kabale University
issn 2090-0708
2090-0716
spelling Ping Shi (Department of Applied Oral Sciences, The Forsyth Institute, Cambridge, MA 02142, USA); J. Max Goodson (Department of Applied Oral Sciences, The Forsyth Institute, Cambridge, MA 02142, USA). A Data Mining Approach Identified Salivary Biomarkers That Discriminate between Two Obesity Measures. Journal of Obesity, Wiley, 2019-01-01, vol. 2019, article 9570218. ISSN 2090-0708, 2090-0716. http://dx.doi.org/10.1155/2019/9570218