Comparing the Linear and Quadratic Discriminant Analysis of Diabetes Disease Classification Based on Data Multicollinearity

Linear and quadratic discriminant analysis are two fundamental classification methods used in statistical learning. Moments (MM), maximum likelihood (ML), minimum volume ellipsoids (MVE), and t-distribution methods are used to estimate the parameter of independent variables on the multivariate norma...

Full description

Saved in:

Bibliographic Details
Main Author:	Autcha Araveeporn
Format:	Article
Language:	English
Published:	Wiley 2022-01-01
Series:	International Journal of Mathematics and Mathematical Sciences
Online Access:	http://dx.doi.org/10.1155/2022/7829795
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832562365180149760
author	Autcha Araveeporn
author_facet	Autcha Araveeporn
author_sort	Autcha Araveeporn
collection	DOAJ
description	Linear and quadratic discriminant analysis are two fundamental classification methods used in statistical learning. Moments (MM), maximum likelihood (ML), minimum volume ellipsoids (MVE), and t-distribution methods are used to estimate the parameter of independent variables on the multivariate normal distribution in order to classify binary dependent variables. The MM and ML methods are popular and effective methods that approximate the distribution parameter and use observed data. However, the MVE and t-distribution methods focus on the resampling algorithm, a reliable tool for high resistance. This paper starts by explaining the concepts of linear and quadratic discriminant analysis and then presents the four other methods used to create the decision boundary. Our simulation study generated the independent variables by setting the coefficient correlation via multivariate normal distribution or multicollinearity, often through basic logistic regression used to construct the binary dependent variable. For application to Pima Indian diabetic dataset, we expressed the classification of diabetes as the dependent variable and used a dataset of eight independent variables. This paper aimed to determine the highest average percentage of accuracy. Our results showed that the MM and ML methods successfully used large independent variables for linear discriminant analysis (LDA). However, the t-distribution method of quadratic discriminant analysis (QDA) performed better when using small independent variables.
format	Article
id	doaj-art-c719fc37362b462e919c49d4c5444d44
institution	Kabale University
issn	1687-0425
language	English
publishDate	2022-01-01
publisher	Wiley
record_format	Article
series	International Journal of Mathematics and Mathematical Sciences
spelling	doaj-art-c719fc37362b462e919c49d4c5444d442025-02-03T01:22:48ZengWileyInternational Journal of Mathematics and Mathematical Sciences1687-04252022-01-01202210.1155/2022/7829795Comparing the Linear and Quadratic Discriminant Analysis of Diabetes Disease Classification Based on Data MulticollinearityAutcha Araveeporn0Department of StatisticsLinear and quadratic discriminant analysis are two fundamental classification methods used in statistical learning. Moments (MM), maximum likelihood (ML), minimum volume ellipsoids (MVE), and t-distribution methods are used to estimate the parameter of independent variables on the multivariate normal distribution in order to classify binary dependent variables. The MM and ML methods are popular and effective methods that approximate the distribution parameter and use observed data. However, the MVE and t-distribution methods focus on the resampling algorithm, a reliable tool for high resistance. This paper starts by explaining the concepts of linear and quadratic discriminant analysis and then presents the four other methods used to create the decision boundary. Our simulation study generated the independent variables by setting the coefficient correlation via multivariate normal distribution or multicollinearity, often through basic logistic regression used to construct the binary dependent variable. For application to Pima Indian diabetic dataset, we expressed the classification of diabetes as the dependent variable and used a dataset of eight independent variables. This paper aimed to determine the highest average percentage of accuracy. Our results showed that the MM and ML methods successfully used large independent variables for linear discriminant analysis (LDA). However, the t-distribution method of quadratic discriminant analysis (QDA) performed better when using small independent variables.http://dx.doi.org/10.1155/2022/7829795
spellingShingle	Autcha Araveeporn Comparing the Linear and Quadratic Discriminant Analysis of Diabetes Disease Classification Based on Data Multicollinearity International Journal of Mathematics and Mathematical Sciences
title	Comparing the Linear and Quadratic Discriminant Analysis of Diabetes Disease Classification Based on Data Multicollinearity
title_full	Comparing the Linear and Quadratic Discriminant Analysis of Diabetes Disease Classification Based on Data Multicollinearity
title_fullStr	Comparing the Linear and Quadratic Discriminant Analysis of Diabetes Disease Classification Based on Data Multicollinearity
title_full_unstemmed	Comparing the Linear and Quadratic Discriminant Analysis of Diabetes Disease Classification Based on Data Multicollinearity
title_short	Comparing the Linear and Quadratic Discriminant Analysis of Diabetes Disease Classification Based on Data Multicollinearity
title_sort	comparing the linear and quadratic discriminant analysis of diabetes disease classification based on data multicollinearity
url	http://dx.doi.org/10.1155/2022/7829795
work_keys_str_mv	AT autchaaraveeporn comparingthelinearandquadraticdiscriminantanalysisofdiabetesdiseaseclassificationbasedondatamulticollinearity

Comparing the Linear and Quadratic Discriminant Analysis of Diabetes Disease Classification Based on Data Multicollinearity

Similar Items