A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity
Machine learning algorithms are rapidly deploying and have made manifold breakthroughs in various fields. The optimization of algorithms got abundant attention of researchers being a core component for deploying the machine learning model (MLM) abled to learn the parameters in significant ways for t...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2022-01-01
|
Series: | Scientifica |
Online Access: | http://dx.doi.org/10.1155/2022/7271293 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832564478315593728 |
---|---|
author | Muhammad Islam Farrukh Shehzad |
author_facet | Muhammad Islam Farrukh Shehzad |
author_sort | Muhammad Islam |
collection | DOAJ |
description | Machine learning algorithms are rapidly deploying and have made manifold breakthroughs in various fields. The optimization of algorithms got abundant attention of researchers being a core component for deploying the machine learning model (MLM) abled to learn the parameters in significant ways for the given data. Modeling crop productivity through innumerable agronomical constraints has become a crucial task for evolving sustainable agricultural policies. The cross-sectional datasets of 26430 (D1) crop-cut experiments are taken by 2nd-stage area frame sampling, collected from crop reporting service. This research is taken as follows: firstly three more effective numerical optimized datasets are generated (D1, D2, and D3) from D1 by taking the centroid points of features which decrease the sample size; secondly MLM is integrated with the traditional statistical models (TSMs) for multiple linear regression (MLR), and thirdly decision tree regression (DTR) and random forest regression (RFR) are deployed to get the optimized models able to predict the wheat productivity well with 75% datasets to train and 25% to test the model using the evaluation metrics (R2, RMSE), information criterion (AIC) with weights (AICW), evidence ration (E.R), and decompositions of prediction error. The MLR outperformed for MLM than TSM. The performance capability of MLM and TSM got upswing for generated datasets. RFR got optimized and superperformed for D1, D2, D3, and D4. This study demonstrated strong evidences for deploying MLM for prediction of wheat productivity as an alternative of traditional statistical modeling. |
format | Article |
id | doaj-art-6584a8ad0b064338873f87dcecacbe99 |
institution | Kabale University |
issn | 2090-908X |
language | English |
publishDate | 2022-01-01 |
publisher | Wiley |
record_format | Article |
series | Scientifica |
spelling | doaj-art-6584a8ad0b064338873f87dcecacbe992025-02-03T01:10:58ZengWileyScientifica2090-908X2022-01-01202210.1155/2022/7271293A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat ProductivityMuhammad Islam0Farrukh Shehzad1Department of StatisticsDepartment of StatisticsMachine learning algorithms are rapidly deploying and have made manifold breakthroughs in various fields. The optimization of algorithms got abundant attention of researchers being a core component for deploying the machine learning model (MLM) abled to learn the parameters in significant ways for the given data. Modeling crop productivity through innumerable agronomical constraints has become a crucial task for evolving sustainable agricultural policies. The cross-sectional datasets of 26430 (D1) crop-cut experiments are taken by 2nd-stage area frame sampling, collected from crop reporting service. This research is taken as follows: firstly three more effective numerical optimized datasets are generated (D1, D2, and D3) from D1 by taking the centroid points of features which decrease the sample size; secondly MLM is integrated with the traditional statistical models (TSMs) for multiple linear regression (MLR), and thirdly decision tree regression (DTR) and random forest regression (RFR) are deployed to get the optimized models able to predict the wheat productivity well with 75% datasets to train and 25% to test the model using the evaluation metrics (R2, RMSE), information criterion (AIC) with weights (AICW), evidence ration (E.R), and decompositions of prediction error. The MLR outperformed for MLM than TSM. The performance capability of MLM and TSM got upswing for generated datasets. RFR got optimized and superperformed for D1, D2, D3, and D4. This study demonstrated strong evidences for deploying MLM for prediction of wheat productivity as an alternative of traditional statistical modeling.http://dx.doi.org/10.1155/2022/7271293 |
spellingShingle | Muhammad Islam Farrukh Shehzad A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity Scientifica |
title | A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity |
title_full | A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity |
title_fullStr | A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity |
title_full_unstemmed | A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity |
title_short | A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity |
title_sort | prediction model optimization critiques through centroid clustering by reducing the sample size integrating statistical and machine learning techniques for wheat productivity |
url | http://dx.doi.org/10.1155/2022/7271293 |
work_keys_str_mv | AT muhammadislam apredictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity AT farrukhshehzad apredictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity AT muhammadislam predictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity AT farrukhshehzad predictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity |