Construction and Optimization of Integrated Yield Prediction Model Based on Phenotypic Characteristics of Rice Grown in Small–Scale Plantations

An intelligent prediction model for rice yield in small-scale cultivation areas can provide precise forecasting results for farmers, rice planting enterprises, and researchers, holding significant importance for agricultural industries and crop science research within small regions. Although machine...

Full description

Saved in:
Bibliographic Details
Main Authors: Jihong Sun, Peng Tian, Zhaowen Li, Xinrui Wang, Haokai Zhang, Jiangquan Chen, Ye Qian
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Agriculture
Subjects:
Online Access:https://www.mdpi.com/2077-0472/15/2/181
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832589484983582720
author Jihong Sun
Peng Tian
Zhaowen Li
Xinrui Wang
Haokai Zhang
Jiangquan Chen
Ye Qian
author_facet Jihong Sun
Peng Tian
Zhaowen Li
Xinrui Wang
Haokai Zhang
Jiangquan Chen
Ye Qian
author_sort Jihong Sun
collection DOAJ
description An intelligent prediction model for rice yield in small-scale cultivation areas can provide precise forecasting results for farmers, rice planting enterprises, and researchers, holding significant importance for agricultural industries and crop science research within small regions. Although machine learning can handle complex nonlinear problems to enhance prediction accuracy, further improvements in models are still needed to accurately predict rice yields in small areas facing complex planting environments, thereby enhancing model performance. This study employs four rice phenotypic traits, namely, panicle angle, panicle length, total branch length, and grain number, along with seven machine learning methods—multiple linear regression, support vector machine, MLP, random forest, GBR, XGBoost, and LightGBM—to construct a yield prediction model group. Subsequently, the top three models with the best performance in individual model predictions are integrated using voting and stacking ensemble methods to obtain the optimal integrated model. Finally, the impact of different rice phenotypic traits on the performance of the stacked ensemble model is explored. Experimental results indicate that the random forest model performs best after individual machine learning modeling, with RMSE, R<sup>2</sup>, and MAPE values of 0.2777, 0.9062, and 17.04%, respectively. After model integration, Stacking–3m demonstrates the best performance, with RMSE, R<sup>2</sup>, and MAPE values of 0.2483, 0.9250, and 6.90%, respectively. Compared to the performance after random forest modeling, the RMSE decreased by 10.58%, R<sup>2</sup> increased by 1.88%, and MAPE decreased by 0.76%, indicating improved model performance after stacking ensemble. The Stacking–3m model, which demonstrated the best comprehensive evaluation metrics, was selected for model validation, and the validation results were satisfactory, with MAE, R<sup>2</sup>, and MAPE values of 8.3384, 0.9285, and 0.2689, respectively. The above research findings demonstrate that this integrated model possesses high practical value and fills a gap in precise yield prediction for small-scale rice cultivation in the Yunnan Plateau region.
format Article
id doaj-art-93d4a89dbd7f46a5b9ac3a2d3b195468
institution Kabale University
issn 2077-0472
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Agriculture
spelling doaj-art-93d4a89dbd7f46a5b9ac3a2d3b1954682025-01-24T13:16:02ZengMDPI AGAgriculture2077-04722025-01-0115218110.3390/agriculture15020181Construction and Optimization of Integrated Yield Prediction Model Based on Phenotypic Characteristics of Rice Grown in Small–Scale PlantationsJihong Sun0Peng Tian1Zhaowen Li2Xinrui Wang3Haokai Zhang4Jiangquan Chen5Ye Qian6College of Agronomy and Biotechnology, Yunnan Agricultural University, Kunming 650201, ChinaThe Key Laboratory for Crop Production and Smart Agriculture of Yunnan Province, Yunnan Agricultural University, Kunming 650201, ChinaThe Key Laboratory for Crop Production and Smart Agriculture of Yunnan Province, Yunnan Agricultural University, Kunming 650201, ChinaThe Key Laboratory for Crop Production and Smart Agriculture of Yunnan Province, Yunnan Agricultural University, Kunming 650201, ChinaEngineering College, China Agricultural University, Beijing 100091, ChinaCollege of Big Data, Yunnan Agricultural University, Kunming 650201, ChinaThe Key Laboratory for Crop Production and Smart Agriculture of Yunnan Province, Yunnan Agricultural University, Kunming 650201, ChinaAn intelligent prediction model for rice yield in small-scale cultivation areas can provide precise forecasting results for farmers, rice planting enterprises, and researchers, holding significant importance for agricultural industries and crop science research within small regions. Although machine learning can handle complex nonlinear problems to enhance prediction accuracy, further improvements in models are still needed to accurately predict rice yields in small areas facing complex planting environments, thereby enhancing model performance. This study employs four rice phenotypic traits, namely, panicle angle, panicle length, total branch length, and grain number, along with seven machine learning methods—multiple linear regression, support vector machine, MLP, random forest, GBR, XGBoost, and LightGBM—to construct a yield prediction model group. Subsequently, the top three models with the best performance in individual model predictions are integrated using voting and stacking ensemble methods to obtain the optimal integrated model. Finally, the impact of different rice phenotypic traits on the performance of the stacked ensemble model is explored. Experimental results indicate that the random forest model performs best after individual machine learning modeling, with RMSE, R<sup>2</sup>, and MAPE values of 0.2777, 0.9062, and 17.04%, respectively. After model integration, Stacking–3m demonstrates the best performance, with RMSE, R<sup>2</sup>, and MAPE values of 0.2483, 0.9250, and 6.90%, respectively. Compared to the performance after random forest modeling, the RMSE decreased by 10.58%, R<sup>2</sup> increased by 1.88%, and MAPE decreased by 0.76%, indicating improved model performance after stacking ensemble. The Stacking–3m model, which demonstrated the best comprehensive evaluation metrics, was selected for model validation, and the validation results were satisfactory, with MAE, R<sup>2</sup>, and MAPE values of 8.3384, 0.9285, and 0.2689, respectively. The above research findings demonstrate that this integrated model possesses high practical value and fills a gap in precise yield prediction for small-scale rice cultivation in the Yunnan Plateau region.https://www.mdpi.com/2077-0472/15/2/181integrated modelmachine learningrice phenotypeStacking–3myield prediction
spellingShingle Jihong Sun
Peng Tian
Zhaowen Li
Xinrui Wang
Haokai Zhang
Jiangquan Chen
Ye Qian
Construction and Optimization of Integrated Yield Prediction Model Based on Phenotypic Characteristics of Rice Grown in Small–Scale Plantations
Agriculture
integrated model
machine learning
rice phenotype
Stacking–3m
yield prediction
title Construction and Optimization of Integrated Yield Prediction Model Based on Phenotypic Characteristics of Rice Grown in Small–Scale Plantations
title_full Construction and Optimization of Integrated Yield Prediction Model Based on Phenotypic Characteristics of Rice Grown in Small–Scale Plantations
title_fullStr Construction and Optimization of Integrated Yield Prediction Model Based on Phenotypic Characteristics of Rice Grown in Small–Scale Plantations
title_full_unstemmed Construction and Optimization of Integrated Yield Prediction Model Based on Phenotypic Characteristics of Rice Grown in Small–Scale Plantations
title_short Construction and Optimization of Integrated Yield Prediction Model Based on Phenotypic Characteristics of Rice Grown in Small–Scale Plantations
title_sort construction and optimization of integrated yield prediction model based on phenotypic characteristics of rice grown in small scale plantations
topic integrated model
machine learning
rice phenotype
Stacking–3m
yield prediction
url https://www.mdpi.com/2077-0472/15/2/181
work_keys_str_mv AT jihongsun constructionandoptimizationofintegratedyieldpredictionmodelbasedonphenotypiccharacteristicsofricegrowninsmallscaleplantations
AT pengtian constructionandoptimizationofintegratedyieldpredictionmodelbasedonphenotypiccharacteristicsofricegrowninsmallscaleplantations
AT zhaowenli constructionandoptimizationofintegratedyieldpredictionmodelbasedonphenotypiccharacteristicsofricegrowninsmallscaleplantations
AT xinruiwang constructionandoptimizationofintegratedyieldpredictionmodelbasedonphenotypiccharacteristicsofricegrowninsmallscaleplantations
AT haokaizhang constructionandoptimizationofintegratedyieldpredictionmodelbasedonphenotypiccharacteristicsofricegrowninsmallscaleplantations
AT jiangquanchen constructionandoptimizationofintegratedyieldpredictionmodelbasedonphenotypiccharacteristicsofricegrowninsmallscaleplantations
AT yeqian constructionandoptimizationofintegratedyieldpredictionmodelbasedonphenotypiccharacteristicsofricegrowninsmallscaleplantations