Integrating Remote Sensing and Soil Features for Enhanced Machine Learning-Based Corn Yield Prediction in the Southern US

Efficient and reliable corn (<i>Zea mays</i> L.) yield prediction is important for varietal selection by plant breeders and management decision-making by growers. Unlike prior studies that focus mainly on county-level or controlled laboratory-scale areas, this study targets a production-...

Full description

Saved in:
Bibliographic Details
Main Authors: Sayantan Sarkar, Javier M. Osorio Leyton, Efrain Noa-Yarasca, Kabindra Adhikari, Chad B. Hajda, Douglas R. Smith
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/25/2/543
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832587479518019584
author Sayantan Sarkar
Javier M. Osorio Leyton
Efrain Noa-Yarasca
Kabindra Adhikari
Chad B. Hajda
Douglas R. Smith
author_facet Sayantan Sarkar
Javier M. Osorio Leyton
Efrain Noa-Yarasca
Kabindra Adhikari
Chad B. Hajda
Douglas R. Smith
author_sort Sayantan Sarkar
collection DOAJ
description Efficient and reliable corn (<i>Zea mays</i> L.) yield prediction is important for varietal selection by plant breeders and management decision-making by growers. Unlike prior studies that focus mainly on county-level or controlled laboratory-scale areas, this study targets a production-scale area, better representing real-world agricultural conditions and offering more practical relevance for farmers. Therefore, the objective of our study was to determine the best combination of vegetation indices and abiotic factors for predicting corn yield in a rain-fed, production-scale area, identify the most suitable corn growth stage for yield estimation using machine learning, and identify the most effective machine learning model for corn yield estimation. Our study used high-resolution (6 cm) aerial multispectral imagery. Sixty-two different predictors, including soil properties (sand, silt, and clay percentages), slope, spectral bands (red, green, blue, red-edge, NIR), vegetation indices (GNDRE, NDRE, TGI), color-space indices, and wavelengths were derived from the multispectral data collected at the seven (V4, V5, V6, V7, V9, V12, and V14/VT) growth stages of corn. Four regression and machine learning algorithms were evaluated for yield prediction: linear regression, random forest, extreme gradient boosting, and gradient boosting regressor. A total of 6865 yield values were used for model training and 1716 for validation. Results show that, using random forest method, the V14/VT stage had the best yield predictions (RMSE of 0.52 Mg/ha for a mean yield of 10.19 Mg/ha), and yield estimation at V6 stage was still feasible. We concluded that integrating abiotic factors, such as slope and soil properties, significantly improved model accuracy. Among vegetation indices, TGI, HUE, and GNDRE performed better. Results from this study can help farmers or crop consultants plan ahead for future logistics through enhanced early-season yield predictions and support farm profitability and sustainability.
format Article
id doaj-art-196c0ff9a3fd4202ab0e868640c98b11
institution Kabale University
issn 1424-8220
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj-art-196c0ff9a3fd4202ab0e868640c98b112025-01-24T13:49:18ZengMDPI AGSensors1424-82202025-01-0125254310.3390/s25020543Integrating Remote Sensing and Soil Features for Enhanced Machine Learning-Based Corn Yield Prediction in the Southern USSayantan Sarkar0Javier M. Osorio Leyton1Efrain Noa-Yarasca2Kabindra Adhikari3Chad B. Hajda4Douglas R. Smith5Texas A&M AgriLife Blackland Research and Extension Center, Temple, TX 76502, USATexas A&M AgriLife Blackland Research and Extension Center, Temple, TX 76502, USATexas A&M AgriLife Blackland Research and Extension Center, Temple, TX 76502, USAUnited States Department of Agriculture–Agriculture Research Service, Grassland Soil and Water Research Laboratory, Temple, TX 76502, USAUnited States Department of Agriculture–Agriculture Research Service, Grassland Soil and Water Research Laboratory, Temple, TX 76502, USAUnited States Department of Agriculture–Agriculture Research Service, Grassland Soil and Water Research Laboratory, Temple, TX 76502, USAEfficient and reliable corn (<i>Zea mays</i> L.) yield prediction is important for varietal selection by plant breeders and management decision-making by growers. Unlike prior studies that focus mainly on county-level or controlled laboratory-scale areas, this study targets a production-scale area, better representing real-world agricultural conditions and offering more practical relevance for farmers. Therefore, the objective of our study was to determine the best combination of vegetation indices and abiotic factors for predicting corn yield in a rain-fed, production-scale area, identify the most suitable corn growth stage for yield estimation using machine learning, and identify the most effective machine learning model for corn yield estimation. Our study used high-resolution (6 cm) aerial multispectral imagery. Sixty-two different predictors, including soil properties (sand, silt, and clay percentages), slope, spectral bands (red, green, blue, red-edge, NIR), vegetation indices (GNDRE, NDRE, TGI), color-space indices, and wavelengths were derived from the multispectral data collected at the seven (V4, V5, V6, V7, V9, V12, and V14/VT) growth stages of corn. Four regression and machine learning algorithms were evaluated for yield prediction: linear regression, random forest, extreme gradient boosting, and gradient boosting regressor. A total of 6865 yield values were used for model training and 1716 for validation. Results show that, using random forest method, the V14/VT stage had the best yield predictions (RMSE of 0.52 Mg/ha for a mean yield of 10.19 Mg/ha), and yield estimation at V6 stage was still feasible. We concluded that integrating abiotic factors, such as slope and soil properties, significantly improved model accuracy. Among vegetation indices, TGI, HUE, and GNDRE performed better. Results from this study can help farmers or crop consultants plan ahead for future logistics through enhanced early-season yield predictions and support farm profitability and sustainability.https://www.mdpi.com/1424-8220/25/2/543cornmaizeyield predictionmachine learningvegetation indicesensemble methods
spellingShingle Sayantan Sarkar
Javier M. Osorio Leyton
Efrain Noa-Yarasca
Kabindra Adhikari
Chad B. Hajda
Douglas R. Smith
Integrating Remote Sensing and Soil Features for Enhanced Machine Learning-Based Corn Yield Prediction in the Southern US
Sensors
corn
maize
yield prediction
machine learning
vegetation indices
ensemble methods
title Integrating Remote Sensing and Soil Features for Enhanced Machine Learning-Based Corn Yield Prediction in the Southern US
title_full Integrating Remote Sensing and Soil Features for Enhanced Machine Learning-Based Corn Yield Prediction in the Southern US
title_fullStr Integrating Remote Sensing and Soil Features for Enhanced Machine Learning-Based Corn Yield Prediction in the Southern US
title_full_unstemmed Integrating Remote Sensing and Soil Features for Enhanced Machine Learning-Based Corn Yield Prediction in the Southern US
title_short Integrating Remote Sensing and Soil Features for Enhanced Machine Learning-Based Corn Yield Prediction in the Southern US
title_sort integrating remote sensing and soil features for enhanced machine learning based corn yield prediction in the southern us
topic corn
maize
yield prediction
machine learning
vegetation indices
ensemble methods
url https://www.mdpi.com/1424-8220/25/2/543
work_keys_str_mv AT sayantansarkar integratingremotesensingandsoilfeaturesforenhancedmachinelearningbasedcornyieldpredictioninthesouthernus
AT javiermosorioleyton integratingremotesensingandsoilfeaturesforenhancedmachinelearningbasedcornyieldpredictioninthesouthernus
AT efrainnoayarasca integratingremotesensingandsoilfeaturesforenhancedmachinelearningbasedcornyieldpredictioninthesouthernus
AT kabindraadhikari integratingremotesensingandsoilfeaturesforenhancedmachinelearningbasedcornyieldpredictioninthesouthernus
AT chadbhajda integratingremotesensingandsoilfeaturesforenhancedmachinelearningbasedcornyieldpredictioninthesouthernus
AT douglasrsmith integratingremotesensingandsoilfeaturesforenhancedmachinelearningbasedcornyieldpredictioninthesouthernus