Seed Protein Content Estimation with Bench-Top Hyperspectral Imaging and Attentive Convolutional Neural Network Models

Wheat is a globally cultivated cereal crop with substantial protein content present in its seeds. This research aimed to develop robust methods for predicting seed protein concentration in wheat seeds using bench-top hyperspectral imaging in the visible, near-infrared (VNIR), and shortwave infrared...

Full description

Saved in:
Bibliographic Details
Main Authors: Imran Said, Vasit Sagan, Kyle T. Peterson, Haireti Alifu, Abuduwanli Maiwulanjiang, Abby Stylianou, Omar Al Akkad, Supria Sarkar, Noor Al Shakarji
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Sensors
Subjects:
Online Access:https://www.mdpi.com/1424-8220/25/2/303
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832587585303609344
author Imran Said
Vasit Sagan
Kyle T. Peterson
Haireti Alifu
Abuduwanli Maiwulanjiang
Abby Stylianou
Omar Al Akkad
Supria Sarkar
Noor Al Shakarji
author_facet Imran Said
Vasit Sagan
Kyle T. Peterson
Haireti Alifu
Abuduwanli Maiwulanjiang
Abby Stylianou
Omar Al Akkad
Supria Sarkar
Noor Al Shakarji
author_sort Imran Said
collection DOAJ
description Wheat is a globally cultivated cereal crop with substantial protein content present in its seeds. This research aimed to develop robust methods for predicting seed protein concentration in wheat seeds using bench-top hyperspectral imaging in the visible, near-infrared (VNIR), and shortwave infrared (SWIR) regions. To fully utilize the spectral and texture features of the full VNIR and SWIR spectral domains, a computer-vision-aided image co-registration methodology was implemented to seamlessly align the VNIR and SWIR bands. Sensitivity analyses were also conducted to identify the most sensitive bands for seed protein estimation. Convolutional neural networks (CNNs) with attention mechanisms were proposed along with traditional machine learning models based on feature engineering including Random Forest (RF) and Support Vector Machine (SVM) regression for comparative analysis. Additionally, the CNN classification approach was used to estimate low, medium, and high protein concentrations because this type of classification is more applicable for breeding efforts. Our results showed that the proposed CNN with attention mechanisms predicted wheat protein content with R<sup>2</sup> values of 0.70 and 0.65 for ventral and dorsal seed orientations, respectively. Although, the R<sup>2</sup> of the CNN approach was lower than of the best performing feature-based method, RF (R<sup>2</sup> of 0.77), end-to-end prediction capabilities with CNN hold great promise for the automation of wheat protein estimation for breeding. The CNN model achieved better classification of protein concentrations between low, medium, and high protein contents, with an R<sup>2</sup> of 0.82. This study’s findings highlight the significant potential of hyperspectral imaging and machine learning techniques for advancing precision breeding practices, optimizing seed sorting processes, and enabling targeted agricultural input applications.
format Article
id doaj-art-998166f1bc5649e4aa201b8907430bc0
institution Kabale University
issn 1424-8220
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj-art-998166f1bc5649e4aa201b8907430bc02025-01-24T13:48:26ZengMDPI AGSensors1424-82202025-01-0125230310.3390/s25020303Seed Protein Content Estimation with Bench-Top Hyperspectral Imaging and Attentive Convolutional Neural Network ModelsImran Said0Vasit Sagan1Kyle T. Peterson2Haireti Alifu3Abuduwanli Maiwulanjiang4Abby Stylianou5Omar Al Akkad6Supria Sarkar7Noor Al Shakarji8Department of Computer Science, Saint Louis University, Saint Louis, MO 63104, USADepartment of Computer Science, Saint Louis University, Saint Louis, MO 63104, USABayer Crop Science, 800 N Lindbergh Blvd, Creve Coeur, MO 63141, USADepartment of Earth, Environment and Geospatial Sciences, Saint Louis University, Saint Louis, MO 63108, USADepartment of Earth, Environment and Geospatial Sciences, Saint Louis University, Saint Louis, MO 63108, USADepartment of Computer Science, Saint Louis University, Saint Louis, MO 63104, USADepartment of Earth, Environment and Geospatial Sciences, Saint Louis University, Saint Louis, MO 63108, USADepartment of Earth, Environment and Geospatial Sciences, Saint Louis University, Saint Louis, MO 63108, USADepartment of Earth, Environment and Geospatial Sciences, Saint Louis University, Saint Louis, MO 63108, USAWheat is a globally cultivated cereal crop with substantial protein content present in its seeds. This research aimed to develop robust methods for predicting seed protein concentration in wheat seeds using bench-top hyperspectral imaging in the visible, near-infrared (VNIR), and shortwave infrared (SWIR) regions. To fully utilize the spectral and texture features of the full VNIR and SWIR spectral domains, a computer-vision-aided image co-registration methodology was implemented to seamlessly align the VNIR and SWIR bands. Sensitivity analyses were also conducted to identify the most sensitive bands for seed protein estimation. Convolutional neural networks (CNNs) with attention mechanisms were proposed along with traditional machine learning models based on feature engineering including Random Forest (RF) and Support Vector Machine (SVM) regression for comparative analysis. Additionally, the CNN classification approach was used to estimate low, medium, and high protein concentrations because this type of classification is more applicable for breeding efforts. Our results showed that the proposed CNN with attention mechanisms predicted wheat protein content with R<sup>2</sup> values of 0.70 and 0.65 for ventral and dorsal seed orientations, respectively. Although, the R<sup>2</sup> of the CNN approach was lower than of the best performing feature-based method, RF (R<sup>2</sup> of 0.77), end-to-end prediction capabilities with CNN hold great promise for the automation of wheat protein estimation for breeding. The CNN model achieved better classification of protein concentrations between low, medium, and high protein contents, with an R<sup>2</sup> of 0.82. This study’s findings highlight the significant potential of hyperspectral imaging and machine learning techniques for advancing precision breeding practices, optimizing seed sorting processes, and enabling targeted agricultural input applications.https://www.mdpi.com/1424-8220/25/2/303hyperspectral imagingseed composition estimationmachine learning3D CNN modelingattentive models
spellingShingle Imran Said
Vasit Sagan
Kyle T. Peterson
Haireti Alifu
Abuduwanli Maiwulanjiang
Abby Stylianou
Omar Al Akkad
Supria Sarkar
Noor Al Shakarji
Seed Protein Content Estimation with Bench-Top Hyperspectral Imaging and Attentive Convolutional Neural Network Models
Sensors
hyperspectral imaging
seed composition estimation
machine learning
3D CNN modeling
attentive models
title Seed Protein Content Estimation with Bench-Top Hyperspectral Imaging and Attentive Convolutional Neural Network Models
title_full Seed Protein Content Estimation with Bench-Top Hyperspectral Imaging and Attentive Convolutional Neural Network Models
title_fullStr Seed Protein Content Estimation with Bench-Top Hyperspectral Imaging and Attentive Convolutional Neural Network Models
title_full_unstemmed Seed Protein Content Estimation with Bench-Top Hyperspectral Imaging and Attentive Convolutional Neural Network Models
title_short Seed Protein Content Estimation with Bench-Top Hyperspectral Imaging and Attentive Convolutional Neural Network Models
title_sort seed protein content estimation with bench top hyperspectral imaging and attentive convolutional neural network models
topic hyperspectral imaging
seed composition estimation
machine learning
3D CNN modeling
attentive models
url https://www.mdpi.com/1424-8220/25/2/303
work_keys_str_mv AT imransaid seedproteincontentestimationwithbenchtophyperspectralimagingandattentiveconvolutionalneuralnetworkmodels
AT vasitsagan seedproteincontentestimationwithbenchtophyperspectralimagingandattentiveconvolutionalneuralnetworkmodels
AT kyletpeterson seedproteincontentestimationwithbenchtophyperspectralimagingandattentiveconvolutionalneuralnetworkmodels
AT hairetialifu seedproteincontentestimationwithbenchtophyperspectralimagingandattentiveconvolutionalneuralnetworkmodels
AT abuduwanlimaiwulanjiang seedproteincontentestimationwithbenchtophyperspectralimagingandattentiveconvolutionalneuralnetworkmodels
AT abbystylianou seedproteincontentestimationwithbenchtophyperspectralimagingandattentiveconvolutionalneuralnetworkmodels
AT omaralakkad seedproteincontentestimationwithbenchtophyperspectralimagingandattentiveconvolutionalneuralnetworkmodels
AT supriasarkar seedproteincontentestimationwithbenchtophyperspectralimagingandattentiveconvolutionalneuralnetworkmodels
AT nooralshakarji seedproteincontentestimationwithbenchtophyperspectralimagingandattentiveconvolutionalneuralnetworkmodels