Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision

IntroductionAdvancements in machine learning (ML) algorithms that make predictions from data without being explicitly programmed and the increased computational speeds of graphics processing units (GPUs) over the last decade have led to remarkable progress in the capabilities of ML. In many fields,...

Full description

Saved in:
Bibliographic Details
Main Authors: Lucas Waltz, Sushma Katari, Chaeun Hong, Adit Anup, Julian Colbert, Anirudh Potlapally, Taylor Dill, Canaan Porter, John Engle, Christopher Stewart, Hari Subramoni, Scott Shearer, Raghu Machiraju, Osler Ortez, Laura Lindsey, Arnab Nandi, Sami Khanal
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-01-01
Series:Frontiers in Artificial Intelligence
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/frai.2024.1496066/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832590862200078336
author Lucas Waltz
Sushma Katari
Chaeun Hong
Adit Anup
Julian Colbert
Anirudh Potlapally
Taylor Dill
Canaan Porter
John Engle
John Engle
Christopher Stewart
Hari Subramoni
Scott Shearer
Raghu Machiraju
Osler Ortez
Laura Lindsey
Arnab Nandi
Sami Khanal
author_facet Lucas Waltz
Sushma Katari
Chaeun Hong
Adit Anup
Julian Colbert
Anirudh Potlapally
Taylor Dill
Canaan Porter
John Engle
John Engle
Christopher Stewart
Hari Subramoni
Scott Shearer
Raghu Machiraju
Osler Ortez
Laura Lindsey
Arnab Nandi
Sami Khanal
author_sort Lucas Waltz
collection DOAJ
description IntroductionAdvancements in machine learning (ML) algorithms that make predictions from data without being explicitly programmed and the increased computational speeds of graphics processing units (GPUs) over the last decade have led to remarkable progress in the capabilities of ML. In many fields, including agriculture, this progress has outpaced the availability of sufficiently diverse and high-quality datasets, which now serve as a limiting factor. While many agricultural use cases appear feasible with current compute resources and ML algorithms, the lack of reusable hardware and software components, referred to as cyberinfrastructure (CI), for collecting, transmitting, cleaning, labeling, and training datasets is a major hindrance toward developing solutions to address agricultural use cases. This study focuses on addressing these challenges by exploring the collection, processing, and training of ML models using a multimodal dataset and providing a vision for agriculture-focused CI to accelerate innovation in the field.MethodsData were collected during the 2023 growing season from three agricultural research locations across Ohio. The dataset includes 1 terabyte (TB) of multimodal data, comprising Unmanned Aerial System (UAS) imagery (RGB and multispectral), as well as soil and weather sensor data. The two primary crops studied were corn and soybean, which are the state's most widely cultivated crops. The data collected and processed from this study were used to train ML models to make predictions of crop growth stage, soil moisture, and final yield.ResultsThe exercise of processing this dataset resulted in four CI components that can be used to provide higher accuracy predictions in the agricultural domain. These components included (1) a UAS imagery pipeline that reduced processing time and improved image quality over standard methods, (2) a tabular data pipeline that aggregated data from multiple sources and temporal resolutions and aligned it with a common temporal resolution, (3) an approach to adapting the model architecture for a vision transformer (ViT) that incorporates agricultural domain expertise, and (4) a data visualization prototype that was used to identify outliers and improve trust in the data.DiscussionFurther work will be aimed at maturing the CI components and implementing them on high performance computing (HPC). There are open questions as to how CI components like these can best be leveraged to serve the needs of the agricultural community to accelerate the development of ML applications in agriculture.
format Article
id doaj-art-0bcf468483124fd3aa2760db5a22e421
institution Kabale University
issn 2624-8212
language English
publishDate 2025-01-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Artificial Intelligence
spelling doaj-art-0bcf468483124fd3aa2760db5a22e4212025-01-23T06:56:02ZengFrontiers Media S.A.Frontiers in Artificial Intelligence2624-82122025-01-01710.3389/frai.2024.14960661496066Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and visionLucas Waltz0Sushma Katari1Chaeun Hong2Adit Anup3Julian Colbert4Anirudh Potlapally5Taylor Dill6Canaan Porter7John Engle8John Engle9Christopher Stewart10Hari Subramoni11Scott Shearer12Raghu Machiraju13Osler Ortez14Laura Lindsey15Arnab Nandi16Sami Khanal17Department of Food, Agricultural, and Biological Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Food, Agricultural, and Biological Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Horticulture and Crop Science, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Food, Agricultural, and Biological Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Food, Agricultural, and Biological Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Horticulture and Crop Science, The Ohio State University, Columbus, OH, United StatesDepartment of Horticulture and Crop Science, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Food, Agricultural, and Biological Engineering, The Ohio State University, Columbus, OH, United StatesIntroductionAdvancements in machine learning (ML) algorithms that make predictions from data without being explicitly programmed and the increased computational speeds of graphics processing units (GPUs) over the last decade have led to remarkable progress in the capabilities of ML. In many fields, including agriculture, this progress has outpaced the availability of sufficiently diverse and high-quality datasets, which now serve as a limiting factor. While many agricultural use cases appear feasible with current compute resources and ML algorithms, the lack of reusable hardware and software components, referred to as cyberinfrastructure (CI), for collecting, transmitting, cleaning, labeling, and training datasets is a major hindrance toward developing solutions to address agricultural use cases. This study focuses on addressing these challenges by exploring the collection, processing, and training of ML models using a multimodal dataset and providing a vision for agriculture-focused CI to accelerate innovation in the field.MethodsData were collected during the 2023 growing season from three agricultural research locations across Ohio. The dataset includes 1 terabyte (TB) of multimodal data, comprising Unmanned Aerial System (UAS) imagery (RGB and multispectral), as well as soil and weather sensor data. The two primary crops studied were corn and soybean, which are the state's most widely cultivated crops. The data collected and processed from this study were used to train ML models to make predictions of crop growth stage, soil moisture, and final yield.ResultsThe exercise of processing this dataset resulted in four CI components that can be used to provide higher accuracy predictions in the agricultural domain. These components included (1) a UAS imagery pipeline that reduced processing time and improved image quality over standard methods, (2) a tabular data pipeline that aggregated data from multiple sources and temporal resolutions and aligned it with a common temporal resolution, (3) an approach to adapting the model architecture for a vision transformer (ViT) that incorporates agricultural domain expertise, and (4) a data visualization prototype that was used to identify outliers and improve trust in the data.DiscussionFurther work will be aimed at maturing the CI components and implementing them on high performance computing (HPC). There are open questions as to how CI components like these can best be leveraged to serve the needs of the agricultural community to accelerate the development of ML applications in agriculture.https://www.frontiersin.org/articles/10.3389/frai.2024.1496066/fullprecision agriculturemultimodal datamachine learningUnmanned Aerial Systemscrop phenotypingcyberinfrastructure
spellingShingle Lucas Waltz
Sushma Katari
Chaeun Hong
Adit Anup
Julian Colbert
Anirudh Potlapally
Taylor Dill
Canaan Porter
John Engle
John Engle
Christopher Stewart
Hari Subramoni
Scott Shearer
Raghu Machiraju
Osler Ortez
Laura Lindsey
Arnab Nandi
Sami Khanal
Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision
Frontiers in Artificial Intelligence
precision agriculture
multimodal data
machine learning
Unmanned Aerial Systems
crop phenotyping
cyberinfrastructure
title Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision
title_full Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision
title_fullStr Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision
title_full_unstemmed Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision
title_short Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision
title_sort cyberinfrastructure for machine learning applications in agriculture experiences analysis and vision
topic precision agriculture
multimodal data
machine learning
Unmanned Aerial Systems
crop phenotyping
cyberinfrastructure
url https://www.frontiersin.org/articles/10.3389/frai.2024.1496066/full
work_keys_str_mv AT lucaswaltz cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT sushmakatari cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT chaeunhong cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT aditanup cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT juliancolbert cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT anirudhpotlapally cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT taylordill cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT canaanporter cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT johnengle cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT johnengle cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT christopherstewart cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT harisubramoni cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT scottshearer cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT raghumachiraju cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT oslerortez cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT lauralindsey cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT arnabnandi cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision
AT samikhanal cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision