Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision
IntroductionAdvancements in machine learning (ML) algorithms that make predictions from data without being explicitly programmed and the increased computational speeds of graphics processing units (GPUs) over the last decade have led to remarkable progress in the capabilities of ML. In many fields,...
Saved in:
Main Authors: | , , , , , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Frontiers Media S.A.
2025-01-01
|
Series: | Frontiers in Artificial Intelligence |
Subjects: | |
Online Access: | https://www.frontiersin.org/articles/10.3389/frai.2024.1496066/full |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832590862200078336 |
---|---|
author | Lucas Waltz Sushma Katari Chaeun Hong Adit Anup Julian Colbert Anirudh Potlapally Taylor Dill Canaan Porter John Engle John Engle Christopher Stewart Hari Subramoni Scott Shearer Raghu Machiraju Osler Ortez Laura Lindsey Arnab Nandi Sami Khanal |
author_facet | Lucas Waltz Sushma Katari Chaeun Hong Adit Anup Julian Colbert Anirudh Potlapally Taylor Dill Canaan Porter John Engle John Engle Christopher Stewart Hari Subramoni Scott Shearer Raghu Machiraju Osler Ortez Laura Lindsey Arnab Nandi Sami Khanal |
author_sort | Lucas Waltz |
collection | DOAJ |
description | IntroductionAdvancements in machine learning (ML) algorithms that make predictions from data without being explicitly programmed and the increased computational speeds of graphics processing units (GPUs) over the last decade have led to remarkable progress in the capabilities of ML. In many fields, including agriculture, this progress has outpaced the availability of sufficiently diverse and high-quality datasets, which now serve as a limiting factor. While many agricultural use cases appear feasible with current compute resources and ML algorithms, the lack of reusable hardware and software components, referred to as cyberinfrastructure (CI), for collecting, transmitting, cleaning, labeling, and training datasets is a major hindrance toward developing solutions to address agricultural use cases. This study focuses on addressing these challenges by exploring the collection, processing, and training of ML models using a multimodal dataset and providing a vision for agriculture-focused CI to accelerate innovation in the field.MethodsData were collected during the 2023 growing season from three agricultural research locations across Ohio. The dataset includes 1 terabyte (TB) of multimodal data, comprising Unmanned Aerial System (UAS) imagery (RGB and multispectral), as well as soil and weather sensor data. The two primary crops studied were corn and soybean, which are the state's most widely cultivated crops. The data collected and processed from this study were used to train ML models to make predictions of crop growth stage, soil moisture, and final yield.ResultsThe exercise of processing this dataset resulted in four CI components that can be used to provide higher accuracy predictions in the agricultural domain. These components included (1) a UAS imagery pipeline that reduced processing time and improved image quality over standard methods, (2) a tabular data pipeline that aggregated data from multiple sources and temporal resolutions and aligned it with a common temporal resolution, (3) an approach to adapting the model architecture for a vision transformer (ViT) that incorporates agricultural domain expertise, and (4) a data visualization prototype that was used to identify outliers and improve trust in the data.DiscussionFurther work will be aimed at maturing the CI components and implementing them on high performance computing (HPC). There are open questions as to how CI components like these can best be leveraged to serve the needs of the agricultural community to accelerate the development of ML applications in agriculture. |
format | Article |
id | doaj-art-0bcf468483124fd3aa2760db5a22e421 |
institution | Kabale University |
issn | 2624-8212 |
language | English |
publishDate | 2025-01-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Artificial Intelligence |
spelling | doaj-art-0bcf468483124fd3aa2760db5a22e4212025-01-23T06:56:02ZengFrontiers Media S.A.Frontiers in Artificial Intelligence2624-82122025-01-01710.3389/frai.2024.14960661496066Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and visionLucas Waltz0Sushma Katari1Chaeun Hong2Adit Anup3Julian Colbert4Anirudh Potlapally5Taylor Dill6Canaan Porter7John Engle8John Engle9Christopher Stewart10Hari Subramoni11Scott Shearer12Raghu Machiraju13Osler Ortez14Laura Lindsey15Arnab Nandi16Sami Khanal17Department of Food, Agricultural, and Biological Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Food, Agricultural, and Biological Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Horticulture and Crop Science, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Food, Agricultural, and Biological Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Food, Agricultural, and Biological Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Horticulture and Crop Science, The Ohio State University, Columbus, OH, United StatesDepartment of Horticulture and Crop Science, The Ohio State University, Columbus, OH, United StatesDepartment of Computer Science and Engineering, The Ohio State University, Columbus, OH, United StatesDepartment of Food, Agricultural, and Biological Engineering, The Ohio State University, Columbus, OH, United StatesIntroductionAdvancements in machine learning (ML) algorithms that make predictions from data without being explicitly programmed and the increased computational speeds of graphics processing units (GPUs) over the last decade have led to remarkable progress in the capabilities of ML. In many fields, including agriculture, this progress has outpaced the availability of sufficiently diverse and high-quality datasets, which now serve as a limiting factor. While many agricultural use cases appear feasible with current compute resources and ML algorithms, the lack of reusable hardware and software components, referred to as cyberinfrastructure (CI), for collecting, transmitting, cleaning, labeling, and training datasets is a major hindrance toward developing solutions to address agricultural use cases. This study focuses on addressing these challenges by exploring the collection, processing, and training of ML models using a multimodal dataset and providing a vision for agriculture-focused CI to accelerate innovation in the field.MethodsData were collected during the 2023 growing season from three agricultural research locations across Ohio. The dataset includes 1 terabyte (TB) of multimodal data, comprising Unmanned Aerial System (UAS) imagery (RGB and multispectral), as well as soil and weather sensor data. The two primary crops studied were corn and soybean, which are the state's most widely cultivated crops. The data collected and processed from this study were used to train ML models to make predictions of crop growth stage, soil moisture, and final yield.ResultsThe exercise of processing this dataset resulted in four CI components that can be used to provide higher accuracy predictions in the agricultural domain. These components included (1) a UAS imagery pipeline that reduced processing time and improved image quality over standard methods, (2) a tabular data pipeline that aggregated data from multiple sources and temporal resolutions and aligned it with a common temporal resolution, (3) an approach to adapting the model architecture for a vision transformer (ViT) that incorporates agricultural domain expertise, and (4) a data visualization prototype that was used to identify outliers and improve trust in the data.DiscussionFurther work will be aimed at maturing the CI components and implementing them on high performance computing (HPC). There are open questions as to how CI components like these can best be leveraged to serve the needs of the agricultural community to accelerate the development of ML applications in agriculture.https://www.frontiersin.org/articles/10.3389/frai.2024.1496066/fullprecision agriculturemultimodal datamachine learningUnmanned Aerial Systemscrop phenotypingcyberinfrastructure |
spellingShingle | Lucas Waltz Sushma Katari Chaeun Hong Adit Anup Julian Colbert Anirudh Potlapally Taylor Dill Canaan Porter John Engle John Engle Christopher Stewart Hari Subramoni Scott Shearer Raghu Machiraju Osler Ortez Laura Lindsey Arnab Nandi Sami Khanal Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision Frontiers in Artificial Intelligence precision agriculture multimodal data machine learning Unmanned Aerial Systems crop phenotyping cyberinfrastructure |
title | Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision |
title_full | Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision |
title_fullStr | Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision |
title_full_unstemmed | Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision |
title_short | Cyberinfrastructure for machine learning applications in agriculture: experiences, analysis, and vision |
title_sort | cyberinfrastructure for machine learning applications in agriculture experiences analysis and vision |
topic | precision agriculture multimodal data machine learning Unmanned Aerial Systems crop phenotyping cyberinfrastructure |
url | https://www.frontiersin.org/articles/10.3389/frai.2024.1496066/full |
work_keys_str_mv | AT lucaswaltz cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT sushmakatari cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT chaeunhong cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT aditanup cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT juliancolbert cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT anirudhpotlapally cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT taylordill cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT canaanporter cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT johnengle cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT johnengle cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT christopherstewart cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT harisubramoni cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT scottshearer cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT raghumachiraju cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT oslerortez cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT lauralindsey cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT arnabnandi cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision AT samikhanal cyberinfrastructureformachinelearningapplicationsinagricultureexperiencesanalysisandvision |