Analysing the potential of ChatGPT to support plant disease risk forecasting systems

This study explores the potential of two versions of the ChatGPT large language model (GPT-3.5 Turbo and GPT-4) in supporting plant disease risk forecasting through the translation of model-based predictions into advisory messages. A dataset of 3125 messages, each referred to an artificially generat...

Full description

Saved in:
Bibliographic Details
Main Authors: Roberta Calone, Elisabetta Raparelli, Sofia Bajocco, Eugenio Rossi, Lorenzo Crecco, Danilo Morelli, Chiara Bassi, Rocchina Tiso, Riccardo Bugiani, Fabio Pietrangeli, Giovanna Cattaneo, Camilla Nigro, Marco Secondo Gerardi, Simone Bussotti, Angela Sanchioni, Danilo Tognetti, Mariangela Sandra, Irene De Lillo, Paolo Framarin, Sandra Di Ferdinando, Simone Bregaglio
Format: Article
Language:English
Published: Elsevier 2025-03-01
Series:Smart Agricultural Technology
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2772375525000589
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This study explores the potential of two versions of the ChatGPT large language model (GPT-3.5 Turbo and GPT-4) in supporting plant disease risk forecasting through the translation of model-based predictions into advisory messages. A dataset of 3125 messages, each referred to an artificially generated five-day disease risk scenario, was inspected using lexical, consistency, and sentiment analyses. A participatory approach was adopted involving officers and technicians from eleven Italian regional phytosanitary services in the message evaluation. Lexical analysis indicated that GPT-4 produced more detailed advises, leading to diversified responses across disease pressure scenarios. In contrast, GPT-3.5 generated concise and straightforward messages, making it well-suited for routine tasks requiring clarity and brevity. Sentiment analysis revealed the greater adaptability of GPT-4 in shifting from a reassurance to an urgency tone as the risk level increased, while GPT-3.5 maintained a more neutral stance across disease pressure scenarios. Consistency analysis demonstrated greater stability in GPT-3.5 messages, whereas GPT-4 exhibited more expressivity and creativity. Expert evaluations highlighted promising potential of both models for operational use, with GPT-4 noted for its precision in communicating disease risk and supporting technical bulletins drafting. However, both GPT versions were criticized for producing overly generic advices, highlighting the importance of domain-specific training to integrate information on best management practices, locally authorized substances, and historical treatment schedules. Such implementation is crucial to align large language models with Integrated Pest Management principles and improve their precision towards their operational use.
ISSN:2772-3755