Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering

In China, fruit tree diseases are a significant threat to the development of the fruit tree industry, and knowledge about fruit tree diseases is the most needed professional knowledge for fruit farmers and other practitioners in the fruit tree industry. Research papers are the primary sources of pro...

Full description

Saved in:
Bibliographic Details
Main Authors: Yunqiao Fei, Jingchao Fan, Guomin Zhou
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/2/628
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832589268467318784
author Yunqiao Fei
Jingchao Fan
Guomin Zhou
author_facet Yunqiao Fei
Jingchao Fan
Guomin Zhou
author_sort Yunqiao Fei
collection DOAJ
description In China, fruit tree diseases are a significant threat to the development of the fruit tree industry, and knowledge about fruit tree diseases is the most needed professional knowledge for fruit farmers and other practitioners in the fruit tree industry. Research papers are the primary sources of professional knowledge that represent the cutting-edge progress in fruit disease research. Traditional knowledge engineering methods for knowledge acquisition require extensive and cumbersome preparatory work, and they demand a high level of professional background and information technology skills from the handlers. This paper, from the perspective of fruit tree industry knowledge dissemination, aims at users such as fruit farmers, fruit tree experts, fruit tree knowledge communicators, and information gatherers. It proposes a fast, cost-effective, and low-technical-barrier method for extracting fruit tree disease knowledge from research paper abstracts—K-Extract, based on large language models (LLMs) and prompt engineering. Under zero-shot conditions, K-Extract utilizes conversational LLMs to automate the extraction of fruit tree disease knowledge. The K-Extract method has constructed a comprehensive classification system for fruit tree diseases and, through a series of optimized prompt questions, effectively overcomes the deficiencies of LLM models in providing factual accuracy. This paper tests multiple LLM models available in the Chinese market, and the results show that K-Extract can seamlessly integrate with any conversational LLM model, with the DeepSeek model and the Kimi model performing particularly well. The experimental results indicate that LLM models have a high accuracy rate in handling judgment tasks and simple knowledge Q&A tasks. The K-Extract method is simple, efficient, and accurate, and can serve as a convenient tool for knowledge extraction in the agricultural field.
format Article
id doaj-art-755a570e7842427a867d6c0ae31059b7
institution Kabale University
issn 2076-3417
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-755a570e7842427a867d6c0ae31059b72025-01-24T13:20:10ZengMDPI AGApplied Sciences2076-34172025-01-0115262810.3390/app15020628Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt EngineeringYunqiao Fei0Jingchao Fan1Guomin Zhou2Agricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, ChinaAgricultural Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, ChinaNational Agricultural Science Data Center, Beijing 100081, ChinaIn China, fruit tree diseases are a significant threat to the development of the fruit tree industry, and knowledge about fruit tree diseases is the most needed professional knowledge for fruit farmers and other practitioners in the fruit tree industry. Research papers are the primary sources of professional knowledge that represent the cutting-edge progress in fruit disease research. Traditional knowledge engineering methods for knowledge acquisition require extensive and cumbersome preparatory work, and they demand a high level of professional background and information technology skills from the handlers. This paper, from the perspective of fruit tree industry knowledge dissemination, aims at users such as fruit farmers, fruit tree experts, fruit tree knowledge communicators, and information gatherers. It proposes a fast, cost-effective, and low-technical-barrier method for extracting fruit tree disease knowledge from research paper abstracts—K-Extract, based on large language models (LLMs) and prompt engineering. Under zero-shot conditions, K-Extract utilizes conversational LLMs to automate the extraction of fruit tree disease knowledge. The K-Extract method has constructed a comprehensive classification system for fruit tree diseases and, through a series of optimized prompt questions, effectively overcomes the deficiencies of LLM models in providing factual accuracy. This paper tests multiple LLM models available in the Chinese market, and the results show that K-Extract can seamlessly integrate with any conversational LLM model, with the DeepSeek model and the Kimi model performing particularly well. The experimental results indicate that LLM models have a high accuracy rate in handling judgment tasks and simple knowledge Q&A tasks. The K-Extract method is simple, efficient, and accurate, and can serve as a convenient tool for knowledge extraction in the agricultural field.https://www.mdpi.com/2076-3417/15/2/628research papersknowledge extractionlarge language modelsprompt engineeringfruit tree diseases
spellingShingle Yunqiao Fei
Jingchao Fan
Guomin Zhou
Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
Applied Sciences
research papers
knowledge extraction
large language models
prompt engineering
fruit tree diseases
title Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
title_full Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
title_fullStr Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
title_full_unstemmed Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
title_short Extracting Fruit Disease Knowledge from Research Papers Based on Large Language Models and Prompt Engineering
title_sort extracting fruit disease knowledge from research papers based on large language models and prompt engineering
topic research papers
knowledge extraction
large language models
prompt engineering
fruit tree diseases
url https://www.mdpi.com/2076-3417/15/2/628
work_keys_str_mv AT yunqiaofei extractingfruitdiseaseknowledgefromresearchpapersbasedonlargelanguagemodelsandpromptengineering
AT jingchaofan extractingfruitdiseaseknowledgefromresearchpapersbasedonlargelanguagemodelsandpromptengineering
AT guominzhou extractingfruitdiseaseknowledgefromresearchpapersbasedonlargelanguagemodelsandpromptengineering