Improving food safety: Synthetic data augmentation for accurate mushroom species identification in complex environments

Growing global demand for mushrooms in the agricultural, pharmaceutical, and food sectors raises pressing concerns for food safety, as inadequate identification can lead to the inadvertent inclusion of toxic species. Although deep learning offers promising solutions for large-scale classification an...

Full description

Saved in:
Bibliographic Details
Main Authors: Mengze Du, Fei Wang, Weibing Yan, Jie Guo, Lu Liu, Ping Lv, Yong He, Xuping Feng, Yuwei Wang
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:Applied Food Research
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2772502225003476
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Growing global demand for mushrooms in the agricultural, pharmaceutical, and food sectors raises pressing concerns for food safety, as inadequate identification can lead to the inadvertent inclusion of toxic species. Although deep learning offers promising solutions for large-scale classification and quality control, its effectiveness is often limited by the scarcity of high-quality training datasets and the inherently complex morphological traits of mushrooms. To address these challenges, this study proposes a diffusion model-based data augmentation that preserves essential morphological features while diversifying training samples. By generating realistic images, this approach significantly improves classification accuracy, especially for underrepresented species, and achieves superior results across various image quality indicators. Furthermore, a Top-k recommendation method ranks the most likely classifications, bolstering recognition of rarer species and reducing food safety risks linked to misidentification. Experiments on 110 mushroom species demonstrated a 13.51 % increase in mean recall rate across eight deep learning models, with Top-3 and Top-5 recall rates further improved by 7.56 % and 6.79 %, respectively. These findings highlight the practical utility of diffusion-based augmentation in enhancing model robustness, ensuring more reliable monitoring compared to traditional methods, and mitigating the likelihood of toxic mushrooms entering the food chain. Overall, this study underscores the critical importance of leveraging synthetic data generation to advance mushroom recognition, supporting safer and more sustainable food systems.
ISSN:2772-5022