Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI

Recent advancements in deep learning have significantly improved medical image segmentation. However, the generalization performance and potential risks of data-driven models remain insufficiently validated. Specifically, unrealistic segmentation predictions deviating from actual anatomical structur...

Full description

Saved in:

Bibliographic Details
Main Authors:	Sihwan Kim, Changmin Park, Gwanghyeon Jeon, Seohee Kim, Jong Hyo Kim
Format:	Article
Language:	English
Published:	MDPI AG 2025-01-01
Series:	Bioengineering
Subjects:	AI audit Seg-Hallucination uncertainty anomaly screening segmentation
Online Access:	https://www.mdpi.com/2306-5354/12/1/81
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832589024333660160
author	Sihwan Kim Changmin Park Gwanghyeon Jeon Seohee Kim Jong Hyo Kim
author_facet	Sihwan Kim Changmin Park Gwanghyeon Jeon Seohee Kim Jong Hyo Kim
author_sort	Sihwan Kim
collection	DOAJ
description	Recent advancements in deep learning have significantly improved medical image segmentation. However, the generalization performance and potential risks of data-driven models remain insufficiently validated. Specifically, unrealistic segmentation predictions deviating from actual anatomical structures, known as a Seg-Hallucination, often occur in deep learning-based models. The Seg-Hallucinations can result in erroneous quantitative analyses and distort critical imaging biomarker information, yet effective audits or corrections to address these issues are rare. Therefore, we propose an automated Seg-Hallucination surveillance and correction (ASHSC) algorithm utilizing only 3D organ mask information derived from CT images without reliance on the ground truth. Two publicly available datasets were used in developing the ASHSC algorithm: 280 CT scans from the TotalSegmentator dataset for training and 274 CT scans from the Cancer Imaging Archive (TCIA) dataset for performance evaluation. The ASHSC algorithm utilizes a two-stage on-demand strategy with mesh-based convolutional neural networks and generative artificial intelligence. The segmentation quality level (SQ-level)-based surveillance stage was evaluated using the area under the receiver operating curve, sensitivity, specificity, and positive predictive value. The on-demand correction performance of the algorithm was assessed using similarity metrics: volumetric Dice score, volume error percentage, average surface distance, and Hausdorff distance. Average performance of the surveillance stage resulted in an AUROC of 0.94 ± 0.01, sensitivity of 0.82 ± 0.03, specificity of 0.90 ± 0.01, and PPV of 0.92 ± 0.01 for test dataset. After the on-demand refinement of the correction stage, all the four similarity metrics were improved compared to a single use of the AI-segmentation model. This study not only enhances the efficiency and reliability of handling the Seg-Hallucination but also eliminates the reliance on ground truth. The ASHSC algorithm offers intuitive 3D guidance for uncertainty regions, while maintaining manageable computational complexity. The SQ-level-based on-demand correction strategy adaptively minimizes uncertainties inherent in deep-learning-based organ masks and advances automated auditing and correction methodologies.
format	Article
id	doaj-art-0cdc7eaca1f641709f9a5e683f8c8548
institution	Kabale University
issn	2306-5354
language	English
publishDate	2025-01-01
publisher	MDPI AG
record_format	Article
series	Bioengineering
spelling	doaj-art-0cdc7eaca1f641709f9a5e683f8c85482025-01-24T13:23:12ZengMDPI AGBioengineering2306-53542025-01-011218110.3390/bioengineering12010081Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AISihwan Kim0Changmin Park1Gwanghyeon Jeon2Seohee Kim3Jong Hyo Kim4Department of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Republic of KoreaDepartment of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Republic of KoreaClariPi Research, ClariPi Inc., Seoul 03088, Republic of KoreaDepartment of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Republic of KoreaDepartment of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Republic of KoreaRecent advancements in deep learning have significantly improved medical image segmentation. However, the generalization performance and potential risks of data-driven models remain insufficiently validated. Specifically, unrealistic segmentation predictions deviating from actual anatomical structures, known as a Seg-Hallucination, often occur in deep learning-based models. The Seg-Hallucinations can result in erroneous quantitative analyses and distort critical imaging biomarker information, yet effective audits or corrections to address these issues are rare. Therefore, we propose an automated Seg-Hallucination surveillance and correction (ASHSC) algorithm utilizing only 3D organ mask information derived from CT images without reliance on the ground truth. Two publicly available datasets were used in developing the ASHSC algorithm: 280 CT scans from the TotalSegmentator dataset for training and 274 CT scans from the Cancer Imaging Archive (TCIA) dataset for performance evaluation. The ASHSC algorithm utilizes a two-stage on-demand strategy with mesh-based convolutional neural networks and generative artificial intelligence. The segmentation quality level (SQ-level)-based surveillance stage was evaluated using the area under the receiver operating curve, sensitivity, specificity, and positive predictive value. The on-demand correction performance of the algorithm was assessed using similarity metrics: volumetric Dice score, volume error percentage, average surface distance, and Hausdorff distance. Average performance of the surveillance stage resulted in an AUROC of 0.94 ± 0.01, sensitivity of 0.82 ± 0.03, specificity of 0.90 ± 0.01, and PPV of 0.92 ± 0.01 for test dataset. After the on-demand refinement of the correction stage, all the four similarity metrics were improved compared to a single use of the AI-segmentation model. This study not only enhances the efficiency and reliability of handling the Seg-Hallucination but also eliminates the reliance on ground truth. The ASHSC algorithm offers intuitive 3D guidance for uncertainty regions, while maintaining manageable computational complexity. The SQ-level-based on-demand correction strategy adaptively minimizes uncertainties inherent in deep-learning-based organ masks and advances automated auditing and correction methodologies.https://www.mdpi.com/2306-5354/12/1/81AI auditSeg-Hallucinationuncertaintyanomaly screeningsegmentation
spellingShingle	Sihwan Kim Changmin Park Gwanghyeon Jeon Seohee Kim Jong Hyo Kim Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI Bioengineering AI audit Seg-Hallucination uncertainty anomaly screening segmentation
title	Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI
title_full	Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI
title_fullStr	Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI
title_full_unstemmed	Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI
title_short	Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI
title_sort	automated audit and self correction algorithm for seg hallucination using meshcnn based on demand generative ai
topic	AI audit Seg-Hallucination uncertainty anomaly screening segmentation
url	https://www.mdpi.com/2306-5354/12/1/81
work_keys_str_mv	AT sihwankim automatedauditandselfcorrectionalgorithmforseghallucinationusingmeshcnnbasedondemandgenerativeai AT changminpark automatedauditandselfcorrectionalgorithmforseghallucinationusingmeshcnnbasedondemandgenerativeai AT gwanghyeonjeon automatedauditandselfcorrectionalgorithmforseghallucinationusingmeshcnnbasedondemandgenerativeai AT seoheekim automatedauditandselfcorrectionalgorithmforseghallucinationusingmeshcnnbasedondemandgenerativeai AT jonghyokim automatedauditandselfcorrectionalgorithmforseghallucinationusingmeshcnnbasedondemandgenerativeai

Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI

Similar Items