Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI

Recent advancements in deep learning have significantly improved medical image segmentation. However, the generalization performance and potential risks of data-driven models remain insufficiently validated. Specifically, unrealistic segmentation predictions deviating from actual anatomical structur...

Full description

Saved in:
Bibliographic Details
Main Authors: Sihwan Kim, Changmin Park, Gwanghyeon Jeon, Seohee Kim, Jong Hyo Kim
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Bioengineering
Subjects:
Online Access:https://www.mdpi.com/2306-5354/12/1/81
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832589024333660160
author Sihwan Kim
Changmin Park
Gwanghyeon Jeon
Seohee Kim
Jong Hyo Kim
author_facet Sihwan Kim
Changmin Park
Gwanghyeon Jeon
Seohee Kim
Jong Hyo Kim
author_sort Sihwan Kim
collection DOAJ
description Recent advancements in deep learning have significantly improved medical image segmentation. However, the generalization performance and potential risks of data-driven models remain insufficiently validated. Specifically, unrealistic segmentation predictions deviating from actual anatomical structures, known as a Seg-Hallucination, often occur in deep learning-based models. The Seg-Hallucinations can result in erroneous quantitative analyses and distort critical imaging biomarker information, yet effective audits or corrections to address these issues are rare. Therefore, we propose an automated Seg-Hallucination surveillance and correction (ASHSC) algorithm utilizing only 3D organ mask information derived from CT images without reliance on the ground truth. Two publicly available datasets were used in developing the ASHSC algorithm: 280 CT scans from the TotalSegmentator dataset for training and 274 CT scans from the Cancer Imaging Archive (TCIA) dataset for performance evaluation. The ASHSC algorithm utilizes a two-stage on-demand strategy with mesh-based convolutional neural networks and generative artificial intelligence. The segmentation quality level (SQ-level)-based surveillance stage was evaluated using the area under the receiver operating curve, sensitivity, specificity, and positive predictive value. The on-demand correction performance of the algorithm was assessed using similarity metrics: volumetric Dice score, volume error percentage, average surface distance, and Hausdorff distance. Average performance of the surveillance stage resulted in an AUROC of 0.94 ± 0.01, sensitivity of 0.82 ± 0.03, specificity of 0.90 ± 0.01, and PPV of 0.92 ± 0.01 for test dataset. After the on-demand refinement of the correction stage, all the four similarity metrics were improved compared to a single use of the AI-segmentation model. This study not only enhances the efficiency and reliability of handling the Seg-Hallucination but also eliminates the reliance on ground truth. The ASHSC algorithm offers intuitive 3D guidance for uncertainty regions, while maintaining manageable computational complexity. The SQ-level-based on-demand correction strategy adaptively minimizes uncertainties inherent in deep-learning-based organ masks and advances automated auditing and correction methodologies.
format Article
id doaj-art-0cdc7eaca1f641709f9a5e683f8c8548
institution Kabale University
issn 2306-5354
language English
publishDate 2025-01-01
publisher MDPI AG
record_format Article
series Bioengineering
spelling doaj-art-0cdc7eaca1f641709f9a5e683f8c85482025-01-24T13:23:12ZengMDPI AGBioengineering2306-53542025-01-011218110.3390/bioengineering12010081Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AISihwan Kim0Changmin Park1Gwanghyeon Jeon2Seohee Kim3Jong Hyo Kim4Department of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Republic of KoreaDepartment of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Republic of KoreaClariPi Research, ClariPi Inc., Seoul 03088, Republic of KoreaDepartment of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Republic of KoreaDepartment of Applied Bioengineering, Graduate School of Convergence Science and Technology, Seoul National University, Seoul 08826, Republic of KoreaRecent advancements in deep learning have significantly improved medical image segmentation. However, the generalization performance and potential risks of data-driven models remain insufficiently validated. Specifically, unrealistic segmentation predictions deviating from actual anatomical structures, known as a Seg-Hallucination, often occur in deep learning-based models. The Seg-Hallucinations can result in erroneous quantitative analyses and distort critical imaging biomarker information, yet effective audits or corrections to address these issues are rare. Therefore, we propose an automated Seg-Hallucination surveillance and correction (ASHSC) algorithm utilizing only 3D organ mask information derived from CT images without reliance on the ground truth. Two publicly available datasets were used in developing the ASHSC algorithm: 280 CT scans from the TotalSegmentator dataset for training and 274 CT scans from the Cancer Imaging Archive (TCIA) dataset for performance evaluation. The ASHSC algorithm utilizes a two-stage on-demand strategy with mesh-based convolutional neural networks and generative artificial intelligence. The segmentation quality level (SQ-level)-based surveillance stage was evaluated using the area under the receiver operating curve, sensitivity, specificity, and positive predictive value. The on-demand correction performance of the algorithm was assessed using similarity metrics: volumetric Dice score, volume error percentage, average surface distance, and Hausdorff distance. Average performance of the surveillance stage resulted in an AUROC of 0.94 ± 0.01, sensitivity of 0.82 ± 0.03, specificity of 0.90 ± 0.01, and PPV of 0.92 ± 0.01 for test dataset. After the on-demand refinement of the correction stage, all the four similarity metrics were improved compared to a single use of the AI-segmentation model. This study not only enhances the efficiency and reliability of handling the Seg-Hallucination but also eliminates the reliance on ground truth. The ASHSC algorithm offers intuitive 3D guidance for uncertainty regions, while maintaining manageable computational complexity. The SQ-level-based on-demand correction strategy adaptively minimizes uncertainties inherent in deep-learning-based organ masks and advances automated auditing and correction methodologies.https://www.mdpi.com/2306-5354/12/1/81AI auditSeg-Hallucinationuncertaintyanomaly screeningsegmentation
spellingShingle Sihwan Kim
Changmin Park
Gwanghyeon Jeon
Seohee Kim
Jong Hyo Kim
Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI
Bioengineering
AI audit
Seg-Hallucination
uncertainty
anomaly screening
segmentation
title Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI
title_full Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI
title_fullStr Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI
title_full_unstemmed Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI
title_short Automated Audit and Self-Correction Algorithm for Seg-Hallucination Using MeshCNN-Based On-Demand Generative AI
title_sort automated audit and self correction algorithm for seg hallucination using meshcnn based on demand generative ai
topic AI audit
Seg-Hallucination
uncertainty
anomaly screening
segmentation
url https://www.mdpi.com/2306-5354/12/1/81
work_keys_str_mv AT sihwankim automatedauditandselfcorrectionalgorithmforseghallucinationusingmeshcnnbasedondemandgenerativeai
AT changminpark automatedauditandselfcorrectionalgorithmforseghallucinationusingmeshcnnbasedondemandgenerativeai
AT gwanghyeonjeon automatedauditandselfcorrectionalgorithmforseghallucinationusingmeshcnnbasedondemandgenerativeai
AT seoheekim automatedauditandselfcorrectionalgorithmforseghallucinationusingmeshcnnbasedondemandgenerativeai
AT jonghyokim automatedauditandselfcorrectionalgorithmforseghallucinationusingmeshcnnbasedondemandgenerativeai