SLG-Net: Small-Large-Global Feature-Based Multilevel Feature Extraction Network for Ultrasound Image Segmentation
Automatic ultrasound image segmentation improves the efficiency of clinical diagnosis and reduces the workload of doctors. Many ultrasound image segmentation methods focus only on capturing local details and global dependencies while ignoring large-scale context information. However, it is esse...
Main Authors: | Xinya Fan, Jianwen Hu, Kai Hu |
---|---|
Format: | Article |
Language: | English |
Published: | IEEE 2025-01-01 |
Series: | IEEE Access |
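Subjects: | Ultrasound image segmentation; transformer; large kernel; attention mechanism; convolutional neural network |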
Online Access: | https://ieeexplore.ieee.org/document/10836679/ |
_version_ | 1832590323956580352 |
---|---|
author | Xinya Fan; Jianwen Hu; Kai Hu |
author_sort | Xinya Fan |
collection | DOAJ |
description | Automatic ultrasound image segmentation improves the efficiency of clinical diagnosis and reduces the workload of doctors. Many ultrasound image segmentation methods focus only on capturing local details and global dependencies while ignoring large-scale context information. However, extracting large-scale context features is essential for large targets in images. To strengthen feature extraction for targets of various sizes and improve segmentation performance, we propose an effective multilevel feature extraction network (SLG-Net) that extracts features ranging from small local details through large-scale context to global dependencies. SLG-Net is a parallel dual-encoder architecture consisting of a CNN encoder and a transformer encoder. Specifically, the CNN encoder uses large-small kernel attention (LSKA) modules to improve the representation and interaction of fine features and large-scale context features for targets of different sizes. The LSKA module first extracts features with a small kernel module and a large-scale feature selection (LSFS) module in parallel. The features extracted by these two modules are added and passed to a subsequent multi-scale feature interaction module for further information interaction. To fully leverage the feature extraction capability of large kernel convolutions while reducing the number of parameters, we design the large kernel decomposition module (LKDM) to extract large-scale context features in the LSFS module. The transformer encoder captures global features to compensate for the limitations of the CNN encoder. To merge multilevel features, a multi-scale feature fusion module is introduced after the dual encoder. In addition, a multi-scale attention module is integrated at the skip connection to retain significant shallow features for the subsequent fusion of deep and shallow features. Experiments on three public ultrasound datasets indicate that the proposed network achieves prominent performance for ultrasound image segmentation, which shows the potential of our study to promote intelligence in clinical medicine. |
format | Article |
id | doaj-art-4d05aae63a0a4bd2be0eaf6cd7666410 |
institution | Kabale University |
issn | 2169-3536 |
language | English |
publishDate | 2025-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj-art-4d05aae63a0a4bd2be0eaf6cd7666410; 2025-01-24T00:01:17Z; eng; IEEE; IEEE Access; 2169-3536; 2025-01-01; vol. 13, pp. 11720-11733; 10.1109/ACCESS.2025.3528380; 10836679; SLG-Net: Small-Large-Global Feature-Based Multilevel Feature Extraction Network for Ultrasound Image Segmentation; Xinya Fan (https://orcid.org/0009-0000-5438-0991), School of Electrical and Information Engineering, Changsha University of Science and Technology, Changsha, China; Jianwen Hu (https://orcid.org/0000-0001-9849-1327), School of Electrical and Information Engineering, Changsha University of Science and Technology, Changsha, China; Kai Hu, Department of Neurology, Xiangya Hospital, Central South University, Changsha, China; https://ieeexplore.ieee.org/document/10836679/; Ultrasound image segmentation; transformer; large kernel; attention mechanism; convolutional neural network |
title | SLG-Net: Small-Large-Global Feature-Based Multilevel Feature Extraction Network for Ultrasound Image Segmentation |
topic | Ultrasound image segmentation; transformer; large kernel; attention mechanism; convolutional neural network |
url | https://ieeexplore.ieee.org/document/10836679/ |
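
The description states that the large kernel decomposition module (LKDM) keeps the benefit of large kernel convolutions while reducing the number of parameters, but this record does not say how the kernel is decomposed. The sketch below is only an illustration of why such a decomposition can save parameters. It assumes a common scheme from large-kernel attention designs (a small depthwise convolution, a dilated depthwise convolution, and a 1x1 pointwise convolution), which may differ from the authors' LKDM. The function names and the values 64, 21, and 3 are hypothetical.

```python
import math


def dense_conv_params(channels: int, kernel: int) -> int:
    """Weights of a standard kernel x kernel convolution with equal
    input and output channels (bias terms ignored)."""
    return channels * channels * kernel * kernel


def decomposed_conv_params(channels: int, kernel: int, dilation: int) -> int:
    """Weights of an ASSUMED decomposition (not confirmed as the paper's LKDM):
    a (2d-1) x (2d-1) depthwise conv, a ceil(K/d) x ceil(K/d) depthwise conv
    with dilation d, and a 1x1 pointwise conv (bias terms ignored)."""
    local = 2 * dilation - 1
    dilated = math.ceil(kernel / dilation)
    depthwise_local = channels * local * local
    depthwise_dilated = channels * dilated * dilated
    pointwise = channels * channels
    return depthwise_local + depthwise_dilated + pointwise


def decomposed_receptive_field(kernel: int, dilation: int) -> int:
    """Side length of the region covered by the two stacked depthwise convs."""
    local = 2 * dilation - 1
    dilated = math.ceil(kernel / dilation)
    return local + (dilated - 1) * dilation


if __name__ == "__main__":
    # Hypothetical setting: 64 channels, a 21 x 21 target kernel, dilation 3.
    c, k, d = 64, 21, 3
    print("dense weights:     ", dense_conv_params(c, k))
    print("decomposed weights:", decomposed_conv_params(c, k, d))
    print("receptive field:   ", decomposed_receptive_field(k, d))
```

Under these assumptions, the decomposed path covers a region at least as large as the 21 x 21 kernel with a small fraction of the weights, which is the kind of trade-off the description attributes to the LKDM.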