-
121
ViT-DualAtt: An efficient pornographic image classification method based on Vision Transformer with dual attention
Published 2024-12-01“…The model adopts a CNN-Transformer hierarchical structure, combining the strengths of Convolutional Neural Networks (CNNs) and Transformers to effectively capture and integrate both local and global features, thereby enhancing feature representation accuracy and diversity. …”
Get full text
Article -
122
Two-stream spatio-temporal GCN-transformer networks for skeleton-based action recognition
Published 2025-02-01“…This study proposes a novel architecture addressing this limitation by implementing a parallel configuration of GCNs and the Transformer model (SA-TDGFormer). This parallel structure integrates the advantages of both the GCN model and the Transformer model, facilitating the extraction of both local and global spatio-temporal features, leading to more accurate motion information encoding and improved recognition performance. …”
Get full text
Article -
123
A Rotation Target Detection Network Based on Multi-Kernel Interaction and Hierarchical Expansion
Published 2025-08-01Get full text
Article -
124
VE-GCN: A Geography-Aware Approach for Polyline Simplification in Cartographic Generalization
Published 2025-02-01“…To enhance the graph convolutional structure for capturing crucial geographic element features and simultaneously learning vertex and edge features within map polylines, this study introduces a joint vertex–edge feature graph convolutional network (VE-GCN). …”
Get full text
Article -
125
-
126
CNN–Transformer gated fusion network for medical image super-resolution
Published 2025-05-01“…The network consists of two branches, one is the global branch based on residual Transformer network, and the other is the local branch based on dynamic convolutional neural network. …”
Get full text
Article -
127
MSA-Net: multiple self-attention mechanism for 3D lung nodule classification in CT images
Published 2025-05-01Get full text
Article -
128
A Parallel Image Denoising Network Based on Nonparametric Attention and Multiscale Feature Fusion
Published 2025-01-01“…The lower branch network used multiple dilation convolution residual blocks with different dilation rates to increase the receptive field and extend more contextual information to obtain the global features of the noise in the image. …”
Get full text
Article -
129
A Feature-Driven Inception Dilated Network for Infrared Image Super-Resolution Reconstruction
Published 2024-10-01“…Therefore, an Inception Dilated Super-Resolution (IDSR) network with multiple branches is proposed. A dilated convolutional branch captures high-frequency information to reconstruct edge details, while a non-local operation branch captures long-range dependencies between any two positions to maintain the global structure. …”
Get full text
Article -
130
MT-SCnet: multi-scale token divided and spatial-channel fusion transformer network for microscopic hyperspectral image segmentation
Published 2024-12-01“…IntroductionHybrid architectures based on convolutional neural networks and Transformers, effectively captures both the local details and the overall structural context of lesion tissues and cells, achieving highly competitive segmentation results in microscopic hyperspectral image (MHSI) segmentation tasks. …”
Get full text
Article -
131
Multiscale Wavelet and Graph Network With Spectral Self-Attention for Hyperspectral Image Classification
Published 2025-01-01“…Third, DH-GCN constructs a deep graph structure to model spatial topology and overcome oversmoothing. …”
Get full text
Article -
132
MCPA: multi-scale cross perceptron attention network for 2D medical image segmentation
Published 2024-12-01“…However, it faces challenges in capturing long-range dependencies due to the limited receptive fields and inherent bias of convolutional operations. Recently, numerous transformer-based techniques have been incorporated into the UNet architecture to overcome this limitation by effectively capturing global feature correlations. …”
Get full text
Article -
133
PI-ADFM: Enhancing Multimodal Remote Sensing Image Matching Through Phase-Integrated Aggregated Deep Features
Published 2025-01-01“…Geometric distortions and significant nonlinear radiometric differences in multimodal remote sensing images (MRSIs) introduce substantial noise in feature extraction. Single-branch convolutional neural networks fail to capture global image features and integrate local and global information effectively, yielding deep descriptors with low discriminability and limited robustness. …”
Get full text
Article -
134
High-Precision Qiantang River Water Body Recognition Based on Remote Sensing Image
Published 2024-01-01“…., are applied, Currently there are few works on the water body identification of Qiantang River, Here, one major challenge for high-precision Qiantang water body recognition is the real complex water body features and complicated geological environment, They are the dense distribution of small water bodies in the Qiantang River Basin, large differences in water body nutrition, and the high complexity of surface environments such as mountains and plains, We investigated two traditional and several deep learning methods and found that WatNet was the most effective model for Qiantang River, This model adopts the structure based on encoder-decoder convolutional network, It uses MobileNetV2 as the encoder, which makes it extract more water feature information while being lightweight and uses ASPP module to capture global multi-scale features in deep layers, Experimental results show that the MIoU and OA (Overall Accuracy) can reach 0. 97 and 0. 99 respectively.…”
Get full text
Article -
135
-
136
FD-YOLO11: A Feature-Enhanced Deep Learning Model for Steel Surface Defect Detection
Published 2025-01-01“…To enhance the multiscale feature extraction process, self-calibrated convolution is integrated into the C3k2 module. Additionally, an FSPPF structure is designed to optimize the process of fusing local and global information, improving the defect recognition ability of the model in complex backgrounds. …”
Get full text
Article -
137
Deep Time Series Intelligent Framework for Power Data Asset Evaluation
Published 2025-01-01“…In the evaluation of the complex and rich Solar-Power dataset and Electricity dataset, TSENet achieved significant performance improvements over other state-of-the-art baseline methods.Through the synergistic design of deep convolutional structures and an efficient memory mechanism, it effectively addresses issues such as inadequate modeling of long-term dependencies, insufficient extraction of short-term features, and high prediction volatility, thereby significantly enhancing both the accuracy and robustness of forecasting in power asset evaluation tasks.…”
Get full text
Article -
138
Diagnosis of Coronary Heart Disease Through Deep Learning-Based Segmentation and Localization in Computed Tomography Angiography
Published 2025-01-01“…Coronary computed tomography angiography (CCTA) has emerged as a non-invasive modality for detailed coronary artery visualization; however, automatic and accurate segmentation of coronary structures from CCTA images remains challenging. Conventional convolutional neural networks (CNNs), despite their success in medical imaging, face limitations in capturing the complex, long-range dependencies in coronary artery images due to their localized receptive fields. …”
Get full text
Article -
139
An Inverted Residual Cross Head Knowledge Distillation Network for Remote Sensing Scene Image Classification
Published 2025-01-01“…Then, a multiscale spatial attention module is constructed to further extract global and local features of the image through multiple dilated convolutions, using spatial attention to weight important features in each dilated convolution branch. …”
Get full text
Article -
140
An OGFA+CNN Approach for Multi-Level Disease Identification in Fundus Images
Published 2025-01-01“…Graph-based techniques are employed to capture the structural relationships between key elements such as blood vessels and the optic disc, providing valuable global context to the image. …”
Get full text
Article