-
261
CGFTNet: Content-Guided Frequency Domain Transform Network for Face Super-Resolution
Published 2024-12-01“…Recent advancements in face super resolution (FSR) have been propelled by deep learning techniques using convolutional neural networks (CNN). However, existing methods still struggle with effectively capturing global facial structure information, leading to reduced fidelity in reconstructed images, and often require additional manual data annotation. …”
Get full text
Article -
262
Lightweight human activity recognition method based on the MobileHARC model
Published 2024-12-01“…However, due to the fact that these models have sequential network structures and are unable to simultaneously focus on local and global features, thus, resulting in a reduction in recognition performance. …”
Get full text
Article -
263
HMA-Net: a hybrid mixer framework with multihead attention for breast ultrasound image segmentation
Published 2025-06-01“…The model achieved a Jaccard index of 98.04% and 94.84% and a Dice similarity coefficient of 99.01% and 97.35% on the BUSI and BrEaST datasets, respectively.DiscussionThe ConvMixer and ConvNeXT modules are integrated with convolution-enhanced multihead attention, which enhances the model's ability to capture local and global contextual information. …”
Get full text
Article -
264
Rotten strawberry classification based on EfficientNet V2 algorithm fused with GCN and CA-Transformer
Published 2024-12-01“…Secondly, this study integrated the Transformer structure with attention into the backbone of the baseline model, replacing some convolution operations with this structure to achieve the fusion of global and local features, thereby better identifying the rottenness of strawberries. …”
Get full text
Article -
265
LEAD-YOLO: A Lightweight and Accurate Network for Small Object Detection in Autonomous Driving
Published 2025-08-01“…The proposed framework incorporates three innovative components: First, the Backbone integrates a lightweight Convolutional Gated Transformer (CGF) module, which employs normalized gating mechanisms with residual connections, and a Dilated Feature Fusion (DFF) structure that enables progressive multi-scale context modeling through dilated convolutions. …”
Get full text
Article -
266
Multiscale Graph Transformer Network With Dynamic Superpixel Pyramid for Hyperspectral Image Classification
Published 2025-01-01“…To address these limitations, we propose a multi-scale graph transformer network (MSGTN), which captures spatial features at different scales through multiscale graph convolutional networks (GCNs) with adaptive graph structures. …”
Get full text
Article -
267
A Hybrid Learnable Fusion of ConvNeXt and Swin Transformer for Optimized Image Classification
Published 2025-05-01“…However, each paradigm alone is limited in addressing both fine-grained structures and broader anatomical context. We propose ConvTransGFusion, a hybrid model that fuses ConvNeXt (for refined convolutional features) and Swin Transformer (for hierarchical global attention) using a learnable dual-attention gating mechanism. …”
Get full text
Article -
268
RETINA: Reconstruction-based pre-trained enhanced TransUNet for electron microscopy segmentation on the CEM500K dataset.
Published 2025-05-01“…We developed the RETINA method, which combines pre-training on the large, unlabeled CEM500K EM image dataset with a hybrid neural-network model architecture that integrates both local (convolutional layer) and global (transformer layer) image processing to learn from manual image annotations. …”
Get full text
Article -
269
Attention residual network for medical ultrasound image segmentation
Published 2025-07-01“…Additionally, a spatial hybrid convolution module is integrated to augment the model’s ability to extract global information and deepen the vertical architecture of the network. …”
Get full text
Article -
270
A New Hybrid ConvViT Model for Dangerous Farm Insect Detection
Published 2025-02-01“…This study proposes a novel hybrid convolution and vision transformer model (ConvViT) designed to detect harmful insect species that adversely affect agricultural production and play a critical role in global food security. …”
Get full text
Article -
271
TFF-Net: A Feature Fusion Graph Neural Network-Based Vehicle Type Recognition Approach for Low-Light Conditions
Published 2025-06-01“…The model employs multi-scale convolutional operations combined with an Efficient Channel Attention (ECA) module to extract discriminative local features, while independent convolutional layers capture hierarchical global representations. …”
Get full text
Article -
272
MCGFE-CR: Cloud Removal With Multiscale Context-Guided Feature Enhancement Network
Published 2024-01-01“…Currently, cloud removal methods with better performance are mainly based on Convolutional Neural Networks (CNNs). However, they fail to capture global context information, resulting in the loss of global context features in image reconstruction. …”
Get full text
Article -
273
Dual-branch attention network-based stereoscopicvideo compression
Published 2025-01-01“…First, a Local and Global Encoder-decoder Block (LGEDB) based on Transformer and channel attention was proposed, which accurately captured non-repetitive texture details in local regions and global structural information by integrating pixel-level self-attention within each local area and global attention across channels. …”
Get full text
Article -
274
A Dual-Stream Dental Panoramic X-Ray Image Segmentation Method Based on Transformer Heterogeneous Feature Complementation
Published 2025-07-01“…Furthermore, a Pooling-Cooperative Convolutional Module was designed, which enhances the model’s capability in detail extraction and boundary localization through weighted centroid features of dental structures and a latent edge extraction module. …”
Get full text
Article -
275
Fusion of Recurrence Plots and Gramian Angular Fields with Bayesian Optimization for Enhanced Time-Series Classification
Published 2025-07-01“…Time-series classification remains a critical task across various domains, demanding models that effectively capture both local recurrence structures and global temporal dependencies. We introduce a novel framework that transforms time series into image representations by fusing recurrence plots (RPs) with both Gramian Angular Summation Fields (GASFs) and Gramian Angular Difference Fields (GADFs). …”
Get full text
Article -
276
Non-end-to-end adaptive graph learning for multi-scale temporal traffic flow prediction.
Published 2025-01-01“…The method incorporates a multi-scale temporal attention module and a multi-scale temporal convolution module to extract multi-scale information. …”
Get full text
Article -
277
Improved Asynchronous Federated Learning for Data Injection Pollution
Published 2025-05-01“…In our approach, the residual network is used to extract the static information of the image, the capsule network is used to extract the spatial dependence among the internal structures of the image, several layers of convolution are used to reduce the dimensions of both features, and the two extracted features are fused. …”
Get full text
Article -
278
A VAN-Based Multi-Scale Cross-Attention Mechanism for Skin Lesion Segmentation Network
Published 2023-01-01“…Although many neural networks based on U-shaped structures and methods, such as skip connections have achieved excellent results in medical image segmentation tasks, the properties of convolutional operations limit their ability to effectively learn local and global features. …”
Get full text
Article -
279
3D-SCUMamba: An Abdominal Tumor Segmentation Model
Published 2025-01-01“…Existing deep learning models typically adopt encoder-decoder architectures integrating convolutional layers with global dependency modeling to capture broader contextual information around tumors. …”
Get full text
Article -
280
Foreign object detection on coal conveyor belt enhanced by attention mechanism
Published 2025-06-01“…A unique combination of convolution and pooling operations was used by the CPCA attention mechanism to perform global average pooling and maximum pooling on the input feature map, multi-dimensional feature information was deeply mined, and then attention weights for each channel and spatial position were accurately generated through nonlinear transformation, guiding the model to focus on the key feature areas of foreign objects and enhance feature extraction capabilities. …”
Get full text
Article