-
261
RETINA: Reconstruction-based pre-trained enhanced TransUNet for electron microscopy segmentation on the CEM500K dataset.
Published 2025-05-01“…We developed the RETINA method, which combines pre-training on the large, unlabeled CEM500K EM image dataset with a hybrid neural-network model architecture that integrates both local (convolutional layer) and global (transformer layer) image processing to learn from manual image annotations. …”
Get full text
Article -
262
A New Hybrid ConvViT Model for Dangerous Farm Insect Detection
Published 2025-02-01“…This study proposes a novel hybrid convolution and vision transformer model (ConvViT) designed to detect harmful insect species that adversely affect agricultural production and play a critical role in global food security. …”
Get full text
Article -
263
Attention residual network for medical ultrasound image segmentation
Published 2025-07-01“…Additionally, a spatial hybrid convolution module is integrated to augment the model’s ability to extract global information and deepen the vertical architecture of the network. …”
Get full text
Article -
264
Remote sensing image Super-resolution reconstruction by fusing multi-scale receptive fields and hybrid transformer
Published 2025-01-01“…The discriminator combines multi-scale convolution, global Transformer, and hierarchical feature discriminators, providing a comprehensive and refined evaluation of image quality. …”
Get full text
Article -
265
MTGNet: Multi-Agent End-to-End Motion Trajectory Prediction with Multimodal Panoramic Dynamic Graph
Published 2025-05-01“…In addition, we utilize the graph convolutional neural network (GCN) to process graph-structured data. …”
Get full text
Article -
266
TMAR: 3-D Transformer Network via Masked Autoencoder Regularization for Hyperspectral Sharpening
Published 2025-01-01“…In this study, we focus on leveraging the power of CNN and transformer models and propose a multistage deep transformer-based super-resolution network that is regularized via an asymmetric autoencoder structure. In addition, we utilize a 3-D convolution layer in the light transformer structure because it allows for more flexible computation of correlations between HSI layers and better capturing of dependencies within spectral–spatial features. …”
Get full text
Article -
267
Application of Partial Differential Equation Image Classification Methods to the Aesthetic Evaluation of Images
Published 2021-01-01“…The structure of a convolution kernel learned by using parallel network structure achieves better classification performance. …”
Get full text
Article -
268
Non-end-to-end adaptive graph learning for multi-scale temporal traffic flow prediction.
Published 2025-01-01“…The method incorporates a multi-scale temporal attention module and a multi-scale temporal convolution module to extract multi-scale information. …”
Get full text
Article -
269
Foreign object detection on coal conveyor belt enhanced by attention mechanism
Published 2025-06-01“…A unique combination of convolution and pooling operations was used by the CPCA attention mechanism to perform global average pooling and maximum pooling on the input feature map, multi-dimensional feature information was deeply mined, and then attention weights for each channel and spatial position were accurately generated through nonlinear transformation, guiding the model to focus on the key feature areas of foreign objects and enhance feature extraction capabilities. …”
Get full text
Article -
270
Graph-Based Adaptive Network With Spatial-Spectral Features for Hyperspectral Unmixing
Published 2025-01-01“…In the method, HSIs are treated as data on manifold structures, with superpixels serving as graph nodes to construct a global graph-structured data. …”
Get full text
Article -
271
A VAN-Based Multi-Scale Cross-Attention Mechanism for Skin Lesion Segmentation Network
Published 2023-01-01“…Although many neural networks based on U-shaped structures and methods, such as skip connections have achieved excellent results in medical image segmentation tasks, the properties of convolutional operations limit their ability to effectively learn local and global features. …”
Get full text
Article -
272
LEAD-YOLO: A Lightweight and Accurate Network for Small Object Detection in Autonomous Driving
Published 2025-08-01“…The proposed framework incorporates three innovative components: First, the Backbone integrates a lightweight Convolutional Gated Transformer (CGF) module, which employs normalized gating mechanisms with residual connections, and a Dilated Feature Fusion (DFF) structure that enables progressive multi-scale context modeling through dilated convolutions. …”
Get full text
Article -
273
WDM-UNet: A Wavelet-Deformable Gated Fusion Network for Multi-Scale Retinal Vessel Segmentation
Published 2025-08-01“…To address these limitations, we propose WDM-UNet, a novel spatial-wavelet dual-domain fusion architecture that integrates spatial and wavelet-domain representations to simultaneously enhance the local detail and global context. The encoder combines a Deformable Convolution Encoder (DCE), which adaptively models complex vascular structures through dynamic receptive fields, and a Wavelet Convolution Encoder (WCE), which captures the semantic and structural contexts through low-frequency components and hierarchical wavelet convolution. …”
Get full text
Article -
274
3D-SCUMamba: An Abdominal Tumor Segmentation Model
Published 2025-01-01“…Existing deep learning models typically adopt encoder-decoder architectures integrating convolutional layers with global dependency modeling to capture broader contextual information around tumors. …”
Get full text
Article -
275
DSS-MobileNetV3: An Efficient Dynamic-State-Space- Enhanced Network for Concrete Crack Segmentation
Published 2025-06-01“…The DSS-MobileNetV3 adopts a U-shaped encoder–decoder architecture, and a dynamic-state-space (DSS) block is designed into the encoder to improve the MobileNetV3 bottleneck module in modeling global dependencies. The DSS block improves the MobileNetV3 model in structural perception and global dependency modeling for complex crack morphologies by integrating dynamic snake convolution and a state space model. …”
Get full text
Article -
276
HMA-Net: a hybrid mixer framework with multihead attention for breast ultrasound image segmentation
Published 2025-06-01“…The model achieved a Jaccard index of 98.04% and 94.84% and a Dice similarity coefficient of 99.01% and 97.35% on the BUSI and BrEaST datasets, respectively.DiscussionThe ConvMixer and ConvNeXT modules are integrated with convolution-enhanced multihead attention, which enhances the model's ability to capture local and global contextual information. …”
Get full text
Article -
277
Diagnosis of Alzheimer’s disease using brain $$^{18}\textrm{F}$$ -FDG PET imaging based on a state space model
Published 2025-07-01“…Building on this, we optimized the original purely convolutional structure into a hybrid architecture combining convolution and Transformer layers. …”
Get full text
Article -
278
Attention-enhanced StrongSORT for robust vehicle tracking in complex environments
Published 2025-05-01“…To address these challenges, we propose AE-StrongSORT (Attention-Enhanced StrongSORT), an attention-enhanced tracking framework featuring three systematic innovations: first, the GAM-YOLO (global attention mechanism-YOLO)hybrid architecture integrates multi-scale feature fusion with a global attention mechanism (GC2f structure). …”
Get full text
Article -
279
A high-precision edge detection technique for magnetic anomaly signals based on a self-attention mechanism
Published 2025-07-01“…Magnetic data boundary detection is a key technology in potential field data processing, providing an effective basis for the division of geological units and fault structures. It holds significant importance in geological structure analysis and mineral exploration. …”
Get full text
Article -
280
YOLOv10-kiwi: a YOLOv10-based lightweight kiwifruit detection model in trellised orchards
Published 2025-08-01“…Second, to further reduce model complexity, a novel C2fDualHet module is proposed by integrating two consecutive Heterogeneous Kernel Convolution (HetConv) layers as a replacement for the traditional Bottleneck structure. …”
Get full text
Article