-
261
DSS-MobileNetV3: An Efficient Dynamic-State-Space- Enhanced Network for Concrete Crack Segmentation
Published 2025-06-01“…The DSS-MobileNetV3 adopts a U-shaped encoder–decoder architecture, and a dynamic-state-space (DSS) block is designed into the encoder to improve the MobileNetV3 bottleneck module in modeling global dependencies. The DSS block improves the MobileNetV3 model in structural perception and global dependency modeling for complex crack morphologies by integrating dynamic snake convolution and a state space model. …”
Get full text
Article -
262
AnoViT: Unsupervised Anomaly Detection and Localization With Vision Transformer-Based Encoder-Decoder
Published 2022-01-01“…Therefore, current image anomaly detection methods have commonly used convolutional encoder-decoders to extract normal information through the local features of images. …”
Get full text
Article -
263
Lightweight human activity recognition method based on the MobileHARC model
Published 2024-12-01“…However, due to the fact that these models have sequential network structures and are unable to simultaneously focus on local and global features, thus, resulting in a reduction in recognition performance. …”
Get full text
Article -
264
LEAD-YOLO: A Lightweight and Accurate Network for Small Object Detection in Autonomous Driving
Published 2025-08-01“…The proposed framework incorporates three innovative components: First, the Backbone integrates a lightweight Convolutional Gated Transformer (CGF) module, which employs normalized gating mechanisms with residual connections, and a Dilated Feature Fusion (DFF) structure that enables progressive multi-scale context modeling through dilated convolutions. …”
Get full text
Article -
265
RETINA: Reconstruction-based pre-trained enhanced TransUNet for electron microscopy segmentation on the CEM500K dataset.
Published 2025-05-01“…We developed the RETINA method, which combines pre-training on the large, unlabeled CEM500K EM image dataset with a hybrid neural-network model architecture that integrates both local (convolutional layer) and global (transformer layer) image processing to learn from manual image annotations. …”
Get full text
Article -
266
A New Hybrid ConvViT Model for Dangerous Farm Insect Detection
Published 2025-02-01“…This study proposes a novel hybrid convolution and vision transformer model (ConvViT) designed to detect harmful insect species that adversely affect agricultural production and play a critical role in global food security. …”
Get full text
Article -
267
DeSPPNet: A Multiscale Deep Learning Model for Cardiac Segmentation
Published 2024-12-01“…By processing features at different spatial resolutions, the multiscale densely connected layer in the form of the Pyramid Pooling Dense Module (PPDM) helps the network to capture both local and global context, preserving finer details of the cardiac structure while also capturing the broader context required to accurately segment larger cardiac structures. …”
Get full text
Article -
268
Attention residual network for medical ultrasound image segmentation
Published 2025-07-01“…Additionally, a spatial hybrid convolution module is integrated to augment the model’s ability to extract global information and deepen the vertical architecture of the network. …”
Get full text
Article -
269
TMAR: 3-D Transformer Network via Masked Autoencoder Regularization for Hyperspectral Sharpening
Published 2025-01-01“…In this study, we focus on leveraging the power of CNN and transformer models and propose a multistage deep transformer-based super-resolution network that is regularized via an asymmetric autoencoder structure. In addition, we utilize a 3-D convolution layer in the light transformer structure because it allows for more flexible computation of correlations between HSI layers and better capturing of dependencies within spectral–spatial features. …”
Get full text
Article -
270
Application of Partial Differential Equation Image Classification Methods to the Aesthetic Evaluation of Images
Published 2021-01-01“…The structure of a convolution kernel learned by using parallel network structure achieves better classification performance. …”
Get full text
Article -
271
Power Equipment Image Recognition Method Based on Feature Extraction and Deep Learning
Published 2025-01-01“…We plan to introduce a lightweight convolutional structure combined with a graph neural network mechanism to strengthen global context modeling and device structural awareness. …”
Get full text
Article -
272
Non-end-to-end adaptive graph learning for multi-scale temporal traffic flow prediction.
Published 2025-01-01“…The method incorporates a multi-scale temporal attention module and a multi-scale temporal convolution module to extract multi-scale information. …”
Get full text
Article -
273
Foreign object detection on coal conveyor belt enhanced by attention mechanism
Published 2025-06-01“…A unique combination of convolution and pooling operations was used by the CPCA attention mechanism to perform global average pooling and maximum pooling on the input feature map, multi-dimensional feature information was deeply mined, and then attention weights for each channel and spatial position were accurately generated through nonlinear transformation, guiding the model to focus on the key feature areas of foreign objects and enhance feature extraction capabilities. …”
Get full text
Article -
274
Generation driven understanding of localized 3D scenes with 3D diffusion model
Published 2025-04-01“…However, the existing diffusion models primarily focus on the global structure and are constrained by predefined dataset categories, which are unable to accurately resolve the detailed structure of complex 3D scenes. …”
Get full text
Article -
275
A VAN-Based Multi-Scale Cross-Attention Mechanism for Skin Lesion Segmentation Network
Published 2023-01-01“…Although many neural networks based on U-shaped structures and methods, such as skip connections have achieved excellent results in medical image segmentation tasks, the properties of convolutional operations limit their ability to effectively learn local and global features. …”
Get full text
Article -
276
3D-SCUMamba: An Abdominal Tumor Segmentation Model
Published 2025-01-01“…Existing deep learning models typically adopt encoder-decoder architectures integrating convolutional layers with global dependency modeling to capture broader contextual information around tumors. …”
Get full text
Article -
277
HMA-Net: a hybrid mixer framework with multihead attention for breast ultrasound image segmentation
Published 2025-06-01“…The model achieved a Jaccard index of 98.04% and 94.84% and a Dice similarity coefficient of 99.01% and 97.35% on the BUSI and BrEaST datasets, respectively.DiscussionThe ConvMixer and ConvNeXT modules are integrated with convolution-enhanced multihead attention, which enhances the model's ability to capture local and global contextual information. …”
Get full text
Article -
278
Diagnosis of Alzheimer’s disease using brain $$^{18}\textrm{F}$$ -FDG PET imaging based on a state space model
Published 2025-07-01“…Building on this, we optimized the original purely convolutional structure into a hybrid architecture combining convolution and Transformer layers. …”
Get full text
Article -
279
Attention-enhanced StrongSORT for robust vehicle tracking in complex environments
Published 2025-05-01“…To address these challenges, we propose AE-StrongSORT (Attention-Enhanced StrongSORT), an attention-enhanced tracking framework featuring three systematic innovations: first, the GAM-YOLO (global attention mechanism-YOLO)hybrid architecture integrates multi-scale feature fusion with a global attention mechanism (GC2f structure). …”
Get full text
Article -
280
A high-precision edge detection technique for magnetic anomaly signals based on a self-attention mechanism
Published 2025-07-01“…Magnetic data boundary detection is a key technology in potential field data processing, providing an effective basis for the division of geological units and fault structures. It holds significant importance in geological structure analysis and mineral exploration. …”
Get full text
Article