-
261
DSS-MobileNetV3: An Efficient Dynamic-State-Space- Enhanced Network for Concrete Crack Segmentation
Published 2025-06-01“…The DSS-MobileNetV3 adopts a U-shaped encoder–decoder architecture, and a dynamic-state-space (DSS) block is designed into the encoder to improve the MobileNetV3 bottleneck module in modeling global dependencies. The DSS block improves the MobileNetV3 model in structural perception and global dependency modeling for complex crack morphologies by integrating dynamic snake convolution and a state space model. …”
Get full text
Article -
262
LEAD-YOLO: A Lightweight and Accurate Network for Small Object Detection in Autonomous Driving
Published 2025-08-01“…The proposed framework incorporates three innovative components: First, the Backbone integrates a lightweight Convolutional Gated Transformer (CGF) module, which employs normalized gating mechanisms with residual connections, and a Dilated Feature Fusion (DFF) structure that enables progressive multi-scale context modeling through dilated convolutions. …”
Get full text
Article -
263
Lightweight human activity recognition method based on the MobileHARC model
Published 2024-12-01“…However, due to the fact that these models have sequential network structures and are unable to simultaneously focus on local and global features, thus, resulting in a reduction in recognition performance. …”
Get full text
Article -
264
AnoViT: Unsupervised Anomaly Detection and Localization With Vision Transformer-Based Encoder-Decoder
Published 2022-01-01“…These methods are actively used in various fields such as manufacturing, medical care, and intelligent information. Encoder-decoder structures have been widely used in the field of anomaly detection because they can easily learn normal patterns in an unsupervised learning environment and calculate a score to identify abnormalities through a reconstruction error indicating the difference between input and reconstructed images. …”
Get full text
Article -
265
RETINA: Reconstruction-based pre-trained enhanced TransUNet for electron microscopy segmentation on the CEM500K dataset.
Published 2025-05-01“…We developed the RETINA method, which combines pre-training on the large, unlabeled CEM500K EM image dataset with a hybrid neural-network model architecture that integrates both local (convolutional layer) and global (transformer layer) image processing to learn from manual image annotations. …”
Get full text
Article -
266
DeSPPNet: A Multiscale Deep Learning Model for Cardiac Segmentation
Published 2024-12-01“…By processing features at different spatial resolutions, the multiscale densely connected layer in the form of the Pyramid Pooling Dense Module (PPDM) helps the network to capture both local and global context, preserving finer details of the cardiac structure while also capturing the broader context required to accurately segment larger cardiac structures. …”
Get full text
Article -
267
A New Hybrid ConvViT Model for Dangerous Farm Insect Detection
Published 2025-02-01“…This study proposes a novel hybrid convolution and vision transformer model (ConvViT) designed to detect harmful insect species that adversely affect agricultural production and play a critical role in global food security. …”
Get full text
Article -
268
Application of Partial Differential Equation Image Classification Methods to the Aesthetic Evaluation of Images
Published 2021-01-01“…The structure of a convolution kernel learned by using parallel network structure achieves better classification performance. …”
Get full text
Article -
269
TMAR: 3-D Transformer Network via Masked Autoencoder Regularization for Hyperspectral Sharpening
Published 2025-01-01“…In this study, we focus on leveraging the power of CNN and transformer models and propose a multistage deep transformer-based super-resolution network that is regularized via an asymmetric autoencoder structure. In addition, we utilize a 3-D convolution layer in the light transformer structure because it allows for more flexible computation of correlations between HSI layers and better capturing of dependencies within spectral–spatial features. …”
Get full text
Article -
270
Attention residual network for medical ultrasound image segmentation
Published 2025-07-01“…Additionally, a spatial hybrid convolution module is integrated to augment the model’s ability to extract global information and deepen the vertical architecture of the network. …”
Get full text
Article -
271
MTGNet: Multi-Agent End-to-End Motion Trajectory Prediction with Multimodal Panoramic Dynamic Graph
Published 2025-05-01“…In addition, we utilize the graph convolutional neural network (GCN) to process graph-structured data. …”
Get full text
Article -
272
Power Equipment Image Recognition Method Based on Feature Extraction and Deep Learning
Published 2025-01-01“…We plan to introduce a lightweight convolutional structure combined with a graph neural network mechanism to strengthen global context modeling and device structural awareness. …”
Get full text
Article -
273
Non-end-to-end adaptive graph learning for multi-scale temporal traffic flow prediction.
Published 2025-01-01“…Furthermore, the graph structure is dynamically updated using a weighted summation approach.Experiments demonstrate that the proposed method significantly improves prediction accuracy on the PeMSD4 and PeMSD8 datasets. …”
Get full text
Article -
274
Generation driven understanding of localized 3D scenes with 3D diffusion model
Published 2025-04-01“…However, the existing diffusion models primarily focus on the global structure and are constrained by predefined dataset categories, which are unable to accurately resolve the detailed structure of complex 3D scenes. …”
Get full text
Article -
275
Foreign object detection on coal conveyor belt enhanced by attention mechanism
Published 2025-06-01“…A unique combination of convolution and pooling operations was used by the CPCA attention mechanism to perform global average pooling and maximum pooling on the input feature map, multi-dimensional feature information was deeply mined, and then attention weights for each channel and spatial position were accurately generated through nonlinear transformation, guiding the model to focus on the key feature areas of foreign objects and enhance feature extraction capabilities. …”
Get full text
Article -
276
Graph-Based Adaptive Network With Spatial-Spectral Features for Hyperspectral Unmixing
Published 2025-01-01“…In the method, HSIs are treated as data on manifold structures, with superpixels serving as graph nodes to construct a global graph-structured data. …”
Get full text
Article -
277
A VAN-Based Multi-Scale Cross-Attention Mechanism for Skin Lesion Segmentation Network
Published 2023-01-01“…Although many neural networks based on U-shaped structures and methods, such as skip connections have achieved excellent results in medical image segmentation tasks, the properties of convolutional operations limit their ability to effectively learn local and global features. …”
Get full text
Article -
278
3D-SCUMamba: An Abdominal Tumor Segmentation Model
Published 2025-01-01“…Existing deep learning models typically adopt encoder-decoder architectures integrating convolutional layers with global dependency modeling to capture broader contextual information around tumors. …”
Get full text
Article -
279
HMA-Net: a hybrid mixer framework with multihead attention for breast ultrasound image segmentation
Published 2025-06-01“…The model achieved a Jaccard index of 98.04% and 94.84% and a Dice similarity coefficient of 99.01% and 97.35% on the BUSI and BrEaST datasets, respectively.DiscussionThe ConvMixer and ConvNeXT modules are integrated with convolution-enhanced multihead attention, which enhances the model's ability to capture local and global contextual information. …”
Get full text
Article -
280
Diagnosis of Alzheimer’s disease using brain $$^{18}\textrm{F}$$ -FDG PET imaging based on a state space model
Published 2025-07-01“…Building on this, we optimized the original purely convolutional structure into a hybrid architecture combining convolution and Transformer layers. …”
Get full text
Article