-
301
AfaMamba: Adaptive Feature Aggregation With Visual State Space Model for Remote Sensing Images Semantic Segmentation
Published 2025-01-01“…It employs a lightweight ResNet18 as the encoder, and during the decoding phase, it first utilizes a multiscale feature adaptive aggregation module to ensure that the output features from each stage of the encoder contain rich multiscale semantic information. Subsequently, the global-local Mamba structure combines the attention-optimized multiscale convolutional branches with the global branch of Mamba to facilitate effective interaction between global and local features. …”
Get full text
Article -
302
Infrared object detection for robot vision based on multiple focus diffusion and task interaction alignment
Published 2025-07-01“…The feature extraction module adopts a dual-stream fusion structure in the backbone network, which combines the local feature extraction of CNN with the global feature modeling of transformer. …”
Get full text
Article -
303
Vision Mamba and xLSTM-UNet for medical image segmentation
Published 2025-03-01“…Abstract Deep learning-based medical image segmentation methods are generally divided into convolutional neural networks (CNNs) and Transformer-based models. …”
Get full text
Article -
304
YOLO-HVS: Infrared Small Target Detection Inspired by the Human Visual System
Published 2025-07-01“…Meanwhile, the C2f_DWR (dilation-wise residual) module with regional-semantic dual residual structure is designed to significantly improve the efficiency of capturing multi-scale contextual information by expanding convolution and two-step feature extraction mechanism. …”
Get full text
Article -
305
StomaYOLO: A Lightweight Maize Phenotypic Stomatal Cell Detector Based on Multi-Task Training
Published 2025-07-01“…Maize (<i>Zea mays</i> L.), a vital global food crop, relies on its stomatal structure for regulating photosynthesis and responding to drought. …”
Get full text
Article -
306
A lightweight high-frequency mamba network for image super-resolution
Published 2025-07-01“…Various methods based on convolutional neural network (CNN) and Transformer structures have emerged, but few studies have mentioned how to combine these two parts of information. …”
Get full text
Article -
307
A small object detection model in aerial images based on CPDD-YOLOv8
Published 2025-01-01“…Thirdly, a new DSC2f structure is proposed, which uses Dynamic Snake Convolution (DSConv) to take the place of the first standard Conv of Bottleneck in the C2f structure, so that the model can adapt to different inputs more effectively. …”
Get full text
Article -
308
ST-AGRNN: A Spatio-Temporal Attention-Gated Recurrent Neural Network for Traffic State Forecasting
Published 2022-01-01“…In the proposed model, structure-based and location-based localized spatial features are obtained simultaneously by Graph Convolutional Networks (GCNs) and DeepWalk. …”
Get full text
Article -
309
VM-UNet++ research on crack image segmentation based on improved VM-UNet
Published 2025-03-01“…Abstract Cracks are common defects in physical structures, and if not detected and addressed in a timely manner, they can pose a severe threat to the overall safety of the structure. …”
Get full text
Article -
310
Predicting trajectories of coastal area vessels with a lightweight Slice-Diff self attention
Published 2025-04-01“…Recently, graph-based methods have also been used to predict trajectories, however processing graph-structured data introduces significant increase in computation. …”
Get full text
Article -
311
EFINet: Efficient Feature Interaction Network for Real-Time RGB-D Semantic Segmentation
Published 2024-01-01Get full text
Article -
312
Deep joint learning diagnosis of Alzheimer’s disease based on multimodal feature fusion
Published 2024-11-01“…The other branch learned the position information of brain regions with different changes in the different categories of subjects’ brains by introducing attention convolution, and then obtained the discriminative probability information from locations via convolution and global average pooling. …”
Get full text
Article -
313
Anomaly traffic detection method based on data augmentation and feature mining
Published 2025-01-01“…Finally, a multi-layer graph convolutional network with a hierarchical attention mechanism was designed, in which local and global features were hierarchically extracted and fused through a multi-level neighborhood aggregation strategy, significantly enhancing the model’s capability to identify key features. …”
Get full text
Article -
314
HTCNN-Attn: a fine-grained hierarchical multi-label deep learning model for disaster emergency information intelligent extraction from social media
Published 2025-07-01“…It integrates a three-level tree-structured labeling architecture, Transformer-based global feature extraction, convolutional neural network (CNN) layers for local pattern capture, and a hierarchical attention mechanism. …”
Get full text
Article -
315
Multiscale Graph Transformer Network With Dynamic Superpixel Pyramid for Hyperspectral Image Classification
Published 2025-01-01“…To address these limitations, we propose a multi-scale graph transformer network (MSGTN), which captures spatial features at different scales through multiscale graph convolutional networks (GCNs) with adaptive graph structures. …”
Get full text
Article -
316
A security data detection and management method in digital library network based on deep learning
Published 2025-01-01“…The method combines the structures of temporal convolutional network (TCN) and bidirectional gated recurrent unit (BiGRU) to extract spatial and temporal features from digital library network security data. …”
Get full text
Article -
317
CNN–Transformer Hybrid Architecture for Underwater Sonar Image Segmentation
Published 2025-02-01“…FLSSNet is built upon a CNN and Transformer backbone network, integrating four core submodules to address various technical challenges: (1) The asymmetric dual encoder–decoder (ADED) is capable of simultaneously extracting features from different modalities and systematically modeling both local contextual information and global spatial structure. (2) The Transformer feature converter (TFC) module optimizes the multimodal feature fusion process through feature transformation and channel compression. (3) The long-range correlation attention (LRCA) module enhances CNN’s ability to model long-range dependencies through the collaborative use of convolutional kernels, selective sequential scanning, and attention mechanisms, while effectively suppressing noise interference. (4) The recursive contour refinement (RCR) model refines edge contour information through a layer-by-layer recursive mechanism, achieving greater precision in boundary details. …”
Get full text
Article -
318
Pengembangan Deep Learning untuk Sistem Deteksi Dini Komplikasi Kaki Diabetik Menggunakan Citra Termogram
Published 2023-12-01“…In this study, four deep convolutional neural network models were designed with Occam's razor principle through hyperparameter settings on the algorithm structure aspect in the form of number of layers and optimization aspect in the form of optimizer type. …”
Get full text
Article -
319
A Malware Classification Method Based on Knowledge Distillation and Feature Fusion
Published 2025-01-01“…This approach incorporates image texture features with enhanced Local Binary Pattern (LBP), providing insights into the local structure and layout of images and aiding the model in better understanding image details and internal structure, thus enhancing classification performance. …”
Get full text
Article -
320
UNestFormer: Enhancing Decoders and Skip Connections With Nested Transformers for Medical Image Segmentation
Published 2024-01-01“…Precise identification of organs and lesions in medical images is essential for accurate disease diagnosis and analysis of organ structures. Deep convolutional neural network (CNN)-based U-shaped networks are among the most popular and promising approaches for this task. …”
Get full text
Article