-
301
ST-AGRNN: A Spatio-Temporal Attention-Gated Recurrent Neural Network for Traffic State Forecasting
Published 2022-01-01“…In the proposed model, structure-based and location-based localized spatial features are obtained simultaneously by Graph Convolutional Networks (GCNs) and DeepWalk. …”
Get full text
Article -
302
A small object detection model in aerial images based on CPDD-YOLOv8
Published 2025-01-01“…Thirdly, a new DSC2f structure is proposed, which uses Dynamic Snake Convolution (DSConv) to take the place of the first standard Conv of Bottleneck in the C2f structure, so that the model can adapt to different inputs more effectively. …”
Get full text
Article -
303
VM-UNet++ research on crack image segmentation based on improved VM-UNet
Published 2025-03-01“…Abstract Cracks are common defects in physical structures, and if not detected and addressed in a timely manner, they can pose a severe threat to the overall safety of the structure. …”
Get full text
Article -
304
Deep joint learning diagnosis of Alzheimer’s disease based on multimodal feature fusion
Published 2024-11-01“…The other branch learned the position information of brain regions with different changes in the different categories of subjects’ brains by introducing attention convolution, and then obtained the discriminative probability information from locations via convolution and global average pooling. …”
Get full text
Article -
305
MTGNet: Multi-Agent End-to-End Motion Trajectory Prediction with Multimodal Panoramic Dynamic Graph
Published 2025-05-01“…In addition, we utilize the graph convolutional neural network (GCN) to process graph-structured data. …”
Get full text
Article -
306
A security data detection and management method in digital library network based on deep learning
Published 2025-01-01“…The method combines the structures of temporal convolutional network (TCN) and bidirectional gated recurrent unit (BiGRU) to extract spatial and temporal features from digital library network security data. …”
Get full text
Article -
307
Pengembangan Deep Learning untuk Sistem Deteksi Dini Komplikasi Kaki Diabetik Menggunakan Citra Termogram
Published 2023-12-01“…In this study, four deep convolutional neural network models were designed with Occam's razor principle through hyperparameter settings on the algorithm structure aspect in the form of number of layers and optimization aspect in the form of optimizer type. …”
Get full text
Article -
308
CNN–Transformer Hybrid Architecture for Underwater Sonar Image Segmentation
Published 2025-02-01“…FLSSNet is built upon a CNN and Transformer backbone network, integrating four core submodules to address various technical challenges: (1) The asymmetric dual encoder–decoder (ADED) is capable of simultaneously extracting features from different modalities and systematically modeling both local contextual information and global spatial structure. (2) The Transformer feature converter (TFC) module optimizes the multimodal feature fusion process through feature transformation and channel compression. (3) The long-range correlation attention (LRCA) module enhances CNN’s ability to model long-range dependencies through the collaborative use of convolutional kernels, selective sequential scanning, and attention mechanisms, while effectively suppressing noise interference. (4) The recursive contour refinement (RCR) model refines edge contour information through a layer-by-layer recursive mechanism, achieving greater precision in boundary details. …”
Get full text
Article -
309
Multiscale Graph Transformer Network With Dynamic Superpixel Pyramid for Hyperspectral Image Classification
Published 2025-01-01“…To address these limitations, we propose a multi-scale graph transformer network (MSGTN), which captures spatial features at different scales through multiscale graph convolutional networks (GCNs) with adaptive graph structures. …”
Get full text
Article -
310
A Drug-Target Interaction Prediction Method Based on Attention Perception and Modality Fusion
Published 2025-05-01“…[Methods] For drug branches, Graph Transformer and Graph Convolutional Neural Network were used to jointly characterize the global structures and biochemical information of drug molecules. …”
Get full text
Article -
311
AutoLDT: a lightweight spatio-temporal decoupling transformer framework with AutoML method for time series classification
Published 2024-11-01“…The framework proposes a novel lightweight Transformer with fuzzy position encoding, TS-separable linear self-attention mechanism, and convolutional feedforward network, which mine the temporal and spatial features, as well as the local and global relationship of time series. …”
Get full text
Article -
312
A Malware Classification Method Based on Knowledge Distillation and Feature Fusion
Published 2025-01-01“…This approach incorporates image texture features with enhanced Local Binary Pattern (LBP), providing insights into the local structure and layout of images and aiding the model in better understanding image details and internal structure, thus enhancing classification performance. …”
Get full text
Article -
313
UNestFormer: Enhancing Decoders and Skip Connections With Nested Transformers for Medical Image Segmentation
Published 2024-01-01“…Precise identification of organs and lesions in medical images is essential for accurate disease diagnosis and analysis of organ structures. Deep convolutional neural network (CNN)-based U-shaped networks are among the most popular and promising approaches for this task. …”
Get full text
Article -
314
InGSA: integrating generalized self-attention in CNN for Alzheimer's disease classification
Published 2025-03-01“…Furthermore, several GSA heads are used to exploit other dependency structures of global features as well. Our evaluation of InGSA on a two benchmark dataset, using various pre-trained networks, demonstrates the GSA's superior performance.…”
Get full text
Article -
315
Generation driven understanding of localized 3D scenes with 3D diffusion model
Published 2025-04-01“…However, the existing diffusion models primarily focus on the global structure and are constrained by predefined dataset categories, which are unable to accurately resolve the detailed structure of complex 3D scenes. …”
Get full text
Article -
316
GIVTED-Net: GhostNet-Mobile Involution ViT Encoder-Decoder Network for Lightweight Medical Image Segmentation
Published 2024-01-01“…Nevertheless, conventional CNN layers, such as convolution and pooling, demonstrate a spatial inductive bias that constrains their ability to instantly capture global context information. …”
Get full text
Article -
317
Semantic ECG hash similarity graph
Published 2025-07-01“…However, most existing graph structures primarily focus on local similarity while overlooking global semantic correlation. …”
Get full text
Article -
318
DBRSNet: a dual-branch remote sensing image segmentation model based on feature interaction and multi-scale feature fusion
Published 2025-07-01“…In DBRSNet, the Feature-Guided Selection Module (FGSM) adaptively integrates complementary features from CNN and Transformer branches, while the Convolutional Attention Integration Module (CAIM) enhances global dependencies and spectral correlations, ensuring a more comprehensive feature representation. …”
Get full text
Article -
319
A lightweight steel surface defect detection network based on YOLOv9
Published 2025-05-01“…Next, we replace the regular convolution blocks in the model network with spatial-to-depth convolutions, further reducing the model’s computational complexity while retaining global feature information. …”
Get full text
Article -
320
YOLO-DAFS: A Composite-Enhanced Underwater Object Detection Algorithm
Published 2025-05-01“…The detection head adopts DyHead-GDC, integrating ghost depthwise separable convolution with DyHead for greater efficiency. Furthermore, the ADown module replaces conventional feature extraction and downsampling convolutions, reducing parameters and FLOPs by 14%. …”
Get full text
Article