Search alternatives:
structured » structure (Expand Search)
structures » structure (Expand Search)
structural » structure (Expand Search)
convolution » convolutional (Expand Search)
structured » structure (Expand Search)
structures » structure (Expand Search)
structural » structure (Expand Search)
convolution » convolutional (Expand Search)
-
421
Identification of diabetic retinopathy lesions in fundus images by integrating CNN and vision mamba models.
Published 2025-01-01“…The majority of deep learning techniques developed for medical image analysis rely on convolutional modules to extract the inherent structure of images within a certain local receptive field. …”
Get full text
Article -
422
Ensemble Streamflow Simulations in a Qinghai–Tibet Plateau Basin Using a Deep Learning Method with Remote Sensing Precipitation Data as Input
Published 2025-03-01“…Streamflow simulations were carried out using models with diverse structures, including the physically based BTOPMC (Block-wise use of TOPMODEL) and two machine learning models, i.e., Random Forest (RF) and Long Short-Term Memory Neural Networks (LSTM). …”
Get full text
Article -
423
Improved YOLOv8s-based foreign object detection method for mine conveyor belts
Published 2025-06-01“…The core feature extraction and fusion module C2f was improved by VMamba's Visual State Space (VSS) module, which efficiently captured global contextual information in images through a state space model and four-directional scanning mechanism, enhancing the model’s understanding of global image structure. …”
Get full text
Article -
424
ST-AGRNN: A Spatio-Temporal Attention-Gated Recurrent Neural Network for Traffic State Forecasting
Published 2022-01-01“…In the proposed model, structure-based and location-based localized spatial features are obtained simultaneously by Graph Convolutional Networks (GCNs) and DeepWalk. …”
Get full text
Article -
425
Rice Leaf Disease Image Enhancement Based on Improved CycleGAN
Published 2024-11-01“…These included user perception evaluation (UPE), structural similarity index (SSIM), peak signal-to-noise ratio (PSNR), and the performance of disease recognition within object detection frameworks. …”
Get full text
Article -
426
Attention-Enhanced Hybrid Automatic Modulation Classification for Advanced Wireless Communication Systems: A Deep Learning-Transformer Framework
Published 2025-01-01“…To address these limitations, this paper presents a novel attention-enhanced hybrid AMC framework that synergistically integrates specialized convolutional layers for efficient temporal feature extraction with a compact transformer encoder for global sequence modeling. …”
Get full text
Article -
427
DiffMamba: semantic diffusion guided feature modeling network for semantic segmentation of remote sensing images
Published 2025-12-01“…DiffMamba uses a hybrid CNNs-Transformer as the encoder structure, and is equipped with the efficient phase sensing module (EPSM), the multi-view transformer module (MVTrans), the semantic diffusion alignment module (SDAM), and the coordinate state space model (CAMamba). …”
Get full text
Article -
428
An Mcformer encoder integrating Mamba and Cgmlp for improved acoustic feature extraction
Published 2025-07-01“…To address this limitation, the Mcformer encoder is introduced, which incorporates the Mamba module in parallel with multi-head attention blocks to enhance the model’s global context processing capabilities. Additionally, a Convolutional Gated Multilayer Perceptron (Cgmlp) structure is employed to improve the extraction of local features through deep convolutional layers. …”
Get full text
Article -
429
CNN–Transformer Hybrid Architecture for Underwater Sonar Image Segmentation
Published 2025-02-01“…FLSSNet is built upon a CNN and Transformer backbone network, integrating four core submodules to address various technical challenges: (1) The asymmetric dual encoder–decoder (ADED) is capable of simultaneously extracting features from different modalities and systematically modeling both local contextual information and global spatial structure. (2) The Transformer feature converter (TFC) module optimizes the multimodal feature fusion process through feature transformation and channel compression. (3) The long-range correlation attention (LRCA) module enhances CNN’s ability to model long-range dependencies through the collaborative use of convolutional kernels, selective sequential scanning, and attention mechanisms, while effectively suppressing noise interference. (4) The recursive contour refinement (RCR) model refines edge contour information through a layer-by-layer recursive mechanism, achieving greater precision in boundary details. …”
Get full text
Article -
430
SFFNet: Shallow Feature Fusion Network Based on Detection Framework for Infrared Small Target Detection
Published 2024-11-01“…Then, we design the visual-Mamba-based global information extension (VMamba-GIE) module, which leverages a multi-branch structure combining the capability of convolutional layers to extract features in local space with the advantages of state space models in the exploration of long-distance information. …”
Get full text
Article -
431
Rain removal method for single image of dual-branch joint network based on sparse transformer
Published 2024-12-01“…Indeed, RSTB preserves the most valuable self-attention values for the aggregation of features, facilitating high-quality image reconstruction from a global perspective. Finally, the parallel dual-branch joint module, composed of RSTB and UEDB branches, effectively captures the local context and global structure, culminating in a clear background image. …”
Get full text
Article -
432
MFPI-Net: A Multi-Scale Feature Perception and Interaction Network for Semantic Segmentation of Urban Remote Sensing Images
Published 2025-07-01“…The Swin Transformer efficiently extracts multi-level global semantic features through its hierarchical structure and window attention mechanism. …”
Get full text
Article -
433
A Spoofing Speech Detection Method Combining Multi-Scale Features and Cross-Layer Information
Published 2025-03-01“…Spoofing speech detection, which is a pressing issue in the age of generative AI, requires both global information and local features of speech. The multi-layer transformer structure in pre-trained speech models can effectively capture temporal information and global context in speech, but there is still room for improvement in handling local features. …”
Get full text
Article -
434
Hierarchical Fusion of Infrared and Visible Images Based on Channel Attention Mechanism and Generative Adversarial Networks
Published 2024-10-01“…The results show that the proposed algorithm retains the global structure features of multilayer images and has obvious advantages in fusion performance, model generalization and computational efficiency.…”
Get full text
Article -
435
AutoLDT: a lightweight spatio-temporal decoupling transformer framework with AutoML method for time series classification
Published 2024-11-01“…Notably, we adopt the Covariance Matrix Adaptation Evolution Strategy and global adaptive pruning technique to realize automated network structure design, which further improves the model training efficiency and automation, and avoids the uncertainty problem of network design. …”
Get full text
Article -
436
DFPF-Net: Dynamically Focused Progressive Fusion Network for Remote Sensing Change Detection
Published 2025-01-01“…To address these challenges, we propose the dynamically focused progressive fusion network (DFPF-Net) to simultaneously tackle global and local noise influences. On one hand, we utilize a pyramid vision transformer (PVT) as a weight-shared siamese network to implement change detection, efficiently fusing multilevel features extracted from the pyramid structure through a residual based progressive enhanced fusion module (PEFM). …”
Get full text
Article -
437
Application of CycleGAN-based low-light image enhancement algorithm in foreign object detection on belt conveyors in underground mines
Published 2025-07-01“…For the discriminator network, a global-local discriminator structure is designed to optimize overall illumination while adaptively enhancing shadow and highlight regions. …”
Get full text
Article -
438
Adaptive Pixel-Level and Superpixel-Level Feature Fusion Transformer for Hyperspectral Image Classification
Published 2024-01-01“…However, graph convolutional networks (GCNs) can effectively extract features from the global structure. …”
Get full text
Article -
439
SwinNowcast: A Swin Transformer-Based Model for Radar-Based Precipitation Nowcasting
Published 2025-04-01“…Through the novel design of a multi-scale feature balancing module (M-FBM), the model dynamically integrates local-scale features with global spatiotemporal dependencies. Specifically, the multi-scale convolutional block attention module (MSCBAM) captures local multi-scale features, while the gated attention feature fusion unit (GAFFU) adaptively regulates the fusion intensity, thereby enhancing spatial structure and temporal continuity in a synergistic manner. …”
Get full text
Article -
440
IMViT: Adjacency Matrix-Based Lightweight Plain Vision Transformer
Published 2025-01-01“…Hierarchical vision transformers are a big family for better efficiency in computer vision, but in order to obtain global dependencies, their design is often complex. …”
Get full text
Article