-
401
YOLO-SWD—An Improved Ship Recognition Algorithm for Feature Occlusion Scenarios
Published 2025-03-01“…Three improved modules are introduced: the DLKA module enhances the perception of local details and global context through dynamic deformable convolution and large receptive field attention mechanisms; the CKSP module improves the model’s ability to extract target boundaries and shapes; and the WTHead enhances the diversity and robustness of feature extraction. …”
Get full text
Article -
402
A Medical Image Semantics Segmentation Method Based on Image Pre-processing and Image Transformer
Published 2025-01-01“…Existing models in this field pay more attention to editing U-Net’s structure and convolutional layers. In this paper, an image pre-processing technique to enrich the potential of images and an image transformer involved in network TrUNet are proposed to tackle these issues. …”
Get full text
Article -
403
HG-Mamba: A Hybrid Geometry-Aware Bidirectional Mamba Network for Hyperspectral Image Classification
Published 2025-06-01“…The second stage, designated spatial structure perception and context modeling, incorporates a Gaussian Distance Decay (GDD) mechanism to adaptively reweight spatial neighbors based on geometric distances, coupled with a spatial bidirectional Mamba (SaBM) module for comprehensive global context modeling. …”
Get full text
Article -
404
Enhanced Cross-stage-attention U-Net for esophageal target volume segmentation
Published 2024-12-01“…WRA was employed to capture global attention, whose large convolution kernel was further decomposed to simplify the calculation. …”
Get full text
Article -
405
Improved YOLOv8s-based foreign object detection method for mine conveyor belts
Published 2025-06-01“…The core feature extraction and fusion module C2f was improved by VMamba's Visual State Space (VSS) module, which efficiently captured global contextual information in images through a state space model and four-directional scanning mechanism, enhancing the model’s understanding of global image structure. …”
Get full text
Article -
406
Hierarchical Semi-Supervised Representation Learning for Cyber Physical Social Intelligence
Published 2025-06-01“…Simultaneously, a learnable graph neural network captures global topology using a graph structure-level reconstruction loss. …”
Get full text
Article -
407
Rain removal method for single image of dual-branch joint network based on sparse transformer
Published 2024-12-01“…Indeed, RSTB preserves the most valuable self-attention values for the aggregation of features, facilitating high-quality image reconstruction from a global perspective. Finally, the parallel dual-branch joint module, composed of RSTB and UEDB branches, effectively captures the local context and global structure, culminating in a clear background image. …”
Get full text
Article -
408
DiffMamba: semantic diffusion guided feature modeling network for semantic segmentation of remote sensing images
Published 2025-12-01“…With the rapid development of remote sensing technology, the application scope of high-resolution remote sensing images (HR-RSIs) has been continuously expanding. The emergence of convolutional neural networks and Transformer models has significantly enhanced the accuracy of semantic segmentation. …”
Get full text
Article -
409
DDANet: A deep dilated attention network for intracerebral haemorrhage segmentation
Published 2024-12-01“…Additionally, the authors incorporate a self‐attention mechanism to capture global semantic information of high‐level features to guide the extraction and processing of low‐level features, thereby enhancing the model's understanding of the overall structure while maintaining details. …”
Get full text
Article -
410
Robust SAR-assisted cloud removal via supervised align-guided fusion and bidirectional hybrid reconstruction
Published 2025-08-01“…The bidirectional hybrid reconstruction module integrates global and local information via the parallel combination of convolution and transformer layers to ensure consistent filling in both the central and boundary areas. …”
Get full text
Article -
411
Improved stereo matching network based on dense multi-scale feature guided cost aggregation
Published 2024-02-01“…Firstly, a dense multi-scale feature extraction module was designed based on the dense atrous spatial pyramid pooling structure. This module extracted region-level features of different scales by using atrous convolution of different expansion rates, and effectively fused image features of different scales through dense connection, so that the network can capture contextual information. …”
Get full text
Article -
412
Anomaly traffic detection method based on data augmentation and feature mining
Published 2025-01-01“…Finally, a multi-layer graph convolutional network with a hierarchical attention mechanism was designed, in which local and global features were hierarchically extracted and fused through a multi-level neighborhood aggregation strategy, significantly enhancing the model’s capability to identify key features. …”
Get full text
Article -
413
A Spoofing Speech Detection Method Combining Multi-Scale Features and Cross-Layer Information
Published 2025-03-01“…Spoofing speech detection, which is a pressing issue in the age of generative AI, requires both global information and local features of speech. The multi-layer transformer structure in pre-trained speech models can effectively capture temporal information and global context in speech, but there is still room for improvement in handling local features. …”
Get full text
Article -
414
Hierarchical Fusion of Infrared and Visible Images Based on Channel Attention Mechanism and Generative Adversarial Networks
Published 2024-10-01“…The results show that the proposed algorithm retains the global structure features of multilayer images and has obvious advantages in fusion performance, model generalization and computational efficiency.…”
Get full text
Article -
415
DFPF-Net: Dynamically Focused Progressive Fusion Network for Remote Sensing Change Detection
Published 2025-01-01“…To address these challenges, we propose the dynamically focused progressive fusion network (DFPF-Net) to simultaneously tackle global and local noise influences. On one hand, we utilize a pyramid vision transformer (PVT) as a weight-shared siamese network to implement change detection, efficiently fusing multilevel features extracted from the pyramid structure through a residual based progressive enhanced fusion module (PEFM). …”
Get full text
Article -
416
Application of CycleGAN-based low-light image enhancement algorithm in foreign object detection on belt conveyors in underground mines
Published 2025-07-01“…For the discriminator network, a global-local discriminator structure is designed to optimize overall illumination while adaptively enhancing shadow and highlight regions. …”
Get full text
Article -
417
Ensemble Streamflow Simulations in a Qinghai–Tibet Plateau Basin Using a Deep Learning Method with Remote Sensing Precipitation Data as Input
Published 2025-03-01“…By employing a 1D Convolutional Neural Networks (1D CNN), streamflow simulations from multiple models are integrated and a Shapley Additive exPlanations (SHAP) interpretability analysis was conducted to examine the contributions of individual models on ensemble streamflow simulation. …”
Get full text
Article -
418
GNODEVAE: a graph-based ODE-VAE enhances clustering for single-cell data
Published 2025-08-01“…Current methods struggle to simultaneously preserve global structure, model cellular dynamics, and handle technical noise effectively. …”
Get full text
Article -
419
Path planning of intelligent tennis ball picking robot integrating twin network target tracking algorithm
Published 2025-07-01“…Additionally, the Transformer structure improves tracking accuracy by capturing the global context relationship. …”
Get full text
Article -
420
A deep fusion‐based vision transformer for breast cancer classification
Published 2024-12-01“…Cancerous tissue detection in histopathological images relies on complex features related to tissue structure and staining properties. Convolutional neural network (CNN) models like ResNet50, Inception‐V1, and VGG‐16, while useful in many applications, cannot capture the patterns of cell layers and staining properties. …”
Get full text
Article