-
321
DECTNet: A detail enhanced CNN-Transformer network for single-image deraining
Published 2025-01-01“…While CNNs are highly effective at extracting local information, they struggle to capture global context. Conversely, Transformers excel at capturing global information but often face challenges in preserving spatial and structural details. …”
Get full text
Article -
322
MCGFE-CR: Cloud Removal With Multiscale Context-Guided Feature Enhancement Network
Published 2024-01-01“…To enhance the global structural features after fusion and reduce the impact of SAR speckle noise, we incorporate a Residual Block with Channel Attention (RBCA). …”
Get full text
Article -
323
Power Equipment Image Recognition Method Based on Feature Extraction and Deep Learning
Published 2025-01-01“…We plan to introduce a lightweight convolutional structure combined with a graph neural network mechanism to strengthen global context modeling and device structural awareness. …”
Get full text
Article -
324
Research on SeaTreasure Target Detection Technology Based on Improved YOLOv7-Tiny
Published 2025-01-01“…First, based on the YOLOv7-Tiny network, the MAFPN neck structure is used to replace the ELAN structure to achieve the multi-scale capture of semantic information of underwater sea treasures, and to enhance the UPA-YOLO model to accurately locate the targets of underwater sea treasures; second, the P2ELAN module is constructed and added to the backbone network, which makes use of the redundancy information in the feature map and dynamically adjusts the convolution kernel to adapt to data The P2ELAN module is added to the backbone network, using the redundant information in the feature map, dynamically adjusting the convolutional kernel to adapt to the lack of data, reducing the number of parameters in the model, and introducing the MSCA attention mechanism to inhibit the complex and changeable background features underwater, to improve the semantic feature extraction ability of the UPA-YOLO model for underwater targets, adding the MPDiou loss function to the improved algorithm model and completing the data validation of the detection model; finally, based on the TensorRT acceleration framework, the optimisation of the target detection Finally, based on the TensorRT acceleration framework, the target detection model is optimised, and the Jetson Nano edge device is used to complete the localisation deployment and realise the real-time target detection task of underwater sea treasures. …”
Get full text
Article -
325
Two-Branch Filtering Generative Network Based on Transformer for Image Inpainting
Published 2024-01-01“…This module utilizes predictive filtering constructed from convolutions to leverage local interactions, while simultaneously employing a transformer architecture with kernels from the predictive network to capture global correlations. …”
Get full text
Article -
326
A VAN-Based Multi-Scale Cross-Attention Mechanism for Skin Lesion Segmentation Network
Published 2023-01-01“…Although many neural networks based on U-shaped structures and methods, such as skip connections have achieved excellent results in medical image segmentation tasks, the properties of convolutional operations limit their ability to effectively learn local and global features. …”
Get full text
Article -
327
A Lightweight and Rapid Dragon Fruit Detection Method for Harvesting Robots
Published 2025-05-01“…The method builds upon YOLOv10 and integrates Gated Convolution (gConv) into the C2f module, forming a novel C2f-gConv structure that effectively reduces model parameters and computational complexity. …”
Get full text
Article -
328
DaGAM-Trans: Dual graph attention module-based transformer for offline signature forgery detection
Published 2025-09-01“…The Transformer architecture plays a key role in modeling global contextual dependencies across the entire signature image, enabling the system to capture long-range structural information crucial for distinguishing genuine signatures from skilled forgeries. …”
Get full text
Article -
329
BiEHFFNet: A Water Body Detection Network for SAR Images Based on Bi-Encoder and Hybrid Feature Fusion
Published 2025-07-01“…First, a bi-encoder structure based on ResNet and Swin Transformer is used to jointly extract local spatial details and global contextual information, enhancing feature representation in complex scenarios. …”
Get full text
Article -
330
Predicting trajectories of coastal area vessels with a lightweight Slice-Diff self attention
Published 2025-04-01“…Recently, graph-based methods have also been used to predict trajectories, however processing graph-structured data introduces significant increase in computation. …”
Get full text
Article -
331
StomaYOLO: A Lightweight Maize Phenotypic Stomatal Cell Detector Based on Multi-Task Training
Published 2025-07-01“…Maize (<i>Zea mays</i> L.), a vital global food crop, relies on its stomatal structure for regulating photosynthesis and responding to drought. …”
Get full text
Article -
332
YOLOv8n-WSE-Pest: A Lightweight Deep Learning Model Based on YOLOv8n for Pest Identification in Tea Gardens
Published 2024-09-01“…The addition of the Spatial and Channel Reconstruction Convolution structure in the Backbone layer reduces redundant spatial and channel features, thereby reducing the model’s complexity. …”
Get full text
Article -
333
GHFormer-Net: Towards more accurate small green apple/begonia fruit detection in the nighttime
Published 2022-07-01“…Specifically, PVTv2-B1 based on Transformer is applied as the backbone network to extract feature information from the global receptive, which breaks the limitation that spatial convolution is utilized to extract information from the local area; Next, with the help of FPN, shallow features and high-level features with rich semantic information are incorporated by lateral connections and a top-down structure to generate multi-scale feature maps; Then, a detector of RetinaNet is applied to detect green fruits. …”
Get full text
Article -
334
Dual-branch attention network-based stereoscopicvideo compression
Published 2025-01-01“…First, a Local and Global Encoder-decoder Block (LGEDB) based on Transformer and channel attention was proposed, which accurately captured non-repetitive texture details in local regions and global structural information by integrating pixel-level self-attention within each local area and global attention across channels. …”
Get full text
Article -
335
Fusion of Recurrence Plots and Gramian Angular Fields with Bayesian Optimization for Enhanced Time-Series Classification
Published 2025-07-01“…Time-series classification remains a critical task across various domains, demanding models that effectively capture both local recurrence structures and global temporal dependencies. We introduce a novel framework that transforms time series into image representations by fusing recurrence plots (RPs) with both Gramian Angular Summation Fields (GASFs) and Gramian Angular Difference Fields (GADFs). …”
Get full text
Article -
336
Bitemporal Remote Sensing Change Detection With State-Space Models
Published 2025-01-01“…Change detection in very-high-resolution remote sensing images has gained significant attention, particularly with the rise of deep learning techniques such as convolutional neural networks and Transformers. The Mamba structure, successful in computer vision, has been applied to this domain, enhancing computational efficiency. …”
Get full text
Article -
337
Dynamic atrous attention and dual branch context fusion for cross scale Building segmentation in high resolution remote sensing imagery
Published 2025-08-01“…Among them, we introduced the Shift Operation module and the Self-Attention module, which adopt a dual-branch structure to respectively capture local spatial dependencies and global correlations, and perform weight coupling to achieve highly complementary contextual information fusion. …”
Get full text
Article -
338
Inpainting of damaged temple murals using edge- and line-guided diffusion patch GAN
Published 2024-11-01“…The WSFN uses the original image, a line drawing, and an edge map to capture mural details, which are then texturally inpainted in the SCN using gated convolution for enhanced results. Special attention is given to globally extending the receptive field for large-area inpainting. …”
Get full text
Article -
339
Rice Disease Detection: TLI-YOLO Innovative Approach for Enhanced Detection and Mobile Compatibility
Published 2025-04-01“…As a key global food reserve, rice disease detection technology plays an important role in promoting food production, protecting ecological balance and supporting sustainable agricultural development. …”
Get full text
Article -
340
MESM: integrating multi-source data for high-accuracy protein-protein interactions prediction through multimodal language models
Published 2025-08-01“…Finally, MESM uses Graph Convolutional Network (GCN) and SubgraphGCN to extract global and local features from the perspective of the overall graph and subgraphs. …”
Get full text
Article