-
461
Two-dimensional spatial orientation relation recognition between image objects
Published 2025-07-01
“…A dedicated fusion module synthesizes features from both branches, generating a structured triple list that documents detected objects, their inter-object spatial orientations, and associated confidence scores. …”
Article -
462
MDFT-GAN: A Multi-Domain Feature Transformer GAN for Bearing Fault Diagnosis Under Limited and Imbalanced Data Conditions
Published 2025-05-01
“…To improve classification performance, an Enhanced Hybrid Visual Transformer (EH-ViT) is constructed by coupling a lightweight convolutional stem with a ViT encoder, enabling robust and discriminative fault identification. …”
Article -
463
Inpainting of damaged temple murals using edge- and line-guided diffusion patch GAN
Published 2024-11-01
“…The WSFN uses the original image, a line drawing, and an edge map to capture mural details, which are then texturally inpainted in the SCN using gated convolution for enhanced results. Special attention is given to globally extending the receptive field for large-area inpainting. …”
Article -
464
CrysMTM: a multiphase, temperature-resolved, multimodal dataset for crystalline materials
Published 2025-01-01
“…This multimodal structure enables both supervised and self-supervised learning across graph-based, image-based, and language-based architectures. …”
Article -
465
A web-based artificial intelligence system for label-free virus classification and detection of cytopathic effects
Published 2025-02-01
“…AIRVIC’s hierarchical structure highlights its adaptability to virological diagnostics, providing unbiased infectivity scoring and facilitating viral isolation and antiviral efficacy testing. …”
Article -
466
An improved U-net and attention mechanism-based model for sugar beet and weed segmentation
Published 2025-01-01
“…To address this issue, this paper proposes an efficient crop-weed segmentation model based on an improved UNet architecture and attention mechanisms to enhance both recognition accuracy and processing speed. Methods: The model adopts the encoder-decoder structure of UNet, utilizing MaxViT (Multi-Axis Vision Transformer) as the encoder to capture both global and local features within images. …”
Article -
467
EFINet: Efficient Feature Interaction Network for Real-Time RGB-D Semantic Segmentation
Published 2024-01-01
Article -
468
FinSafeNet: securing digital transactions using optimized deep learning and multi-kernel PCA(MKPCA) with Nyström approximation
Published 2024-11-01
“…FinSafeNet is based on a Bi-Directional Long Short-Term Memory (Bi-LSTM), a Convolutional Neural Network (CNN) and an additional dual attention mechanism to study the transaction data and influence the observation of various security threats. …”
Article -
469
A lightweight intelligent compression method for fast Sea Level Anomaly data transmission.
Published 2025-01-01
“…., peak signal-to-noise ratio, PSNR; structural similarity index, SSIM). The architecture integrates global-local dual discriminators to enforce spatiotemporal coherence of mesoscale vortices, employs dilated convolutions to enhance feature receptive fields without computational overhead, and incorporates vortex recognition rate as a physics-aware evaluation metric. …”
Article -
470
Predicting peak ground acceleration using the ConvMixer network
Published 2025-04-01
“…The proposed ConvMixer is a patch-based model that extracts global features from input seismic data and predicts the PGA of an earthquake by combining depth and pointwise convolutions. …”
Article -
471
XTNSR: Xception-based transformer network for single image super resolution
Published 2025-01-01
“…A multi-layer feature fusion block with skip connections, part of this hybrid architecture, guarantees efficient local and global feature fusion. The experimental results show better performance in Peak signal-to-noise ratio (PSNR), structural similarity index measure (SSIM), and visual quality than the state-of-the-art techniques. …”
Article -
472
Traffic environment perception algorithm based on multi-task feature fusion and orthogonal attention
Published 2025-06-01
“…A notable advancement introduced in MTEPN is the cross-task feature aggregation structure. This module promotes information complementarity between tasks by implicitly modeling the global context relationships among different visual tasks. …”
Article -
473
One Health interventions and challenges under rural African smallholder farmer settings: A scoping review
Published 2025-06-01
“…The global human population is rapidly increasing, escalating interactions of people, animals and the environment. …”
Article -
474
Automated recognition of deep-sea benthic megafauna in polymetallic nodule mining areas based on deep learning
Published 2025-12-01
“…Its backbone integrates deformable convolutions, attention mechanisms, and ResNet structures to improve feature extraction and reduce background interference. …”
Article -
475
LWSARDet: A Lightweight SAR Small Ship Target Detection Network Based on a Position–Morphology Matching Mechanism
Published 2025-07-01
“…On the other hand, to reduce feature dilution and computational redundancy in traditional detection heads when focusing on small targets, we replace conventional convolutions with simple linear transformations and design a lightweight detection head, LSD-Head. …”
Article -
476
PC3D-YOLO: An Enhanced Multi-Scale Network for Crack Detection in Precast Concrete Components
Published 2025-06-01
“…To address these limitations, we propose PC3D-YOLO, an enhanced framework derived from YOLOv11, which strengthens long-range dependency modeling through multi-scale feature integration, offering a novel approach for crack detection in precast concrete structures. Our methodology involves three key innovations: (1) the Multi-Dilation Spatial-Channel Fusion with Shuffling (MSFS) module, employing dilated convolutions and channel shuffling to enable global feature fusion, replaces the C3K2 bottleneck module to enhance long-distance dependency capture; (2) the AIFI_M2SA module substitutes the conventional SPPF to mitigate its restricted receptive field and information loss, incorporating multi-scale attention for improved near-far contextual integration; (3) a redesigned neck network (MSCD-Net) preserves rich contextual information across all feature scales. …”
Article -
477
A Picking Point Localization Method for Table Grapes Based on PGSS-YOLOv11s and Morphological Strategies
Published 2025-07-01
“…To address these issues, this study proposes a novel picking point localization method for table grapes based on an instance segmentation network called Progressive Global-Local Structure-Sensitive Segmentation (PGSS-YOLOv11s) and a simple combination strategy of morphological operators. …”
Article -
478
Automatic Mushroom Species Classification Model for Foodborne Disease Prevention Based on Vision Transformer
Published 2022-01-01
“…Mushrooms are the fleshy, spore-bearing structure of certain fungi, produced by a group of mycelia and buried in a substratum. …”
Article -
479
MAMNet: Lightweight Multi-Attention Collaborative Network for Fine-Grained Cropland Extraction from Gaofen-2 Remote Sensing Imagery
Published 2025-05-01
“…Second, the global–local Transformer block (GLTB) decoder uses multi-head self-attention mechanisms to dynamically fuse multi-scale features across layers, effectively restoring the topological structure of fragmented farmland boundaries. …”
Article -
480
A Low Complexity Algorithm for 3D-HEVC Depth Map Intra Coding Based on MAD and ResNet
Published 2025-01-01
“…As an extension of HEVC, 3D-HEVC retains the quadtree structure inherent to HEVC and is currently recognized as the most widely adopted international standard for stereoscopic video coding. …”
Article