-
401
Lightweight Road Environment Segmentation using Vector Quantization
Published 2025-07-01“…Numerous works based on Fully Convolutional Networks (FCNs) and Transformer architectures have been proposed to leverage local and global contextual learning for efficient and accurate semantic segmentation. …”
Get full text
Article -
402
DASNet a dual branch multi level attention sheep counting network
Published 2025-07-01“…DASNet is built on a modified VGG–19 architecture, where a dual-branch structure is employed to integrate both shallow and deep features. …”
Get full text
Article -
403
TMAR: 3-D Transformer Network via Masked Autoencoder Regularization for Hyperspectral Sharpening
Published 2025-01-01“…In this study, we focus on leveraging the power of CNN and transformer models and propose a multistage deep transformer-based super-resolution network that is regularized via an asymmetric autoencoder structure. In addition, we utilize a 3-D convolution layer in the light transformer structure because it allows for more flexible computation of correlations between HSI layers and better capturing of dependencies within spectral–spatial features. …”
Get full text
Article -
404
HMA-Net: a hybrid mixer framework with multihead attention for breast ultrasound image segmentation
Published 2025-06-01“…The model achieved a Jaccard index of 98.04% and 94.84% and a Dice similarity coefficient of 99.01% and 97.35% on the BUSI and BrEaST datasets, respectively.DiscussionThe ConvMixer and ConvNeXT modules are integrated with convolution-enhanced multihead attention, which enhances the model's ability to capture local and global contextual information. …”
Get full text
Article -
405
GhostConv+CA-YOLOv8n: a lightweight network for rice pest detection based on the aggregation of low-level features in real-world complex backgrounds
Published 2025-08-01“…To overcome these limitations, this paper introduces GhostConv+CA-YOLOv8n, a lightweight object detection framework was proposed, which incorporates several innovative features: GhostConv replaces standard convolutional operations with computationally efficient ghost modules in the YOLOv8n’s backbone structure, reducing parameters by 40,458 while maintaining feature richness; a Context Aggregation (CA) module is applied after the large and medium-sized feature maps were output by the YOLOv8n’s neck structure. …”
Get full text
Article -
406
Rice Leaf Disease Image Enhancement Based on Improved CycleGAN
Published 2024-11-01“…These included user perception evaluation (UPE), structural similarity index (SSIM), peak signal-to-noise ratio (PSNR), and the performance of disease recognition within object detection frameworks. …”
Get full text
Article -
407
SCCA-YOLO: Spatial Channel Fusion and Context-Aware YOLO for Lunar Crater Detection
Published 2025-07-01“…Specifically, the Context-Aware Module (CAM) employs a multi-branch dilated convolutional structure to enhance feature richness and expand the local receptive field, thereby strengthening the feature extraction capability. …”
Get full text
Article -
408
A High-Precision Defect Detection Approach Based on BiFDRep-YOLOv8n for Small Target Defects in Photovoltaic Modules
Published 2025-04-01“…With the accelerated transition of the global energy structure towards decarbonization, the share of PV power generation in the power system continues to rise. …”
Get full text
Article -
409
An industrial carbon block instance segmentation algorithm based on improved YOLOv8
Published 2025-03-01“…YOLOv8-HDSA adds a convolutional self-attention mechanism with residual structure to the head, preserving important local information of carbon blocks and improving the ability to extract fine-grained edge details and global features of carbon blocks. …”
Get full text
Article -
410
Liver Semantic Segmentation Method Based on Multi-Channel Feature Extraction and Cross Fusion
Published 2025-06-01“…Secondly, an atrous spatial pyramid pooling (ASPP) module is incorporated into the bottleneck layer to capture features at various receptive fields using dilated convolutions, while global pooling is applied to enhance the acquisition of contextual information and ensure efficient feature transmission. …”
Get full text
Article -
411
RMIS-Net: a fast medical image segmentation network based on multilayer perceptron
Published 2025-05-01“…To address the persistent challenges of computational complexity and efficiency limitations in existing methods, we propose RMIS-Net—an innovative lightweight segmentation network with three core components: a convolutional layer for preliminary feature extraction, a shift-based fully connected layer for parameter-efficient spatial modeling, and a tokenized multilayer perceptron for global context capture. …”
Get full text
Article -
412
Novel Deep Learning Framework for Evaporator Tube Leakage Estimation in Supercharged Boiler
Published 2025-07-01“…To address these issues, this study proposes a novel deep learning framework (LSTM-CNN–attention), combining a Long Short-Term Memory (LSTM) network with a dual-pathway spatial feature extraction structure (ACNN) that includes an attention mechanism(attention) and a 1D convolutional neural network (1D-CNN) parallel pathway. …”
Get full text
Article -
413
Dynamic atrous attention and dual branch context fusion for cross scale Building segmentation in high resolution remote sensing imagery
Published 2025-08-01“…Among them, we introduced the Shift Operation module and the Self-Attention module, which adopt a dual-branch structure to respectively capture local spatial dependencies and global correlations, and perform weight coupling to achieve highly complementary contextual information fusion. …”
Get full text
Article -
414
GIVTED-Net: GhostNet-Mobile Involution ViT Encoder-Decoder Network for Lightweight Medical Image Segmentation
Published 2024-01-01“…Nevertheless, conventional CNN layers, such as convolution and pooling, demonstrate a spatial inductive bias that constrains their ability to instantly capture global context information. …”
Get full text
Article -
415
FPA-based weighted average ensemble of deep learning models for classification of lung cancer using CT scan images
Published 2025-06-01“…Abstract Cancer is among the most dangerous diseases contributing to rising global mortality rates. Lung cancer, particularly adenocarcinoma, is one of the deadliest forms and severely impacts human life. …”
Get full text
Article -
416
A low illumination target detection method based on a dynamic gradient gain allocation strategy
Published 2024-11-01“…Firstly, efficient multi-scale feature fusion is performed by using a new neck structure in the original model so that it can fully exchange high-level semantic information and low-level spatial information. …”
Get full text
Article -
417
MultiRepPI: a cross-modal feature fusion-based multiple characterization framework for plant peptide-protein interaction prediction
Published 2025-07-01“…First, most methods fail to adequately integrate multimodal information such as sequence, structure, and disorder properties, leading to inadequate characterization of complex interaction patterns. …”
Get full text
Article -
418
Two-dimensional spatial orientation relation recognition between image objects
Published 2025-07-01“…A dedicated fusion module synthesizes features from both branches, generating a structured triple list that documents detected objects, their inter-object spatial orientations, and associated confidence scores. …”
Get full text
Article -
419
MDFT-GAN: A Multi-Domain Feature Transformer GAN for Bearing Fault Diagnosis Under Limited and Imbalanced Data Conditions
Published 2025-05-01“…While generative adversarial networks (GANs) have shown promise in data augmentation, their efficacy deteriorates in the presence of multi-category and structurally complex fault distributions. To address these challenges, this paper proposes a novel fault diagnosis framework based on a Multi-Domain Feature Transformer GAN (MDFT-GAN). …”
Get full text
Article -
420
CrysMTM: a multiphase, temperature-resolved, multimodal dataset for crystalline materials
Published 2025-01-01“…This multimodal structure enables both supervised and self-supervised learning across graph-based, image-based, and language-based architectures. …”
Get full text
Article