-
181
ROPGCViT: A Novel Explainable Vision Transformer for Retinopathy of Prematurity Diagnosis
Published 2025-01-01“…GCViT was enhanced with Squeeze-and-Excitation (SE) block and Residual Multilayer Perceptron (RMLP) structures to effectively learn local and global context information. …”
Get full text
Article -
182
Detection of Diabetic Retinopathy Using a Multi-Decision Inception-ResNet-Blended Hybrid Model
Published 2025-01-01“…The model undergoes thorough pre-processing and testing phases, utilizing eight layers of convolutions at each stage to handle various data matrices and integrate global and specialized features. …”
Get full text
Article -
183
Multi-scale Information Aggregation for Spoofing Detection
Published 2024-11-01“…In this paper, we propose a spoofing detection system built on SincNet and Deep Layer Aggregation (DLA), which leverages speech representations at different levels to distinguish synthetic speech. DLA is totally convolutional with an iterative tree-like structure. …”
Get full text
Article -
184
Fine-grained crop pest classification based on multi-scale feature fusion and mixed attention mechanisms
Published 2025-04-01“…Additionally, a Transformer block is integrated to overcome the limitations of traditional convolutional approaches in capturing global contextual information. …”
Get full text
Article -
185
Using deep learning to screen OCTA images for hypertension to reduce the risk of serious complications
Published 2025-07-01“…Ophthalmic vessels are the only vascular structures that can be directly observed in vivo in a non-invasive manner. …”
Get full text
Article -
186
NGSTGAN: N-Gram Swin Transformer and Multi-Attention U-Net Discriminator for Efficient Multi-Spectral Remote Sensing Image Super-Resolution
Published 2025-06-01“…Recent advancements in convolutional neural networks (CNNs) and Transformers have significantly improved RSISR performance due to their capabilities in local feature extraction and global modeling. …”
Get full text
Article -
187
PolyBuild: An End-to-End Method for Polygonal Building Contour Extraction From High-Resolution Remote Sensing Images
Published 2025-01-01“…However, the presence of varying imaging conditions and complex building structures, makes automatic contour extraction extremely challenging. …”
Get full text
Article -
188
Astronomical Image Superresolution Reconstruction with Deep Learning for Better Identification of Interacting Galaxies
Published 2025-01-01“…To further improve visual quality and enhance the details of galaxy structures, we propose a dual-branch network structure combining convolutional neural networks (CNNs) and Transformer (DBCTNet), which leverages the local characteristics of CNNs to complement the global features of Transformer. …”
Get full text
Article -
189
Enhanced Conformer-Based Speech Recognition via Model Fusion and Adaptive Decoding with Dynamic Rescoring
Published 2024-12-01Get full text
Article -
190
Frequency-Aware Learned Image Compression Using Channel-Wise Attention and Restormer
Published 2025-01-01“…However, learned image compression has a limitation of balancing global context and local texture because the global structure easily ignores local redundancy, especially for the non-repetitive textures, affecting the reconstruction performance. …”
Get full text
Article -
191
Hybrid Deep Learning Architecture with Adaptive Feature Fusion for Multi-Stage Alzheimer’s Disease Classification
Published 2025-06-01“…Background/Objectives: Alzheimer’s disease (AD), a progressive neurodegenerative disorder, demands precise early diagnosis to enable timely interventions. Traditional convolutional neural networks (CNNs) and deep learning models often fail to effectively integrate localized brain changes with global connectivity patterns, limiting their efficacy in Alzheimer’s disease (AD) classification. …”
Get full text
Article -
192
Seismic Waveform Feature Extraction and Reservoir Prediction Based on CNN and UMAP: A Case Study of the Ordos Basin
Published 2025-06-01“…The UMAP-CNN framework leverages the strengths of manifold learning and deep learning, enabling multi-scale feature extraction and dimensionality reduction while preserving both local and global data structures. The evaluation experiments, which considered runtime, receiver operating characteristic (ROC) curves, embedding distribution maps, and other quantitative assessments, illustrated that the UMAP-CNN outperformed t-distributed stochastic neighbor embedding (t-SNE), locally linear embedding (LLE) and isometric feature mapping (Isomap). …”
Get full text
Article -
193
Joint Grid-Based Attention and Multilevel Feature Fusion for Landslide Recognition
Published 2024-01-01“…Landslide recognition (LR) is a fundamental task for disaster prevention and control. Convolutional neural networks (CNNs) and transformer architectures have been widely used for extracting landslide information. …”
Get full text
Article -
194
-
195
Multi-Scale Venation Pattern Analysis for Medicinal Plant Species Recognition
Published 2025-01-01“…Our method extracts and analyzes venation patterns at multiple spatial scales, capturing both global and fine-grained structural details to improve classification performance. …”
Get full text
Article -
196
DSMF-Net: Dual Semantic Metric Learning Fusion Network for Few-Shot Aerial Image Semantic Segmentation
Published 2025-01-01“…To exploit multiscale global semantic context, we construct scale-aware graph prototypes from different stages of the feature layers based on graph convolutional networks (GCNs), while also incorporating prior-guided metric learning to further enhance context at the high-level convolution features. …”
Get full text
Article -
197
A seismic random noise suppression method based on CNN-Mamba
Published 2025-05-01“…This limitation results in insufficient collaborative optimization between local details and macroscopic structures during denoising, further reducing the noise suppression accuracy. …”
Get full text
Article -
198
Causal inference-based graph neural network method for predicting asphalt pavement performance
Published 2025-03-01“…The model comprises four modules: global feature extraction, local feature extraction,causal inference, and dual-channel graph convolution. …”
Get full text
Article -
199
MultiV_Nm: a prediction method for 2′-O-methylation sites based on multi-view features
Published 2025-05-01“…MultiV_Nm extracts the features of Nm sites from multiple dimensions, including sequence features, chemical characteristics, and secondary structure features. By integrating the powerful local feature extraction ability of convolutional neural networks, the ability of graph attention networks to capture global structural information, and the efficient interaction advantage of cross-attention mechanisms for different features, it deeply explores and integrates multi-view features, and finally realizes the prediction of Nm modification sites. …”
Get full text
Article -
200
Multi-Scale Spatial Perception Attention Network for Few-Shot Hyperspectral Image Classification
Published 2024-01-01“…In the encoder, the spatial contraction perception Transformer (SCPFormer) is first proposed to improve the model’s capacity for perceiving global-local joint features. Next, the multi-scale spatial attention (MSSA) module is proposed to capture spatial information at different convolution kernel scales and cascade them to form a more comprehensive representation structure. …”
Get full text
Article