-
1821
A Multi-Source Domain Adaptation Method for Bearing Fault Diagnosis with Dynamically Similarity Guidance on Incomplete Data
Published 2025-01-01“…It enhances diagnostic performance in the target domain by transferring knowledge across diverse domains. DS-MDAN uses convolution kernels of different scales to extract multi-scale feature information and achieves feature fusion through upsampling and operations like addition and concatenation. …”
Get full text
Article -
1822
AFN-Net: Adaptive Fusion Nucleus Segmentation Network Based on Multi-Level U-Net
Published 2025-01-01“…This DSCOM effectively preserves high-resolution information and improves the segmentation accuracy of small targets and boundary regions through multi-level convolution operations and channel optimization. Finally, we proposed an Adaptive Fusion Loss Module (AFLM) that effectively balances different lossy targets by dynamically adjusting weights, thereby further improving the model’s performance in segmentation region consistency and boundary accuracy while maintaining classification accuracy. …”
Get full text
Article -
1823
BDSER-InceptionNet: A Novel Method for Near-Infrared Spectroscopy Model Transfer Based on Deep Learning and Balanced Distribution Adaptation
Published 2025-06-01“…Traditional modeling methods exhibit certain limitations in handling these factors, making it difficult to achieve effective adaptation across different scenarios. Specifically, data distribution shifts and mismatches in multi-scale features hinder the transferability of models across different crop varieties or instruments from different manufacturers. …”
Get full text
Article -
1824
FruitQuery: A lightweight query-based instance segmentation model for in-field fruit ripeness determination
Published 2025-12-01“…FruitQuery runs in an end-to-end way and incorporates the convolution and Transformer to capture fine-grained features related to different fruits at different ripeness stages.Extensive experiments on the combined fruit dataset demonstrate that our FruitQuery achieves the highest average precision of 67.02 with only 14.08M parameters, outperforming 13 state-of-the-art models with 33 variants. …”
Get full text
Article -
1825
Multi-level User Interest and Multi-intent Fusion for Next Basket Recommendation
Published 2025-03-01“…A cross-level contrastive learning paradigm is also designed to combine item representations from different levels in order to enhance the semantic information between items at different levels. …”
Get full text
Article -
1826
Lightweight detection algorithms for small targets on unmanned mining trucks
Published 2025-07-01“…Additionally, it designs a detection decoupling head with a multi-head attention mechanism to improve the issue of network complexity caused by convolutional layer redundancy, processes spatial dimensions to focus on capturing target features, reduces interference from irrelevant backgrounds, and enhances the accuracy of occluded target recognition.urthermore, it constructs a lightweight neural network with dual convolution (CDC), enhancing inter-channel information flow, improving model feature expression capability, and reducing model complexity. …”
Get full text
Article -
1827
A Method of Trackside Kilometer Post Identification Combined with YOLOv3 Model
Published 2020-01-01“…Therefore, an image recognition method based on YOLOv3 was proposed, which could still ensure good recognition accuracy in the face of different illumination, complex background and different forms of image. …”
Get full text
Article -
1828
Multi-camera video collaborative analysis method based on edge computing
Published 2023-08-01“…In order to reduce the processing volume of multi-camera real-time video data in smart city scenarios, a video collaborative analysis method based on machine learning algorithms at the edge was proposed.Firstly, for the important objects detected by each camera, different key windows were designed to filter the region of interest (RoI) in the video, reduce the video data volume and extract its features.Then, based on the extracted data features, the same objects in the videos from different cameras were annotated, and a strategy for calculating the association degree value between cameras was designed for further reducing the video data volume.Finally, the GC-ReID algorithm based on graph convolutional network (GCN) and re-identification (ReID) was proposed, aiming at achieving the collaborative analysis of multi-camera videos.The experimental results show that proposed method can effectively reduce the system latency and improve the video compression rate while ensuring the high accuracy, compared with the existing video analysis methods.…”
Get full text
Article -
1829
Adaptive Conditional Reasoning for Remote Sensing Visual Question Answering
Published 2025-04-01“…In order to enhance the multimodal fusion process of different types of questions, the ACR model further integrates visual and textual features by leveraging type-guided cross-attention. …”
Get full text
Article -
1830
Fault diagnosis method of mine hoist main bearing with small sample based on VAE-WGAN
Published 2025-06-01“…In order to improve the feature extraction ability and fault diagnosis accuracy of fault diagnosis models, based on the lightweight convolutional neural network MobileNetV2, the convolutional block attention mechanism CBAM is integrated into the deep feature mapping of MobileNetV2, and an attention mechanism convolutional classification network CBAM-MobileNetV2 is constructed. …”
Get full text
Article -
1831
GAT-Enhanced YOLOv8_L with Dilated Encoder for Multi-Scale Space Object Detection
Published 2025-06-01“…The Dilated Encoder network is introduced to cover different-scale targets by differentiating receptive fields, and the feature weight allocation is optimized by combining it with a Convolutional Block Attention Module (CBAM). …”
Get full text
Article -
1832
Detection of Pathogenic Microorganisms by Fusion of Recursive Feature Pyramid
Published 2023-10-01“…Aiming at the problems of less research on the detection of pathogenic microorganisms , large differences in target size and complex background , a multi-scale detection method of pathogenic microorganisms fused with recursive feature pyramid was proposed. …”
Get full text
Article -
1833
Deep vision-based real-time hand gesture recognition: a review
Published 2025-06-01“…The choice of evaluation metrics and dataset is critical since different tasks require different evaluation parameters, and the model learns more patterns and features from diverse data. …”
Get full text
Article -
1834
A Survey on Immersive Cyber Situational Awareness Systems
Published 2025-06-01“…However, these systems face data occlusion and convolution issues due to the burgeoning complexity, dimensionality, and heterogeneity of cybersecurity data, which damages cyber situational awareness of end-users. …”
Get full text
Article -
1835
Multimodal anomaly detection in complex environments using video and audio fusion
Published 2025-05-01“…The model named Spatio-Temporal Anomaly Detection Network (STADNet) captures the spatio-temporal features of video images through multi-scale Three-Dimensional (3D) convolution module and spatio-temporal attention mechanism. …”
Get full text
Article -
1836
TMFN: a text-based multimodal fusion network with multi-scale feature extraction and unsupervised contrastive learning for multimodal sentiment analysis
Published 2025-01-01“…Firstly, we propose an innovative pyramid-structured multi-scale feature extraction method, which captures the multi-scale features of modal data through convolution kernels of different sizes and strengthens key features through channel attention mechanism. …”
Get full text
Article -
1837
Computer-Aided Brain Tumor Diagnosis: Performance Evaluation of Deep Learner CNN Using Augmented Brain MRI
Published 2021-01-01“…The focus of this research is on early diagnosis of brain tumor via Convolution Neural Network (CNN) to enhance state-of-the-art diagnosis accuracy. …”
Get full text
Article -
1838
Study on the Strategy of Playing Doudizhu Game Based on Multirole Modeling
Published 2020-01-01“…Role modeling learns different roles and behaviors by using a convolutional neural network. …”
Get full text
Article -
1839
Time series image coding classification theory based on Lagrange multiplier method
Published 2025-07-01“…Recent advancements in convolutional neural networks (CNNs) for image recognition and classification have inspired innovative approaches in this field. …”
Get full text
Article -
1840
Influence of Spatial Scale Effect on UAV Remote Sensing Accuracy in Identifying Chinese Cabbage (<i>Brassica rapa</i> subsp. <i>Pekinensis</i>) Plants
Published 2024-10-01“…The research results show that (1) the ExG can effectively distinguish between soil, mulch, and Chinese cabbage plants; (2) images of different spatial resolutions differ in the optimal type of frequency domain filtering and convolution kernel size, and the threshold segmentation effect also varies; (3) as the spatial resolution of the imagery decreases, the optimal window size for morphological filtering also decreases, accordingly; and (4) at a flight height of 30 m to 50 m, the recognition effect is the best, achieving a balance between recognition accuracy and coverage efficiency. …”
Get full text
Article