ASDNet: An Efficient Self-Supervised Convolutional Network for Anomalous Sound Detection

Anomalous Sound Detection (ASD) is crucial for ensuring industrial equipment safety and enhancing production efficiency. However, existing methods, while pursuing high detection accuracy, are often associated with high computational complexity, making them unsuitable for resource-constrained environ...

Full description

Saved in:
Bibliographic Details
Main Authors: Dewei Kong, Guoshun Yuan, Hongjiang Yu, Shuai Wang, Bo Zhang
Format: Article
Language:English
Published: MDPI AG 2025-01-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/2/584
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Anomalous Sound Detection (ASD) is crucial for ensuring industrial equipment safety and enhancing production efficiency. However, existing methods, while pursuing high detection accuracy, are often associated with high computational complexity, making them unsuitable for resource-constrained environments. This study proposes an efficient self-supervised ASD framework that integrates spectral features, lightweight neural networks, and various anomaly scoring methods. Unlike traditional Log-Mel features, spectral features retain richer frequency domain details, providing high-quality inputs that enhance detection accuracy. The framework includes two network architectures: the lightweight ASDNet, optimized for resource-limited scenarios, and SpecMFN, which combines SpecNet and MobileFaceNet for advanced feature extraction and classification. These architectures employ various anomaly scoring methods, enabling complex decision boundaries to effectively detect diverse anomalous patterns. Experimental results demonstrate that ASDNet achieves an average AUC of 94.42% and a pAUC of 87.18%, outperforming existing methods by 6.75% and 9.34%, respectively, while significantly reducing FLOPs (85.4 M, a 93.81% reduction) and parameters (0.51 M, a 41.38% reduction). SpecMFN achieves AUC and pAUC values of 94.36% and 88.60%, respectively, with FLOPs reduced by 86.6%. These results highlight the framework’s ability to balance performance and computational efficiency, making it a robust and practical solution for ASD tasks in industrial and resource-constrained environments.
ISSN:2076-3417