Improving Audio Recognition With Randomized Area Ratio Patch Masking: A Data Augmentation Perspective

In audio recognition, improving the accuracy and generalizability of Pretrained Audio Neural Networks (PANNs) remains challenging. This study introduces Randomized Area Ratio Patch Masking (RARPM), a novel data augmentation technique that applies random patches with varying transparency to log mel s...

Full description

Saved in:
Bibliographic Details
Main Authors: Weichun Wong, Yachun Li, Shihan Li
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10706845/
Tags: Add Tag
No Tags, Be the first to tag this record!