-
1
Time alignment in speech and speaker recognition
Published 2001-12-01“… Time scale alignment problems in speech and speaker recognition by voice are investigated. Dynamic programming approach to solution of this problem is discussed. …”
Get full text
Article -
2
Cross-linguistic rhythmic patterns in Persian-English bilingual speakers: Implications for speaker recognition
Published 2024-12-01“…This research has implications for understanding bilingual speech production and enhancing speaker recognition technology.…”
Get full text
Article -
3
A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement
Published 2014-01-01“…Accurate and effective voice activity detection (VAD) is a fundamental step for robust speech or speaker recognition. In this study, we proposed a hierarchical framework approach for VAD and speech enhancement. …”
Get full text
Article -
4
Multitask Learning with Local Attention for Tibetan Speech Recognition
Published 2020-01-01“…With an increase in task number, such as simultaneous Tibetan speech content recognition, dialect identification, and speaker recognition, the accuracy rate of a single WaveNet-CTC decreases on speech recognition. …”
Get full text
Article -
5
Phoneme analysis for multiple languages with fuzzy‐based speaker identification
Published 2022-11-01“…Also, this paper expands the research study of fuzzy models in the speaker recognition field, using a Fuzzy C‐Means and Fuzzy k‐Nearest Neighbours and comparing them with k‐Nearest Neighbours and Support Vector Machines. …”
Get full text
Article -
6
A Robust Approach for Speaker Identification Using Dialect Information
Published 2022-01-01“…The results show that the proposed framework is better in terms of average speaker recognition accuracy (84.5% identification accuracy) and consumes 39% less time for the identification of speaker.…”
Get full text
Article -
7
Enhancing depression recognition through a mixed expert model by integrating speaker-related and emotion-related features
Published 2025-02-01“…Our approach begins with a Time Delay Neural Network to pre-train a speaker-related feature extractor using a large-scale speaker recognition dataset while simultaneously pre-training a speaker’s emotion-related feature extractor with a speech emotion dataset. …”
Get full text
Article