A Robust Approach for Speaker Identification Using Dialect Information

The present research is an effort to enhance the performance of voice processing systems, in our case the speaker identification system (SIS) by addressing the variability caused by the dialectical variations of a language. We present an effective solution to reduce dialect-related variability from...

Full description

Saved in:
Bibliographic Details
Main Authors: Shahid Munir Shah, Muhammad Moinuddin, Rizwan Ahmed Khan
Format: Article
Language:English
Published: Wiley 2022-01-01
Series:Applied Computational Intelligence and Soft Computing
Online Access:http://dx.doi.org/10.1155/2022/4980920
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832549624748965888
author Shahid Munir Shah
Muhammad Moinuddin
Rizwan Ahmed Khan
author_facet Shahid Munir Shah
Muhammad Moinuddin
Rizwan Ahmed Khan
author_sort Shahid Munir Shah
collection DOAJ
description The present research is an effort to enhance the performance of voice processing systems, in our case the speaker identification system (SIS) by addressing the variability caused by the dialectical variations of a language. We present an effective solution to reduce dialect-related variability from voice processing systems. The proposed method minimizes the system’s complexity by reducing search space during the testing process of speaker identification. The speaker is searched from the set of speakers of the identified dialect instead of all the speakers present in system training. The study is conducted on the Pashto language, and the voice data samples are collected from native Pashto speakers of specific regions of Pakistan and Afghanistan where Pashto is spoken with different dialectal variations. The task of speaker identification is achieved with the help of a novel hierarchical framework that works in two steps. In the first step, the speaker’s dialect is identified. For automated dialect identification, the spectral and prosodic features have been used in conjunction with Gaussian mixture model (GMM). In the second step, the speaker is identified using a multilayer perceptron (MLP)-based speaker identification system, which gets aggregated input from the first step, i.e., dialect identification along with prosodic and spectral features. The robustness of the proposed SIS is compared with traditional state-of-the-art methods in the literature. The results show that the proposed framework is better in terms of average speaker recognition accuracy (84.5% identification accuracy) and consumes 39% less time for the identification of speaker.
format Article
id doaj-art-fc610d32fecd4be5b7e90604f344be26
institution Kabale University
issn 1687-9732
language English
publishDate 2022-01-01
publisher Wiley
record_format Article
series Applied Computational Intelligence and Soft Computing
spelling doaj-art-fc610d32fecd4be5b7e90604f344be262025-02-03T06:10:54ZengWileyApplied Computational Intelligence and Soft Computing1687-97322022-01-01202210.1155/2022/4980920A Robust Approach for Speaker Identification Using Dialect InformationShahid Munir Shah0Muhammad Moinuddin1Rizwan Ahmed Khan2Faculty of ITCenter of Excellence in Intelligent Engineering SystemsFaculty of ITThe present research is an effort to enhance the performance of voice processing systems, in our case the speaker identification system (SIS) by addressing the variability caused by the dialectical variations of a language. We present an effective solution to reduce dialect-related variability from voice processing systems. The proposed method minimizes the system’s complexity by reducing search space during the testing process of speaker identification. The speaker is searched from the set of speakers of the identified dialect instead of all the speakers present in system training. The study is conducted on the Pashto language, and the voice data samples are collected from native Pashto speakers of specific regions of Pakistan and Afghanistan where Pashto is spoken with different dialectal variations. The task of speaker identification is achieved with the help of a novel hierarchical framework that works in two steps. In the first step, the speaker’s dialect is identified. For automated dialect identification, the spectral and prosodic features have been used in conjunction with Gaussian mixture model (GMM). In the second step, the speaker is identified using a multilayer perceptron (MLP)-based speaker identification system, which gets aggregated input from the first step, i.e., dialect identification along with prosodic and spectral features. The robustness of the proposed SIS is compared with traditional state-of-the-art methods in the literature. The results show that the proposed framework is better in terms of average speaker recognition accuracy (84.5% identification accuracy) and consumes 39% less time for the identification of speaker.http://dx.doi.org/10.1155/2022/4980920
spellingShingle Shahid Munir Shah
Muhammad Moinuddin
Rizwan Ahmed Khan
A Robust Approach for Speaker Identification Using Dialect Information
Applied Computational Intelligence and Soft Computing
title A Robust Approach for Speaker Identification Using Dialect Information
title_full A Robust Approach for Speaker Identification Using Dialect Information
title_fullStr A Robust Approach for Speaker Identification Using Dialect Information
title_full_unstemmed A Robust Approach for Speaker Identification Using Dialect Information
title_short A Robust Approach for Speaker Identification Using Dialect Information
title_sort robust approach for speaker identification using dialect information
url http://dx.doi.org/10.1155/2022/4980920
work_keys_str_mv AT shahidmunirshah arobustapproachforspeakeridentificationusingdialectinformation
AT muhammadmoinuddin arobustapproachforspeakeridentificationusingdialectinformation
AT rizwanahmedkhan arobustapproachforspeakeridentificationusingdialectinformation
AT shahidmunirshah robustapproachforspeakeridentificationusingdialectinformation
AT muhammadmoinuddin robustapproachforspeakeridentificationusingdialectinformation
AT rizwanahmedkhan robustapproachforspeakeridentificationusingdialectinformation