A Robust Approach for Speaker Identification Using Dialect Information
The present research is an effort to enhance the performance of voice processing systems, in our case the speaker identification system (SIS) by addressing the variability caused by the dialectical variations of a language. We present an effective solution to reduce dialect-related variability from...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2022-01-01
|
Series: | Applied Computational Intelligence and Soft Computing |
Online Access: | http://dx.doi.org/10.1155/2022/4980920 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832549624748965888 |
---|---|
author | Shahid Munir Shah Muhammad Moinuddin Rizwan Ahmed Khan |
author_facet | Shahid Munir Shah Muhammad Moinuddin Rizwan Ahmed Khan |
author_sort | Shahid Munir Shah |
collection | DOAJ |
description | The present research is an effort to enhance the performance of voice processing systems, in our case the speaker identification system (SIS) by addressing the variability caused by the dialectical variations of a language. We present an effective solution to reduce dialect-related variability from voice processing systems. The proposed method minimizes the system’s complexity by reducing search space during the testing process of speaker identification. The speaker is searched from the set of speakers of the identified dialect instead of all the speakers present in system training. The study is conducted on the Pashto language, and the voice data samples are collected from native Pashto speakers of specific regions of Pakistan and Afghanistan where Pashto is spoken with different dialectal variations. The task of speaker identification is achieved with the help of a novel hierarchical framework that works in two steps. In the first step, the speaker’s dialect is identified. For automated dialect identification, the spectral and prosodic features have been used in conjunction with Gaussian mixture model (GMM). In the second step, the speaker is identified using a multilayer perceptron (MLP)-based speaker identification system, which gets aggregated input from the first step, i.e., dialect identification along with prosodic and spectral features. The robustness of the proposed SIS is compared with traditional state-of-the-art methods in the literature. The results show that the proposed framework is better in terms of average speaker recognition accuracy (84.5% identification accuracy) and consumes 39% less time for the identification of speaker. |
format | Article |
id | doaj-art-fc610d32fecd4be5b7e90604f344be26 |
institution | Kabale University |
issn | 1687-9732 |
language | English |
publishDate | 2022-01-01 |
publisher | Wiley |
record_format | Article |
series | Applied Computational Intelligence and Soft Computing |
spelling | doaj-art-fc610d32fecd4be5b7e90604f344be262025-02-03T06:10:54ZengWileyApplied Computational Intelligence and Soft Computing1687-97322022-01-01202210.1155/2022/4980920A Robust Approach for Speaker Identification Using Dialect InformationShahid Munir Shah0Muhammad Moinuddin1Rizwan Ahmed Khan2Faculty of ITCenter of Excellence in Intelligent Engineering SystemsFaculty of ITThe present research is an effort to enhance the performance of voice processing systems, in our case the speaker identification system (SIS) by addressing the variability caused by the dialectical variations of a language. We present an effective solution to reduce dialect-related variability from voice processing systems. The proposed method minimizes the system’s complexity by reducing search space during the testing process of speaker identification. The speaker is searched from the set of speakers of the identified dialect instead of all the speakers present in system training. The study is conducted on the Pashto language, and the voice data samples are collected from native Pashto speakers of specific regions of Pakistan and Afghanistan where Pashto is spoken with different dialectal variations. The task of speaker identification is achieved with the help of a novel hierarchical framework that works in two steps. In the first step, the speaker’s dialect is identified. For automated dialect identification, the spectral and prosodic features have been used in conjunction with Gaussian mixture model (GMM). In the second step, the speaker is identified using a multilayer perceptron (MLP)-based speaker identification system, which gets aggregated input from the first step, i.e., dialect identification along with prosodic and spectral features. The robustness of the proposed SIS is compared with traditional state-of-the-art methods in the literature. The results show that the proposed framework is better in terms of average speaker recognition accuracy (84.5% identification accuracy) and consumes 39% less time for the identification of speaker.http://dx.doi.org/10.1155/2022/4980920 |
spellingShingle | Shahid Munir Shah Muhammad Moinuddin Rizwan Ahmed Khan A Robust Approach for Speaker Identification Using Dialect Information Applied Computational Intelligence and Soft Computing |
title | A Robust Approach for Speaker Identification Using Dialect Information |
title_full | A Robust Approach for Speaker Identification Using Dialect Information |
title_fullStr | A Robust Approach for Speaker Identification Using Dialect Information |
title_full_unstemmed | A Robust Approach for Speaker Identification Using Dialect Information |
title_short | A Robust Approach for Speaker Identification Using Dialect Information |
title_sort | robust approach for speaker identification using dialect information |
url | http://dx.doi.org/10.1155/2022/4980920 |
work_keys_str_mv | AT shahidmunirshah arobustapproachforspeakeridentificationusingdialectinformation AT muhammadmoinuddin arobustapproachforspeakeridentificationusingdialectinformation AT rizwanahmedkhan arobustapproachforspeakeridentificationusingdialectinformation AT shahidmunirshah robustapproachforspeakeridentificationusingdialectinformation AT muhammadmoinuddin robustapproachforspeakeridentificationusingdialectinformation AT rizwanahmedkhan robustapproachforspeakeridentificationusingdialectinformation |