Speech Recognition System Based on Machine Learning in Persian Language
In today's world, where speech recognition has become an integral part of our daily lives, the need for systems equipped with this technology has increased dramatically in the past few years. This research aims to locate the two selected Persian words in any given audio file. For this purpose,...
Main Authors: | Shahed Mohammadi, Niloufar Hemati, Neda Mohammadi |
---|---|
Format: | Article |
Language: | English |
Published: | REA Press, 2022-06-01 |
Series: | Computational Algorithms and Numerical Dimensions |
Subjects: | speech recognition, signal processing, object detection, neural network, deep learning |
Online Access: | https://www.journal-cand.com/article_146462_10b45d4ea5301bbf7c38edf76f6b175e.pdf |
_version_ | 1832580028949331968 |
---|---|
author | Shahed Mohammadi, Niloufar Hemati, Neda Mohammadi |
author_facet | Shahed Mohammadi, Niloufar Hemati, Neda Mohammadi |
author_sort | Shahed Mohammadi |
collection | DOAJ |
description | In today's world, where speech recognition has become an integral part of daily life, the need for systems equipped with this technology has increased dramatically in the past few years. This research aims to locate two selected Persian words in any given audio file. For this purpose, two standard and native datasets were prepared for the model, one for training and the other for testing. Both datasets were converted into images of audio waveforms. Using an object detection technique, the model extracts bounding boxes for each test audio file; each box image then passes through a CNN classifier, which returns a corresponding label. Finally, a threshold is set so that only boxes with high accuracy are displayed as output. The results showed 93% accuracy for the CNN classifier and 50% accuracy when testing the model with object detection. |
format | Article |
id | doaj-art-8fb3462b6bb8442484262bcb655e8777 |
institution | Kabale University |
issn | 2980-7646 2980-9320 |
language | English |
publishDate | 2022-06-01 |
publisher | REA Press |
record_format | Article |
series | Computational Algorithms and Numerical Dimensions |
spelling | doaj-art-8fb3462b6bb8442484262bcb655e8777; 2025-01-30T11:20:45Z; eng; REA Press; Computational Algorithms and Numerical Dimensions; 2980-7646; 2980-9320; 2022-06-01; 1; 2; 72; 83; 10.22105/cand.2022.146462; 146462; Speech Recognition System Based on Machine Learning in Persian Language; Shahed Mohammadi (Department of Computer Science and Systems Engineering, Ayandegan Institute of Higher Education, Tonekabon, Iran); Niloufar Hemati (Department of Computer Science, Islamic Azad University Central Tehran Branch, Tehran, Iran); Neda Mohammadi (Department of Industrial Engineering, Sadra University, Tehran, Iran); In today's world, where speech recognition has become an integral part of daily life, the need for systems equipped with this technology has increased dramatically in the past few years. This research aims to locate two selected Persian words in any given audio file. For this purpose, two standard and native datasets were prepared for the model, one for training and the other for testing. Both datasets were converted into images of audio waveforms. Using an object detection technique, the model extracts bounding boxes for each test audio file; each box image then passes through a CNN classifier, which returns a corresponding label. Finally, a threshold is set so that only boxes with high accuracy are displayed as output. The results showed 93% accuracy for the CNN classifier and 50% accuracy when testing the model with object detection.; https://www.journal-cand.com/article_146462_10b45d4ea5301bbf7c38edf76f6b175e.pdf; speech recognition; signal processing; object detection; neural network; deep learning |
spellingShingle | Shahed Mohammadi; Niloufar Hemati; Neda Mohammadi; Speech Recognition System Based on Machine Learning in Persian Language; Computational Algorithms and Numerical Dimensions; speech recognition; signal processing; object detection; neural network; deep learning |
title | Speech Recognition System Based on Machine Learning in Persian Language |
title_full | Speech Recognition System Based on Machine Learning in Persian Language |
title_fullStr | Speech Recognition System Based on Machine Learning in Persian Language |
title_full_unstemmed | Speech Recognition System Based on Machine Learning in Persian Language |
title_short | Speech Recognition System Based on Machine Learning in Persian Language |
title_sort | speech recognition system based on machine learning in persian language |
topic | speech recognition, signal processing, object detection, neural network, deep learning |
url | https://www.journal-cand.com/article_146462_10b45d4ea5301bbf7c38edf76f6b175e.pdf |
work_keys_str_mv | AT shahedmohammadi speechrecognitionsystembasedonmachinelearninginpersianlanguage AT niloufarhemati speechrecognitionsystembasedonmachinelearninginpersianlanguage AT nedamohammadi speechrecognitionsystembasedonmachinelearninginpersianlanguage |
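
The final step described in the abstract, keeping only the boxes that clear a threshold, can be illustrated with a minimal sketch. This is not the authors' implementation: the `Box` type, its field names, the `filter_boxes` helper, the sample word labels, and the 0.9 threshold are all hypothetical, and the sketch assumes that the "accuracy" of a box means the score the CNN classifier assigns to its label.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Box:
    """A candidate region of the waveform image (hypothetical structure)."""
    x_min: int    # left edge of the box, in pixels
    x_max: int    # right edge of the box, in pixels
    label: str    # label returned by the classifier for this box
    score: float  # classifier score in [0, 1] (assumed meaning of "accuracy")


def filter_boxes(boxes: List[Box], threshold: float = 0.9) -> List[Box]:
    """Keep only boxes whose score is at or above the threshold."""
    return [box for box in boxes if box.score >= threshold]


if __name__ == "__main__":
    # Made-up candidate boxes for one test audio image.
    candidates = [
        Box(x_min=10, x_max=80, label="word_a", score=0.97),
        Box(x_min=60, x_max=140, label="word_b", score=0.41),
        Box(x_min=200, x_max=260, label="word_b", score=0.93),
    ]
    for box in filter_boxes(candidates, threshold=0.9):
        print(box.label, box.x_min, box.x_max, box.score)
```

Raising the threshold trades missed words for fewer false detections, so in practice the value would need to be tuned on held-out data rather than fixed at the figure used here.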