Speech Recognition System Based on Machine Learning in Persian Language

In today's world, where speech recognition has become an integral part of our daily lives, the need for systems equipped with this technology has increased dramatically in the past few years. This research aims to locate the two selected Persian words in any given audio file. For this purpose,...

Full description

Saved in:
Bibliographic Details
Main Authors: Shahed Mohammadi, Niloufar Hemati, Neda Mohammadi
Format: Article
Language:English
Published: REA Press 2022-06-01
Series:Computational Algorithms and Numerical Dimensions
Subjects:
Online Access:https://www.journal-cand.com/article_146462_10b45d4ea5301bbf7c38edf76f6b175e.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832580028949331968
author Shahed Mohammadi
Niloufar Hemati
Neda Mohammadi
author_facet Shahed Mohammadi
Niloufar Hemati
Neda Mohammadi
author_sort Shahed Mohammadi
collection DOAJ
description In today's world, where speech recognition has become an integral part of our daily lives, the need for systems equipped with this technology has increased dramatically in the past few years. This research aims to locate the two selected Persian words in any given audio file. For this purpose, two standard and native datasets were prepared for this model one for train and the other for the test. Both datasets were converted into images of audio waveforms. Using the object detection technique, the model could extract different bounding boxes for each test audio, and then each box image goes through a CNN classifier and returns a corresponding label. Finally, a threshold is set so that only boxes with high accuracy are displayed as output. The results showed 93% accuracy for the CNN classifier and 50% accuracy for testing the model with object detection.
format Article
id doaj-art-8fb3462b6bb8442484262bcb655e8777
institution Kabale University
issn 2980-7646
2980-9320
language English
publishDate 2022-06-01
publisher REA Press
record_format Article
series Computational Algorithms and Numerical Dimensions
spelling doaj-art-8fb3462b6bb8442484262bcb655e87772025-01-30T11:20:45ZengREA PressComputational Algorithms and Numerical Dimensions2980-76462980-93202022-06-0112728310.22105/cand.2022.146462146462Speech Recognition System Based on Machine Learning in Persian LanguageShahed Mohammadi0Niloufar Hemati1Neda Mohammadi2Department of Computer Since and Systems Engineering, Ayandegan Institute of Higher Education, Tonekabon, Iran.Department of Computer Science, Islamic Azad University Central Tehran Branch, Tehran, Iran.Department of Industrial Engineering, Sadra University, Tehran, Iran.In today's world, where speech recognition has become an integral part of our daily lives, the need for systems equipped with this technology has increased dramatically in the past few years. This research aims to locate the two selected Persian words in any given audio file. For this purpose, two standard and native datasets were prepared for this model one for train and the other for the test. Both datasets were converted into images of audio waveforms. Using the object detection technique, the model could extract different bounding boxes for each test audio, and then each box image goes through a CNN classifier and returns a corresponding label. Finally, a threshold is set so that only boxes with high accuracy are displayed as output. The results showed 93% accuracy for the CNN classifier and 50% accuracy for testing the model with object detection.https://www.journal-cand.com/article_146462_10b45d4ea5301bbf7c38edf76f6b175e.pdfspeech recognitionsignal processingobject detectionneural networkdeep learning
spellingShingle Shahed Mohammadi
Niloufar Hemati
Neda Mohammadi
Speech Recognition System Based on Machine Learning in Persian Language
Computational Algorithms and Numerical Dimensions
speech recognition
signal processing
object detection
neural network
deep learning
title Speech Recognition System Based on Machine Learning in Persian Language
title_full Speech Recognition System Based on Machine Learning in Persian Language
title_fullStr Speech Recognition System Based on Machine Learning in Persian Language
title_full_unstemmed Speech Recognition System Based on Machine Learning in Persian Language
title_short Speech Recognition System Based on Machine Learning in Persian Language
title_sort speech recognition system based on machine learning in persian language
topic speech recognition
signal processing
object detection
neural network
deep learning
url https://www.journal-cand.com/article_146462_10b45d4ea5301bbf7c38edf76f6b175e.pdf
work_keys_str_mv AT shahedmohammadi speechrecognitionsystembasedonmachinelearninginpersianlanguage
AT niloufarhemati speechrecognitionsystembasedonmachinelearninginpersianlanguage
AT nedamohammadi speechrecognitionsystembasedonmachinelearninginpersianlanguage