CNN Based Automatic Speech Recognition: A Comparative Study

Recently, one of the most common approaches used in speech recognition is deep learning. The most advanced results have been obtained with speech recognition systems created using convolutional neural network (CNN) and recurrent neural networks (RNN). Since CNNs can capture local features effectivel...

Full description

Saved in:

Bibliographic Details
Main Authors:	Hilal Ilgaz, Beyza Akkoyun, Özlem Alpay, M. Ali Akcayol
Format:	Article
Language:	English
Published:	Ediciones Universidad de Salamanca 2024-08-01
Series:	Advances in Distributed Computing and Artificial Intelligence Journal
Subjects:	deep learning artificial neural networks speech recognition
Online Access:	https://revistas.usal.es/cinco/index.php/2255-2863/article/view/29191
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832590610190565376
author	Hilal Ilgaz Beyza Akkoyun Özlem Alpay M. Ali Akcayol
author_facet	Hilal Ilgaz Beyza Akkoyun Özlem Alpay M. Ali Akcayol
author_sort	Hilal Ilgaz
collection	DOAJ
description	Recently, one of the most common approaches used in speech recognition is deep learning. The most advanced results have been obtained with speech recognition systems created using convolutional neural network (CNN) and recurrent neural networks (RNN). Since CNNs can capture local features effectively, they are applied to tasks with relatively short-term dependencies, such as keyword detection or phoneme- level sequence recognition. This paper presents the development of a deep learning and speech command recognition system. The Google Speech Commands Dataset has been used for training. The dataset contained 65.000 one-second-long words of 30 short English words. That is, %80 of the dataset has been used in the training and %20 of the dataset has been used in the testing. The data set consists of one-second voice commands that have been converted into a spectrogram and used to train different artificial neural network (ANN) models. Various variants of CNN are used in deep learning applications. The performance of the proposed model has reached %94.60.
format	Article
id	doaj-art-735a6efc68a0465f8100c3b93c74c2b8
institution	Kabale University
issn	2255-2863
language	English
publishDate	2024-08-01
publisher	Ediciones Universidad de Salamanca
record_format	Article
series	Advances in Distributed Computing and Artificial Intelligence Journal
spelling	doaj-art-735a6efc68a0465f8100c3b93c74c2b82025-01-23T11:25:18ZengEdiciones Universidad de SalamancaAdvances in Distributed Computing and Artificial Intelligence Journal2255-28632024-08-0113e29191e2919110.14201/adcaij.2919134652CNN Based Automatic Speech Recognition: A Comparative StudyHilal Ilgaz0Beyza Akkoyun1Özlem Alpay2M. Ali Akcayol3Computer Engineering Department, University of Gazi. Ankara, TurkeyComputer Engineering Department, University of Gazi. Ankara, Turkey.Computer Engineering Department, University of Gazi. Ankara, Turkey.Computer Engineering Department, University of Gazi. Ankara, Turkey.Recently, one of the most common approaches used in speech recognition is deep learning. The most advanced results have been obtained with speech recognition systems created using convolutional neural network (CNN) and recurrent neural networks (RNN). Since CNNs can capture local features effectively, they are applied to tasks with relatively short-term dependencies, such as keyword detection or phoneme- level sequence recognition. This paper presents the development of a deep learning and speech command recognition system. The Google Speech Commands Dataset has been used for training. The dataset contained 65.000 one-second-long words of 30 short English words. That is, %80 of the dataset has been used in the training and %20 of the dataset has been used in the testing. The data set consists of one-second voice commands that have been converted into a spectrogram and used to train different artificial neural network (ANN) models. Various variants of CNN are used in deep learning applications. The performance of the proposed model has reached %94.60.https://revistas.usal.es/cinco/index.php/2255-2863/article/view/29191deep learningartificial neural networksspeech recognition
spellingShingle	Hilal Ilgaz Beyza Akkoyun Özlem Alpay M. Ali Akcayol CNN Based Automatic Speech Recognition: A Comparative Study Advances in Distributed Computing and Artificial Intelligence Journal deep learning artificial neural networks speech recognition
title	CNN Based Automatic Speech Recognition: A Comparative Study
title_full	CNN Based Automatic Speech Recognition: A Comparative Study
title_fullStr	CNN Based Automatic Speech Recognition: A Comparative Study
title_full_unstemmed	CNN Based Automatic Speech Recognition: A Comparative Study
title_short	CNN Based Automatic Speech Recognition: A Comparative Study
title_sort	cnn based automatic speech recognition a comparative study
topic	deep learning artificial neural networks speech recognition
url	https://revistas.usal.es/cinco/index.php/2255-2863/article/view/29191
work_keys_str_mv	AT hilalilgaz cnnbasedautomaticspeechrecognitionacomparativestudy AT beyzaakkoyun cnnbasedautomaticspeechrecognitionacomparativestudy AT ozlemalpay cnnbasedautomaticspeechrecognitionacomparativestudy AT maliakcayol cnnbasedautomaticspeechrecognitionacomparativestudy

CNN Based Automatic Speech Recognition: A Comparative Study

Similar Items