Experiments on Detection of Voiced Hesitations in Russian Spontaneous Speech

The development and popularity of voice-user interfaces made spontaneous speech processing an important research field. One of the main focus areas in this field is automatic speech recognition (ASR) that enables the recognition and translation of spoken language into text by computers. However, ASR...

Full description

Saved in:
Bibliographic Details
Main Authors: Vasilisa Verkhodanova, Vladimir Shapranov
Format: Article
Language:English
Published: Wiley 2016-01-01
Series:Journal of Electrical and Computer Engineering
Online Access:http://dx.doi.org/10.1155/2016/2013658
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832560840652357632
author Vasilisa Verkhodanova
Vladimir Shapranov
author_facet Vasilisa Verkhodanova
Vladimir Shapranov
author_sort Vasilisa Verkhodanova
collection DOAJ
description The development and popularity of voice-user interfaces made spontaneous speech processing an important research field. One of the main focus areas in this field is automatic speech recognition (ASR) that enables the recognition and translation of spoken language into text by computers. However, ASR systems often work less efficiently for spontaneous than for read speech, since the former differs from any other type of speech in many ways. And the presence of speech disfluencies is its prominent characteristic. These phenomena are an important feature in human-human communication and at the same time they are a challenging obstacle for the speech processing tasks. In this paper we address an issue of voiced hesitations (filled pauses and sound lengthenings) detection in Russian spontaneous speech by utilizing different machine learning techniques, from grid search and gradient descent in rule-based approaches to such data-driven ones as ELM and SVM based on the automatically extracted acoustic features. Experimental results on the mixed and quality diverse corpus of spontaneous Russian speech indicate the efficiency of the techniques for the task in question, with SVM outperforming other methods.
format Article
id doaj-art-e97331474dca443bb6e9510e763d0d40
institution Kabale University
issn 2090-0147
2090-0155
language English
publishDate 2016-01-01
publisher Wiley
record_format Article
series Journal of Electrical and Computer Engineering
spelling doaj-art-e97331474dca443bb6e9510e763d0d402025-02-03T01:26:41ZengWileyJournal of Electrical and Computer Engineering2090-01472090-01552016-01-01201610.1155/2016/20136582013658Experiments on Detection of Voiced Hesitations in Russian Spontaneous SpeechVasilisa Verkhodanova0Vladimir Shapranov1St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences, St. Petersburg, RussiaSt. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences, St. Petersburg, RussiaThe development and popularity of voice-user interfaces made spontaneous speech processing an important research field. One of the main focus areas in this field is automatic speech recognition (ASR) that enables the recognition and translation of spoken language into text by computers. However, ASR systems often work less efficiently for spontaneous than for read speech, since the former differs from any other type of speech in many ways. And the presence of speech disfluencies is its prominent characteristic. These phenomena are an important feature in human-human communication and at the same time they are a challenging obstacle for the speech processing tasks. In this paper we address an issue of voiced hesitations (filled pauses and sound lengthenings) detection in Russian spontaneous speech by utilizing different machine learning techniques, from grid search and gradient descent in rule-based approaches to such data-driven ones as ELM and SVM based on the automatically extracted acoustic features. Experimental results on the mixed and quality diverse corpus of spontaneous Russian speech indicate the efficiency of the techniques for the task in question, with SVM outperforming other methods.http://dx.doi.org/10.1155/2016/2013658
spellingShingle Vasilisa Verkhodanova
Vladimir Shapranov
Experiments on Detection of Voiced Hesitations in Russian Spontaneous Speech
Journal of Electrical and Computer Engineering
title Experiments on Detection of Voiced Hesitations in Russian Spontaneous Speech
title_full Experiments on Detection of Voiced Hesitations in Russian Spontaneous Speech
title_fullStr Experiments on Detection of Voiced Hesitations in Russian Spontaneous Speech
title_full_unstemmed Experiments on Detection of Voiced Hesitations in Russian Spontaneous Speech
title_short Experiments on Detection of Voiced Hesitations in Russian Spontaneous Speech
title_sort experiments on detection of voiced hesitations in russian spontaneous speech
url http://dx.doi.org/10.1155/2016/2013658
work_keys_str_mv AT vasilisaverkhodanova experimentsondetectionofvoicedhesitationsinrussianspontaneousspeech
AT vladimirshapranov experimentsondetectionofvoicedhesitationsinrussianspontaneousspeech