The Relative Weight of Temporal Envelope Cues in Different Frequency Regions for Mandarin Sentence Recognition

Acoustic temporal envelope (E) cues containing speech information are distributed across the frequency spectrum. To investigate the relative weight of E cues in different frequency regions for Mandarin sentence recognition, E information was extracted from 30 contiguous bands across the range of 80–...

Full description

Saved in:
Bibliographic Details
Main Authors: Yang Guo, Yuanyuan Sun, Yanmei Feng, Yujun Zhang, Shankai Yin
Format: Article
Language:English
Published: Wiley 2017-01-01
Series:Neural Plasticity
Online Access:http://dx.doi.org/10.1155/2017/7416727
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832563682908831744
author Yang Guo
Yuanyuan Sun
Yanmei Feng
Yujun Zhang
Shankai Yin
author_facet Yang Guo
Yuanyuan Sun
Yanmei Feng
Yujun Zhang
Shankai Yin
author_sort Yang Guo
collection DOAJ
description Acoustic temporal envelope (E) cues containing speech information are distributed across the frequency spectrum. To investigate the relative weight of E cues in different frequency regions for Mandarin sentence recognition, E information was extracted from 30 contiguous bands across the range of 80–7,562 Hz using Hilbert decomposition and then allocated to five frequency regions. Recognition scores were obtained with acoustic E cues from 1 or 2 random regions from 40 normal-hearing listeners. While the recognition scores ranged from 8.2% to 16.3% when E information from only one region was available, the scores ranged from 57.9% to 87.7% when E information from two frequency regions was presented, suggesting a synergistic effect among the temporal E cues in different frequency regions. Next, the relative contributions of the E information from the five frequency regions to sentence perception were computed using a least-squares approach. The results demonstrated that, for Mandarin Chinese, a tonal language, the temporal E cues of Frequency Region 1 (80–502 Hz) and Region 3 (1,022–1,913 Hz) contributed more to the intelligence of sentence recognition than other regions, particularly the region of 80–502 Hz, which contained fundamental frequency (F0) information.
format Article
id doaj-art-0a33beec512846bdb865e3ef1be56074
institution Kabale University
issn 2090-5904
1687-5443
language English
publishDate 2017-01-01
publisher Wiley
record_format Article
series Neural Plasticity
spelling doaj-art-0a33beec512846bdb865e3ef1be560742025-02-03T01:12:42ZengWileyNeural Plasticity2090-59041687-54432017-01-01201710.1155/2017/74167277416727The Relative Weight of Temporal Envelope Cues in Different Frequency Regions for Mandarin Sentence RecognitionYang Guo0Yuanyuan Sun1Yanmei Feng2Yujun Zhang3Shankai Yin4Department of Otolaryngology Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, No. 600, Yishan Road, Xuhui District, Shanghai 200233, ChinaDepartment of Otolaryngology Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, No. 600, Yishan Road, Xuhui District, Shanghai 200233, ChinaDepartment of Otolaryngology Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, No. 600, Yishan Road, Xuhui District, Shanghai 200233, ChinaDepartment of Otolaryngology Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, No. 600, Yishan Road, Xuhui District, Shanghai 200233, ChinaDepartment of Otolaryngology Head and Neck Surgery, Shanghai Jiao Tong University Affiliated Sixth People’s Hospital, No. 600, Yishan Road, Xuhui District, Shanghai 200233, ChinaAcoustic temporal envelope (E) cues containing speech information are distributed across the frequency spectrum. To investigate the relative weight of E cues in different frequency regions for Mandarin sentence recognition, E information was extracted from 30 contiguous bands across the range of 80–7,562 Hz using Hilbert decomposition and then allocated to five frequency regions. Recognition scores were obtained with acoustic E cues from 1 or 2 random regions from 40 normal-hearing listeners. While the recognition scores ranged from 8.2% to 16.3% when E information from only one region was available, the scores ranged from 57.9% to 87.7% when E information from two frequency regions was presented, suggesting a synergistic effect among the temporal E cues in different frequency regions. Next, the relative contributions of the E information from the five frequency regions to sentence perception were computed using a least-squares approach. The results demonstrated that, for Mandarin Chinese, a tonal language, the temporal E cues of Frequency Region 1 (80–502 Hz) and Region 3 (1,022–1,913 Hz) contributed more to the intelligence of sentence recognition than other regions, particularly the region of 80–502 Hz, which contained fundamental frequency (F0) information.http://dx.doi.org/10.1155/2017/7416727
spellingShingle Yang Guo
Yuanyuan Sun
Yanmei Feng
Yujun Zhang
Shankai Yin
The Relative Weight of Temporal Envelope Cues in Different Frequency Regions for Mandarin Sentence Recognition
Neural Plasticity
title The Relative Weight of Temporal Envelope Cues in Different Frequency Regions for Mandarin Sentence Recognition
title_full The Relative Weight of Temporal Envelope Cues in Different Frequency Regions for Mandarin Sentence Recognition
title_fullStr The Relative Weight of Temporal Envelope Cues in Different Frequency Regions for Mandarin Sentence Recognition
title_full_unstemmed The Relative Weight of Temporal Envelope Cues in Different Frequency Regions for Mandarin Sentence Recognition
title_short The Relative Weight of Temporal Envelope Cues in Different Frequency Regions for Mandarin Sentence Recognition
title_sort relative weight of temporal envelope cues in different frequency regions for mandarin sentence recognition
url http://dx.doi.org/10.1155/2017/7416727
work_keys_str_mv AT yangguo therelativeweightoftemporalenvelopecuesindifferentfrequencyregionsformandarinsentencerecognition
AT yuanyuansun therelativeweightoftemporalenvelopecuesindifferentfrequencyregionsformandarinsentencerecognition
AT yanmeifeng therelativeweightoftemporalenvelopecuesindifferentfrequencyregionsformandarinsentencerecognition
AT yujunzhang therelativeweightoftemporalenvelopecuesindifferentfrequencyregionsformandarinsentencerecognition
AT shankaiyin therelativeweightoftemporalenvelopecuesindifferentfrequencyregionsformandarinsentencerecognition
AT yangguo relativeweightoftemporalenvelopecuesindifferentfrequencyregionsformandarinsentencerecognition
AT yuanyuansun relativeweightoftemporalenvelopecuesindifferentfrequencyregionsformandarinsentencerecognition
AT yanmeifeng relativeweightoftemporalenvelopecuesindifferentfrequencyregionsformandarinsentencerecognition
AT yujunzhang relativeweightoftemporalenvelopecuesindifferentfrequencyregionsformandarinsentencerecognition
AT shankaiyin relativeweightoftemporalenvelopecuesindifferentfrequencyregionsformandarinsentencerecognition