A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement

Accurate and effective voice activity detection (VAD) is a fundamental step for robust speech or speaker recognition. In this study, we proposed a hierarchical framework approach for VAD and speech enhancement. The modified Wiener filter (MWF) approach is utilized for noise reduction in the speech e...

Full description

Saved in:
Bibliographic Details
Main Authors: Yan Zhang, Zhen-min Tang, Yan-ping Li, Yang Luo
Format: Article
Language:English
Published: Wiley 2014-01-01
Series:The Scientific World Journal
Online Access:http://dx.doi.org/10.1155/2014/723643
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832562621569564672
author Yan Zhang
Zhen-min Tang
Yan-ping Li
Yang Luo
author_facet Yan Zhang
Zhen-min Tang
Yan-ping Li
Yang Luo
author_sort Yan Zhang
collection DOAJ
description Accurate and effective voice activity detection (VAD) is a fundamental step for robust speech or speaker recognition. In this study, we proposed a hierarchical framework approach for VAD and speech enhancement. The modified Wiener filter (MWF) approach is utilized for noise reduction in the speech enhancement block. For the feature selection and voting block, several discriminating features were employed in a voting paradigm for the consideration of reliability and discriminative power. Effectiveness of the proposed approach is compared and evaluated to other VAD techniques by using two well-known databases, namely, TIMIT database and NOISEX-92 database. Experimental results show that the proposed method performs well under a variety of noisy conditions.
format Article
id doaj-art-94102a58d18542b5afa0528d64a8fb2f
institution Kabale University
issn 2356-6140
1537-744X
language English
publishDate 2014-01-01
publisher Wiley
record_format Article
series The Scientific World Journal
spelling doaj-art-94102a58d18542b5afa0528d64a8fb2f2025-02-03T01:22:14ZengWileyThe Scientific World Journal2356-61401537-744X2014-01-01201410.1155/2014/723643723643A Hierarchical Framework Approach for Voice Activity Detection and Speech EnhancementYan Zhang0Zhen-min Tang1Yan-ping Li2Yang Luo3College of Computer Science and Technology, Nanjing University of Science and Technology (NUST), Nanjing 210094, ChinaCollege of Computer Science and Technology, Nanjing University of Science and Technology (NUST), Nanjing 210094, ChinaCollege of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications (NUPT), Nanjing 210046, ChinaCollege of Information Technology, Jinling Institute of Technology (JIT), Nanjing 211169, ChinaAccurate and effective voice activity detection (VAD) is a fundamental step for robust speech or speaker recognition. In this study, we proposed a hierarchical framework approach for VAD and speech enhancement. The modified Wiener filter (MWF) approach is utilized for noise reduction in the speech enhancement block. For the feature selection and voting block, several discriminating features were employed in a voting paradigm for the consideration of reliability and discriminative power. Effectiveness of the proposed approach is compared and evaluated to other VAD techniques by using two well-known databases, namely, TIMIT database and NOISEX-92 database. Experimental results show that the proposed method performs well under a variety of noisy conditions.http://dx.doi.org/10.1155/2014/723643
spellingShingle Yan Zhang
Zhen-min Tang
Yan-ping Li
Yang Luo
A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement
The Scientific World Journal
title A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement
title_full A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement
title_fullStr A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement
title_full_unstemmed A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement
title_short A Hierarchical Framework Approach for Voice Activity Detection and Speech Enhancement
title_sort hierarchical framework approach for voice activity detection and speech enhancement
url http://dx.doi.org/10.1155/2014/723643
work_keys_str_mv AT yanzhang ahierarchicalframeworkapproachforvoiceactivitydetectionandspeechenhancement
AT zhenmintang ahierarchicalframeworkapproachforvoiceactivitydetectionandspeechenhancement
AT yanpingli ahierarchicalframeworkapproachforvoiceactivitydetectionandspeechenhancement
AT yangluo ahierarchicalframeworkapproachforvoiceactivitydetectionandspeechenhancement
AT yanzhang hierarchicalframeworkapproachforvoiceactivitydetectionandspeechenhancement
AT zhenmintang hierarchicalframeworkapproachforvoiceactivitydetectionandspeechenhancement
AT yanpingli hierarchicalframeworkapproachforvoiceactivitydetectionandspeechenhancement
AT yangluo hierarchicalframeworkapproachforvoiceactivitydetectionandspeechenhancement