The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting

To improve the sound quality of speech synthesis in intelligent broadcasting, a deep neural network (DNN)-based method is proposed. The study demonstrates the effectiveness of DNN-based silence/unvoiced/voiced (s/u/v) discrimination and converts HMM-synthesized spectral parameters to those of the original speech. Further,...

Bibliographic Details
Main Author:	Jihong Yang
Format:	Article
Language:	English
Published:	Wiley 2022-01-01
Series:	Journal of Control Science and Engineering
Online Access:	http://dx.doi.org/10.1155/2022/1971679

author	Jihong Yang
collection	DOAJ
description	To improve the sound quality of speech synthesis in intelligent broadcasting, a deep neural network (DNN)-based method is proposed. The study demonstrates the effectiveness of DNN-based silence/unvoiced/voiced (s/u/v) discrimination and converts HMM-synthesized spectral parameters to those of the original speech. It further presents a scheme for transforming the parameters obtained from the temporal decomposition (TD) algorithm: a DNN is trained on the event vectors produced by TD, a transformation model is established, and the transformed event vectors are recombined with the untransformed event functions. Experiments showed that converting 16-dimensional parameters performs poorly in subjective evaluation, because too few dimensions provide insufficient spectral detail and further distortion is introduced during synthesis. Converting 48-dimensional parameters is slightly better, mainly because more spectral detail is retained, although codebook mapping also makes the sound somewhat unstable. The author concludes that the intelligent voice broadcast system resolves these problems, reducing construction costs and improving service efficiency.
format	Article
id	doaj-art-20dbf60c0be34fe6ab2d96a0d2da96fc
institution	Kabale University
issn	1687-5257
language	English
publishDate	2022-01-01
publisher	Wiley
record_format	Article
series	Journal of Control Science and Engineering
spelling	doaj-art-20dbf60c0be34fe6ab2d96a0d2da96fc; 2025-02-03T06:11:56Z; eng; Wiley; Journal of Control Science and Engineering; 1687-5257; 2022-01-01; 2022; 10.1155/2022/1971679; The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting; Jihong Yang (School of Film and Television Media); http://dx.doi.org/10.1155/2022/1971679
title	The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting
url	http://dx.doi.org/10.1155/2022/1971679

Similar Items