The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting

To improve the sound quality of speech synthesis technology in intelligent broadcasting, a deep neural network-based method is proposed. It also proved the effectiveness of the DNN discrimination s/u/v and completed the conversion of the HMM synthesis spectrum parameter to original speech. Further,...

Full description

Saved in:
Bibliographic Details
Main Author: Jihong Yang
Format: Article
Language:English
Published: Wiley 2022-01-01
Series:Journal of Control Science and Engineering
Online Access:http://dx.doi.org/10.1155/2022/1971679
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832549149321461760
author Jihong Yang
author_facet Jihong Yang
author_sort Jihong Yang
collection DOAJ
description To improve the sound quality of speech synthesis technology in intelligent broadcasting, a deep neural network-based method is proposed. It also proved the effectiveness of the DNN discrimination s/u/v and completed the conversion of the HMM synthesis spectrum parameter to original speech. Further, the scheme for transforming the parameters obtained from the temporary decomposition (TD) algorithm, DNN trains the event vectors obtained from TD decomposition, establishes the transformation model, and recombines with the untransformed event function. Experiments proved that the conversion effect of 16 dimensional parameters is not very ideal in subjective evaluation due to the fact that too few dimensions lead to insufficient spectral details, and the distortion in the process of further synthesis; the parameter conversion of 48 dimensions is slightly better than 16 dimensions, mainly due to more spectral details, but on the other hand, the influence of codebook mapping also affects the sound instability to some extent. It proves that the intelligent voice broadcast system completely solves these problems, which not only reduces construction costs, but also improves service efficiency.
format Article
id doaj-art-20dbf60c0be34fe6ab2d96a0d2da96fc
institution Kabale University
issn 1687-5257
language English
publishDate 2022-01-01
publisher Wiley
record_format Article
series Journal of Control Science and Engineering
spelling doaj-art-20dbf60c0be34fe6ab2d96a0d2da96fc2025-02-03T06:11:56ZengWileyJournal of Control Science and Engineering1687-52572022-01-01202210.1155/2022/1971679The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent BroadcastingJihong Yang0School of Film and Television MediaTo improve the sound quality of speech synthesis technology in intelligent broadcasting, a deep neural network-based method is proposed. It also proved the effectiveness of the DNN discrimination s/u/v and completed the conversion of the HMM synthesis spectrum parameter to original speech. Further, the scheme for transforming the parameters obtained from the temporary decomposition (TD) algorithm, DNN trains the event vectors obtained from TD decomposition, establishes the transformation model, and recombines with the untransformed event function. Experiments proved that the conversion effect of 16 dimensional parameters is not very ideal in subjective evaluation due to the fact that too few dimensions lead to insufficient spectral details, and the distortion in the process of further synthesis; the parameter conversion of 48 dimensions is slightly better than 16 dimensions, mainly due to more spectral details, but on the other hand, the influence of codebook mapping also affects the sound instability to some extent. It proves that the intelligent voice broadcast system completely solves these problems, which not only reduces construction costs, but also improves service efficiency.http://dx.doi.org/10.1155/2022/1971679
spellingShingle Jihong Yang
The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting
Journal of Control Science and Engineering
title The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting
title_full The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting
title_fullStr The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting
title_full_unstemmed The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting
title_short The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting
title_sort application of speech synthesis technology based on deep neural network in intelligent broadcasting
url http://dx.doi.org/10.1155/2022/1971679
work_keys_str_mv AT jihongyang theapplicationofspeechsynthesistechnologybasedondeepneuralnetworkinintelligentbroadcasting
AT jihongyang applicationofspeechsynthesistechnologybasedondeepneuralnetworkinintelligentbroadcasting