The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting
To improve the sound quality of speech synthesis technology in intelligent broadcasting, a deep neural network-based method is proposed. It also proved the effectiveness of the DNN discrimination s/u/v and completed the conversion of the HMM synthesis spectrum parameter to original speech. Further,...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
Wiley
2022-01-01
|
Series: | Journal of Control Science and Engineering |
Online Access: | http://dx.doi.org/10.1155/2022/1971679 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832549149321461760 |
---|---|
author | Jihong Yang |
author_facet | Jihong Yang |
author_sort | Jihong Yang |
collection | DOAJ |
description | To improve the sound quality of speech synthesis technology in intelligent broadcasting, a deep neural network-based method is proposed. It also proved the effectiveness of the DNN discrimination s/u/v and completed the conversion of the HMM synthesis spectrum parameter to original speech. Further, the scheme for transforming the parameters obtained from the temporary decomposition (TD) algorithm, DNN trains the event vectors obtained from TD decomposition, establishes the transformation model, and recombines with the untransformed event function. Experiments proved that the conversion effect of 16 dimensional parameters is not very ideal in subjective evaluation due to the fact that too few dimensions lead to insufficient spectral details, and the distortion in the process of further synthesis; the parameter conversion of 48 dimensions is slightly better than 16 dimensions, mainly due to more spectral details, but on the other hand, the influence of codebook mapping also affects the sound instability to some extent. It proves that the intelligent voice broadcast system completely solves these problems, which not only reduces construction costs, but also improves service efficiency. |
format | Article |
id | doaj-art-20dbf60c0be34fe6ab2d96a0d2da96fc |
institution | Kabale University |
issn | 1687-5257 |
language | English |
publishDate | 2022-01-01 |
publisher | Wiley |
record_format | Article |
series | Journal of Control Science and Engineering |
spelling | doaj-art-20dbf60c0be34fe6ab2d96a0d2da96fc2025-02-03T06:11:56ZengWileyJournal of Control Science and Engineering1687-52572022-01-01202210.1155/2022/1971679The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent BroadcastingJihong Yang0School of Film and Television MediaTo improve the sound quality of speech synthesis technology in intelligent broadcasting, a deep neural network-based method is proposed. It also proved the effectiveness of the DNN discrimination s/u/v and completed the conversion of the HMM synthesis spectrum parameter to original speech. Further, the scheme for transforming the parameters obtained from the temporary decomposition (TD) algorithm, DNN trains the event vectors obtained from TD decomposition, establishes the transformation model, and recombines with the untransformed event function. Experiments proved that the conversion effect of 16 dimensional parameters is not very ideal in subjective evaluation due to the fact that too few dimensions lead to insufficient spectral details, and the distortion in the process of further synthesis; the parameter conversion of 48 dimensions is slightly better than 16 dimensions, mainly due to more spectral details, but on the other hand, the influence of codebook mapping also affects the sound instability to some extent. It proves that the intelligent voice broadcast system completely solves these problems, which not only reduces construction costs, but also improves service efficiency.http://dx.doi.org/10.1155/2022/1971679 |
spellingShingle | Jihong Yang The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting Journal of Control Science and Engineering |
title | The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting |
title_full | The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting |
title_fullStr | The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting |
title_full_unstemmed | The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting |
title_short | The Application of Speech Synthesis Technology Based on Deep Neural Network in Intelligent Broadcasting |
title_sort | application of speech synthesis technology based on deep neural network in intelligent broadcasting |
url | http://dx.doi.org/10.1155/2022/1971679 |
work_keys_str_mv | AT jihongyang theapplicationofspeechsynthesistechnologybasedondeepneuralnetworkinintelligentbroadcasting AT jihongyang applicationofspeechsynthesistechnologybasedondeepneuralnetworkinintelligentbroadcasting |