Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform

With the continuous advancement of technology, the amount of information and knowledge disseminated on the Internet every day has been developing several times. At the same time, a large amount of bilingual data has also been produced in the real world. These data are undoubtedly a great asset for s...

Full description

Saved in:

Bibliographic Details
Main Authors:	Wenbin Xu, Chengbo Yin
Format:	Article
Language:	English
Published:	Wiley 2020-01-01
Series:	Complexity
Online Access:	http://dx.doi.org/10.1155/2020/5828130
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832551295275237376
author	Wenbin Xu Chengbo Yin
author_facet	Wenbin Xu Chengbo Yin
author_sort	Wenbin Xu
collection	DOAJ
description	With the continuous advancement of technology, the amount of information and knowledge disseminated on the Internet every day has been developing several times. At the same time, a large amount of bilingual data has also been produced in the real world. These data are undoubtedly a great asset for statistical machine translation research. Based on the dual-sentence quality corpus screening, two corpus screening strategies are proposed first, based on the double-sentence pair length ratio method and the word-based alignment information method. The innovation of these two methods is that no additional linguistic resources such as bilingual dictionary and syntactic analyzer are needed as auxiliary. No manual intervention is required, and the poor quality sentence pairs can be automatically selected and can be applied to any language pair. Secondly, a domain adaptive method based on massive corpus is proposed. The method based on massive corpus utilizes massive corpus mechanism to carry out multidomain automatic model migration. In this domain, each domain learns the intradomain model independently, and different domains share the same general model. Through the method of massive corpus, these models can be combined and adjusted to make the model learning more accurate. Finally, the adaptive method of massive corpus filtering and statistical machine translation based on cloud platform is verified. Experiments show that both methods have good effects and can effectively improve the translation quality of statistical machines.
format	Article
id	doaj-art-f5b3953586864ff4a44edc429e050c6a
institution	Kabale University
issn	1076-2787 1099-0526
language	English
publishDate	2020-01-01
publisher	Wiley
record_format	Article
series	Complexity
spelling	doaj-art-f5b3953586864ff4a44edc429e050c6a2025-02-03T06:04:37ZengWileyComplexity1076-27871099-05262020-01-01202010.1155/2020/58281305828130Adaptive Language Processing Based on Deep Learning in Cloud Computing PlatformWenbin Xu0Chengbo Yin1Department of English Language and Literature, China University of Petroleum (East China), Qingdao, Shandong 266580, ChinaSchool of Data Science, Qingdao Huanghai University, Qingdao 266427, Shandong, ChinaWith the continuous advancement of technology, the amount of information and knowledge disseminated on the Internet every day has been developing several times. At the same time, a large amount of bilingual data has also been produced in the real world. These data are undoubtedly a great asset for statistical machine translation research. Based on the dual-sentence quality corpus screening, two corpus screening strategies are proposed first, based on the double-sentence pair length ratio method and the word-based alignment information method. The innovation of these two methods is that no additional linguistic resources such as bilingual dictionary and syntactic analyzer are needed as auxiliary. No manual intervention is required, and the poor quality sentence pairs can be automatically selected and can be applied to any language pair. Secondly, a domain adaptive method based on massive corpus is proposed. The method based on massive corpus utilizes massive corpus mechanism to carry out multidomain automatic model migration. In this domain, each domain learns the intradomain model independently, and different domains share the same general model. Through the method of massive corpus, these models can be combined and adjusted to make the model learning more accurate. Finally, the adaptive method of massive corpus filtering and statistical machine translation based on cloud platform is verified. Experiments show that both methods have good effects and can effectively improve the translation quality of statistical machines.http://dx.doi.org/10.1155/2020/5828130
spellingShingle	Wenbin Xu Chengbo Yin Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform Complexity
title	Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform
title_full	Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform
title_fullStr	Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform
title_full_unstemmed	Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform
title_short	Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform
title_sort	adaptive language processing based on deep learning in cloud computing platform
url	http://dx.doi.org/10.1155/2020/5828130
work_keys_str_mv	AT wenbinxu adaptivelanguageprocessingbasedondeeplearningincloudcomputingplatform AT chengboyin adaptivelanguageprocessingbasedondeeplearningincloudcomputingplatform

Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform

Similar Items