Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform

With the continuous advancement of technology, the amount of information and knowledge disseminated on the Internet every day has been developing several times. At the same time, a large amount of bilingual data has also been produced in the real world. These data are undoubtedly a great asset for s...

Full description

Saved in:
Bibliographic Details
Main Authors: Wenbin Xu, Chengbo Yin
Format: Article
Language:English
Published: Wiley 2020-01-01
Series:Complexity
Online Access:http://dx.doi.org/10.1155/2020/5828130
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832551295275237376
author Wenbin Xu
Chengbo Yin
author_facet Wenbin Xu
Chengbo Yin
author_sort Wenbin Xu
collection DOAJ
description With the continuous advancement of technology, the amount of information and knowledge disseminated on the Internet every day has been developing several times. At the same time, a large amount of bilingual data has also been produced in the real world. These data are undoubtedly a great asset for statistical machine translation research. Based on the dual-sentence quality corpus screening, two corpus screening strategies are proposed first, based on the double-sentence pair length ratio method and the word-based alignment information method. The innovation of these two methods is that no additional linguistic resources such as bilingual dictionary and syntactic analyzer are needed as auxiliary. No manual intervention is required, and the poor quality sentence pairs can be automatically selected and can be applied to any language pair. Secondly, a domain adaptive method based on massive corpus is proposed. The method based on massive corpus utilizes massive corpus mechanism to carry out multidomain automatic model migration. In this domain, each domain learns the intradomain model independently, and different domains share the same general model. Through the method of massive corpus, these models can be combined and adjusted to make the model learning more accurate. Finally, the adaptive method of massive corpus filtering and statistical machine translation based on cloud platform is verified. Experiments show that both methods have good effects and can effectively improve the translation quality of statistical machines.
format Article
id doaj-art-f5b3953586864ff4a44edc429e050c6a
institution Kabale University
issn 1076-2787
1099-0526
language English
publishDate 2020-01-01
publisher Wiley
record_format Article
series Complexity
spelling doaj-art-f5b3953586864ff4a44edc429e050c6a2025-02-03T06:04:37ZengWileyComplexity1076-27871099-05262020-01-01202010.1155/2020/58281305828130Adaptive Language Processing Based on Deep Learning in Cloud Computing PlatformWenbin Xu0Chengbo Yin1Department of English Language and Literature, China University of Petroleum (East China), Qingdao, Shandong 266580, ChinaSchool of Data Science, Qingdao Huanghai University, Qingdao 266427, Shandong, ChinaWith the continuous advancement of technology, the amount of information and knowledge disseminated on the Internet every day has been developing several times. At the same time, a large amount of bilingual data has also been produced in the real world. These data are undoubtedly a great asset for statistical machine translation research. Based on the dual-sentence quality corpus screening, two corpus screening strategies are proposed first, based on the double-sentence pair length ratio method and the word-based alignment information method. The innovation of these two methods is that no additional linguistic resources such as bilingual dictionary and syntactic analyzer are needed as auxiliary. No manual intervention is required, and the poor quality sentence pairs can be automatically selected and can be applied to any language pair. Secondly, a domain adaptive method based on massive corpus is proposed. The method based on massive corpus utilizes massive corpus mechanism to carry out multidomain automatic model migration. In this domain, each domain learns the intradomain model independently, and different domains share the same general model. Through the method of massive corpus, these models can be combined and adjusted to make the model learning more accurate. Finally, the adaptive method of massive corpus filtering and statistical machine translation based on cloud platform is verified. Experiments show that both methods have good effects and can effectively improve the translation quality of statistical machines.http://dx.doi.org/10.1155/2020/5828130
spellingShingle Wenbin Xu
Chengbo Yin
Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform
Complexity
title Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform
title_full Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform
title_fullStr Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform
title_full_unstemmed Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform
title_short Adaptive Language Processing Based on Deep Learning in Cloud Computing Platform
title_sort adaptive language processing based on deep learning in cloud computing platform
url http://dx.doi.org/10.1155/2020/5828130
work_keys_str_mv AT wenbinxu adaptivelanguageprocessingbasedondeeplearningincloudcomputingplatform
AT chengboyin adaptivelanguageprocessingbasedondeeplearningincloudcomputingplatform