LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES
The article focuses on the problems existing in text-to-speech synthesis. Different morphological, lexical and syntactical elements were localized with the help of the Belarusian unit of NooJ program. Those types of errors, which occur in Belarusian texts, were analyzed and corrected. Language model...
Saved in:
Main Authors: | , |
---|---|
Format: | Article |
Language: | Russian |
Published: |
National Academy of Sciences of Belarus, the United Institute of Informatics Problems
2017-12-01
|
Series: | Informatika |
Online Access: | https://inf.grid.by/jour/article/view/241 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832543205401296896 |
---|---|
author | Yu. S. Hetsevich I. V. Reentovich |
author_facet | Yu. S. Hetsevich I. V. Reentovich |
author_sort | Yu. S. Hetsevich |
collection | DOAJ |
description | The article focuses on the problems existing in text-to-speech synthesis. Different morphological, lexical and syntactical elements were localized with the help of the Belarusian unit of NooJ program. Those types of errors, which occur in Belarusian texts, were analyzed and corrected. Language model and part of speech tagging model were built. The natural language processing of Belarusian corpus with the help of developed algorithm using machine learning was carried out. The precision of developed models of machine learning has been 80–90 %. The dictionary was enriched with new words for the further using it in the systems of Belarusian speech synthesis. |
format | Article |
id | doaj-art-25daab8456b843e29d73ec8619a3911c |
institution | Kabale University |
issn | 1816-0301 |
language | Russian |
publishDate | 2017-12-01 |
publisher | National Academy of Sciences of Belarus, the United Institute of Informatics Problems |
record_format | Article |
series | Informatika |
spelling | doaj-art-25daab8456b843e29d73ec8619a3911c2025-02-03T11:51:43ZrusNational Academy of Sciences of Belarus, the United Institute of Informatics ProblemsInformatika1816-03012017-12-0104(56)7077234LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUESYu. S. Hetsevich0I. V. Reentovich1United Institute of Informatics Problems, National Academy of Sciences of BelarusUnited Institute of Informatics Problems, National Academy of Sciences of BelarusThe article focuses on the problems existing in text-to-speech synthesis. Different morphological, lexical and syntactical elements were localized with the help of the Belarusian unit of NooJ program. Those types of errors, which occur in Belarusian texts, were analyzed and corrected. Language model and part of speech tagging model were built. The natural language processing of Belarusian corpus with the help of developed algorithm using machine learning was carried out. The precision of developed models of machine learning has been 80–90 %. The dictionary was enriched with new words for the further using it in the systems of Belarusian speech synthesis.https://inf.grid.by/jour/article/view/241 |
spellingShingle | Yu. S. Hetsevich I. V. Reentovich LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES Informatika |
title | LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES |
title_full | LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES |
title_fullStr | LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES |
title_full_unstemmed | LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES |
title_short | LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES |
title_sort | linguistic analysis for the belarusian corpus with the application of natural language processing and machine learning techniques |
url | https://inf.grid.by/jour/article/view/241 |
work_keys_str_mv | AT yushetsevich linguisticanalysisforthebelarusiancorpuswiththeapplicationofnaturallanguageprocessingandmachinelearningtechniques AT ivreentovich linguisticanalysisforthebelarusiancorpuswiththeapplicationofnaturallanguageprocessingandmachinelearningtechniques |