LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES

The article focuses on the problems existing in text-to-speech synthesis. Different morphological, lexical and syntactical elements were localized with the help of the Belarusian unit of NooJ program. Those types of errors, which occur in Belarusian texts, were analyzed and corrected. Language model...

Full description

Saved in:
Bibliographic Details
Main Authors: Yu. S. Hetsevich, I. V. Reentovich
Format: Article
Language:Russian
Published: National Academy of Sciences of Belarus, the United Institute of Informatics Problems 2017-12-01
Series:Informatika
Online Access:https://inf.grid.by/jour/article/view/241
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832543205401296896
author Yu. S. Hetsevich
I. V. Reentovich
author_facet Yu. S. Hetsevich
I. V. Reentovich
author_sort Yu. S. Hetsevich
collection DOAJ
description The article focuses on the problems existing in text-to-speech synthesis. Different morphological, lexical and syntactical elements were localized with the help of the Belarusian unit of NooJ program. Those types of errors, which occur in Belarusian texts, were analyzed and corrected. Language model and part of speech tagging model were built. The natural language processing of Belarusian corpus with the help of developed algorithm using machine learning was carried out. The precision of developed models of machine learning has been 80–90 %. The dictionary was enriched with new words for the further using it in the systems of Belarusian speech synthesis.
format Article
id doaj-art-25daab8456b843e29d73ec8619a3911c
institution Kabale University
issn 1816-0301
language Russian
publishDate 2017-12-01
publisher National Academy of Sciences of Belarus, the United Institute of Informatics Problems
record_format Article
series Informatika
spelling doaj-art-25daab8456b843e29d73ec8619a3911c2025-02-03T11:51:43ZrusNational Academy of Sciences of Belarus, the United Institute of Informatics ProblemsInformatika1816-03012017-12-0104(56)7077234LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUESYu. S. Hetsevich0I. V. Reentovich1United Institute of Informatics Problems, National Academy of Sciences of BelarusUnited Institute of Informatics Problems, National Academy of Sciences of BelarusThe article focuses on the problems existing in text-to-speech synthesis. Different morphological, lexical and syntactical elements were localized with the help of the Belarusian unit of NooJ program. Those types of errors, which occur in Belarusian texts, were analyzed and corrected. Language model and part of speech tagging model were built. The natural language processing of Belarusian corpus with the help of developed algorithm using machine learning was carried out. The precision of developed models of machine learning has been 80–90 %. The dictionary was enriched with new words for the further using it in the systems of Belarusian speech synthesis.https://inf.grid.by/jour/article/view/241
spellingShingle Yu. S. Hetsevich
I. V. Reentovich
LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES
Informatika
title LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES
title_full LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES
title_fullStr LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES
title_full_unstemmed LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES
title_short LINGUISTIC ANALYSIS FOR THE BELARUSIAN CORPUS WITH THE APPLICATION OF NATURAL LANGUAGE PROCESSING AND MACHINE LEARNING TECHNIQUES
title_sort linguistic analysis for the belarusian corpus with the application of natural language processing and machine learning techniques
url https://inf.grid.by/jour/article/view/241
work_keys_str_mv AT yushetsevich linguisticanalysisforthebelarusiancorpuswiththeapplicationofnaturallanguageprocessingandmachinelearningtechniques
AT ivreentovich linguisticanalysisforthebelarusiancorpuswiththeapplicationofnaturallanguageprocessingandmachinelearningtechniques