ALGORITHMS FOR IDENTIFICATION OF CUES WITH AUTHORS’ TEXT INSERTIONS IN BELARUSIAN ELECTRONIC BOOKS

The main stages of algorithms for characters’ gender identification in Belarusian electronic texts are described. The algorithms are based on punctuation marking and gender indicators detection, such as past tense verbs and nouns with gender attributes. For indicators, special dictionaries are devel...

Full description

Saved in:
Bibliographic Details
Main Authors: Y. S. Hetsevich, T,. I. Okrut, B. M. Lobanov
Format: Article
Language:Russian
Published: National Academy of Sciences of Belarus, the United Institute of Informatics Problems 2016-10-01
Series:Informatika
Online Access:https://inf.grid.by/jour/article/view/129
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832543133532946432
author Y. S. Hetsevich
T,. I. Okrut
B. M. Lobanov
author_facet Y. S. Hetsevich
T,. I. Okrut
B. M. Lobanov
author_sort Y. S. Hetsevich
collection DOAJ
description The main stages of algorithms for characters’ gender identification in Belarusian electronic texts are described. The algorithms are based on punctuation marking and gender indicators detection, such as past tense verbs and nouns with gender attributes. For indicators, special dictionaries are developed, thus making the algorithms more language-independent and allowing to create dictionaries for cognate languages. Testing showed the following results: the mean harmonic quantity for masculine gender detection makes up 92,2 %, and for feminine gender detection – 90,4%.
format Article
id doaj-art-124d2a20bdfb4363b601a890aecdac49
institution Kabale University
issn 1816-0301
language Russian
publishDate 2016-10-01
publisher National Academy of Sciences of Belarus, the United Institute of Informatics Problems
record_format Article
series Informatika
spelling doaj-art-124d2a20bdfb4363b601a890aecdac492025-02-03T11:51:49ZrusNational Academy of Sciences of Belarus, the United Institute of Informatics ProblemsInformatika1816-03012016-10-01016876128ALGORITHMS FOR IDENTIFICATION OF CUES WITH AUTHORS’ TEXT INSERTIONS IN BELARUSIAN ELECTRONIC BOOKSY. S. Hetsevich0T,. I. Okrut1B. M. Lobanov2Аб’яднаны інстытут праблем інфарматыкі НАН БеларусіАб’яднаны інстытут праблем інфарматыкі НАН БеларусіАб’яднаны інстытут праблем інфарматыкі НАН БеларусіThe main stages of algorithms for characters’ gender identification in Belarusian electronic texts are described. The algorithms are based on punctuation marking and gender indicators detection, such as past tense verbs and nouns with gender attributes. For indicators, special dictionaries are developed, thus making the algorithms more language-independent and allowing to create dictionaries for cognate languages. Testing showed the following results: the mean harmonic quantity for masculine gender detection makes up 92,2 %, and for feminine gender detection – 90,4%.https://inf.grid.by/jour/article/view/129
spellingShingle Y. S. Hetsevich
T,. I. Okrut
B. M. Lobanov
ALGORITHMS FOR IDENTIFICATION OF CUES WITH AUTHORS’ TEXT INSERTIONS IN BELARUSIAN ELECTRONIC BOOKS
Informatika
title ALGORITHMS FOR IDENTIFICATION OF CUES WITH AUTHORS’ TEXT INSERTIONS IN BELARUSIAN ELECTRONIC BOOKS
title_full ALGORITHMS FOR IDENTIFICATION OF CUES WITH AUTHORS’ TEXT INSERTIONS IN BELARUSIAN ELECTRONIC BOOKS
title_fullStr ALGORITHMS FOR IDENTIFICATION OF CUES WITH AUTHORS’ TEXT INSERTIONS IN BELARUSIAN ELECTRONIC BOOKS
title_full_unstemmed ALGORITHMS FOR IDENTIFICATION OF CUES WITH AUTHORS’ TEXT INSERTIONS IN BELARUSIAN ELECTRONIC BOOKS
title_short ALGORITHMS FOR IDENTIFICATION OF CUES WITH AUTHORS’ TEXT INSERTIONS IN BELARUSIAN ELECTRONIC BOOKS
title_sort algorithms for identification of cues with authors text insertions in belarusian electronic books
url https://inf.grid.by/jour/article/view/129
work_keys_str_mv AT yshetsevich algorithmsforidentificationofcueswithauthorstextinsertionsinbelarusianelectronicbooks
AT tiokrut algorithmsforidentificationofcueswithauthorstextinsertionsinbelarusianelectronicbooks
AT bmlobanov algorithmsforidentificationofcueswithauthorstextinsertionsinbelarusianelectronicbooks