Using machine learning to study the population life quality: methodological aspects

Assessment of the population life quality is an important and relevant sociological task. Machine learning as a classification tool of social network users’ digital traces makes it possible to create a base to calculate subjective life quality index. The article consistently reviews all stages of the...

Full description

Saved in:
Bibliographic Details
Main Authors: E. V. Shchekotin, В. Л. Гойко, P. A. Basina, B. B. Bakulin
Format: Article
Language:Russian
Published: State University of Management 2022-03-01
Series:Цифровая социология
Subjects:
Online Access:https://digitalsociology.guu.ru/jour/article/view/132
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832540670122786816
author E. V. Shchekotin
В. Л. Гойко
P. A. Basina
B. B. Bakulin
author_facet E. V. Shchekotin
В. Л. Гойко
P. A. Basina
B. B. Bakulin
author_sort E. V. Shchekotin
collection DOAJ
description Assessment of the population life quality is an important and relevant sociological task. Machine learning as a classification tool of social network users’ digital traces makes it possible to create a base to calculate subjective life quality index. The article consistently reviews all stages of the machine learning algorithms application to assess the life quality of the population of the regions of the Russian Federation and the issues of improving neural network accuracy. To train the neural network the authors formed a set of marked-up data extracted from regional communities of the social network “VKontakte”. Various approaches to text vectorisation, publicly available neural network models pre-trained on large Russian-language text corpora, as well as metrics for evaluating the algorithms results were analysed. Computational experiments with different algorithms were carried out, according to the results of which the Rubert-tiny algorithm was selected due to its high learning and classification rate. During the model parameters adjustment, the accuracy of f1-macro 0.545 was achieved. Computational experiments were carried out using Python scripts.Typical errors that a neural network makes in the process of automatic content classification were considered. The results of the study can be used to calculate the online activity index in the VKontakte social network of users from various Russian regions, on the basis of which the subjective life quality index will be calculated in the future. Improving the neural network accuracy will make it possible to obtain more reliable data for assessing the life quality in Russian regions based on users’ digital traces.
format Article
id doaj-art-64e0f5998a9d444e93fc281d7d2a5032
institution Kabale University
issn 2658-347X
2713-1653
language Russian
publishDate 2022-03-01
publisher State University of Management
record_format Article
series Цифровая социология
spelling doaj-art-64e0f5998a9d444e93fc281d7d2a50322025-02-04T16:32:34ZrusState University of ManagementЦифровая социология2658-347X2713-16532022-03-0151879710.26425/2658-347X-2022-5-1-87-9789Using machine learning to study the population life quality: methodological aspectsE. V. Shchekotin0В. Л. Гойко1P. A. Basina2B. B. Bakulin3Novosibirsk State University of Economics and ManagementNational Research Tomsk State UniversityNational Research Tomsk State UniversityNational Research Tomsk State UniversityAssessment of the population life quality is an important and relevant sociological task. Machine learning as a classification tool of social network users’ digital traces makes it possible to create a base to calculate subjective life quality index. The article consistently reviews all stages of the machine learning algorithms application to assess the life quality of the population of the regions of the Russian Federation and the issues of improving neural network accuracy. To train the neural network the authors formed a set of marked-up data extracted from regional communities of the social network “VKontakte”. Various approaches to text vectorisation, publicly available neural network models pre-trained on large Russian-language text corpora, as well as metrics for evaluating the algorithms results were analysed. Computational experiments with different algorithms were carried out, according to the results of which the Rubert-tiny algorithm was selected due to its high learning and classification rate. During the model parameters adjustment, the accuracy of f1-macro 0.545 was achieved. Computational experiments were carried out using Python scripts.Typical errors that a neural network makes in the process of automatic content classification were considered. The results of the study can be used to calculate the online activity index in the VKontakte social network of users from various Russian regions, on the basis of which the subjective life quality index will be calculated in the future. Improving the neural network accuracy will make it possible to obtain more reliable data for assessing the life quality in Russian regions based on users’ digital traces.https://digitalsociology.guu.ru/jour/article/view/132life qualitywell-beingdigital methodsnon-reactive methodsdigital tracessocial networksvkontaktemachine learningtext classifications
spellingShingle E. V. Shchekotin
В. Л. Гойко
P. A. Basina
B. B. Bakulin
Using machine learning to study the population life quality: methodological aspects
Цифровая социология
life quality
well-being
digital methods
non-reactive methods
digital traces
social networks
vkontakte
machine learning
text classifications
title Using machine learning to study the population life quality: methodological aspects
title_full Using machine learning to study the population life quality: methodological aspects
title_fullStr Using machine learning to study the population life quality: methodological aspects
title_full_unstemmed Using machine learning to study the population life quality: methodological aspects
title_short Using machine learning to study the population life quality: methodological aspects
title_sort using machine learning to study the population life quality methodological aspects
topic life quality
well-being
digital methods
non-reactive methods
digital traces
social networks
vkontakte
machine learning
text classifications
url https://digitalsociology.guu.ru/jour/article/view/132
work_keys_str_mv AT evshchekotin usingmachinelearningtostudythepopulationlifequalitymethodologicalaspects
AT vlgojko usingmachinelearningtostudythepopulationlifequalitymethodologicalaspects
AT pabasina usingmachinelearningtostudythepopulationlifequalitymethodologicalaspects
AT bbbakulin usingmachinelearningtostudythepopulationlifequalitymethodologicalaspects