Using machine learning to study the population life quality: methodological aspects
Assessment of the population life quality is an important and relevant sociological task. Machine learning as a classification tool of social network users’ digital traces makes it possible to create a base to calculate subjective life quality index. The article consistently reviews all stages of the...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | Russian |
Published: |
State University of Management
2022-03-01
|
Series: | Цифровая социология |
Subjects: | |
Online Access: | https://digitalsociology.guu.ru/jour/article/view/132 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832540670122786816 |
---|---|
author | E. V. Shchekotin В. Л. Гойко P. A. Basina B. B. Bakulin |
author_facet | E. V. Shchekotin В. Л. Гойко P. A. Basina B. B. Bakulin |
author_sort | E. V. Shchekotin |
collection | DOAJ |
description | Assessment of the population life quality is an important and relevant sociological task. Machine learning as a classification tool of social network users’ digital traces makes it possible to create a base to calculate subjective life quality index. The article consistently reviews all stages of the machine learning algorithms application to assess the life quality of the population of the regions of the Russian Federation and the issues of improving neural network accuracy. To train the neural network the authors formed a set of marked-up data extracted from regional communities of the social network “VKontakte”. Various approaches to text vectorisation, publicly available neural network models pre-trained on large Russian-language text corpora, as well as metrics for evaluating the algorithms results were analysed. Computational experiments with different algorithms were carried out, according to the results of which the Rubert-tiny algorithm was selected due to its high learning and classification rate. During the model parameters adjustment, the accuracy of f1-macro 0.545 was achieved. Computational experiments were carried out using Python scripts.Typical errors that a neural network makes in the process of automatic content classification were considered. The results of the study can be used to calculate the online activity index in the VKontakte social network of users from various Russian regions, on the basis of which the subjective life quality index will be calculated in the future. Improving the neural network accuracy will make it possible to obtain more reliable data for assessing the life quality in Russian regions based on users’ digital traces. |
format | Article |
id | doaj-art-64e0f5998a9d444e93fc281d7d2a5032 |
institution | Kabale University |
issn | 2658-347X 2713-1653 |
language | Russian |
publishDate | 2022-03-01 |
publisher | State University of Management |
record_format | Article |
series | Цифровая социология |
spelling | doaj-art-64e0f5998a9d444e93fc281d7d2a50322025-02-04T16:32:34ZrusState University of ManagementЦифровая социология2658-347X2713-16532022-03-0151879710.26425/2658-347X-2022-5-1-87-9789Using machine learning to study the population life quality: methodological aspectsE. V. Shchekotin0В. Л. Гойко1P. A. Basina2B. B. Bakulin3Novosibirsk State University of Economics and ManagementNational Research Tomsk State UniversityNational Research Tomsk State UniversityNational Research Tomsk State UniversityAssessment of the population life quality is an important and relevant sociological task. Machine learning as a classification tool of social network users’ digital traces makes it possible to create a base to calculate subjective life quality index. The article consistently reviews all stages of the machine learning algorithms application to assess the life quality of the population of the regions of the Russian Federation and the issues of improving neural network accuracy. To train the neural network the authors formed a set of marked-up data extracted from regional communities of the social network “VKontakte”. Various approaches to text vectorisation, publicly available neural network models pre-trained on large Russian-language text corpora, as well as metrics for evaluating the algorithms results were analysed. Computational experiments with different algorithms were carried out, according to the results of which the Rubert-tiny algorithm was selected due to its high learning and classification rate. During the model parameters adjustment, the accuracy of f1-macro 0.545 was achieved. Computational experiments were carried out using Python scripts.Typical errors that a neural network makes in the process of automatic content classification were considered. The results of the study can be used to calculate the online activity index in the VKontakte social network of users from various Russian regions, on the basis of which the subjective life quality index will be calculated in the future. Improving the neural network accuracy will make it possible to obtain more reliable data for assessing the life quality in Russian regions based on users’ digital traces.https://digitalsociology.guu.ru/jour/article/view/132life qualitywell-beingdigital methodsnon-reactive methodsdigital tracessocial networksvkontaktemachine learningtext classifications |
spellingShingle | E. V. Shchekotin В. Л. Гойко P. A. Basina B. B. Bakulin Using machine learning to study the population life quality: methodological aspects Цифровая социология life quality well-being digital methods non-reactive methods digital traces social networks vkontakte machine learning text classifications |
title | Using machine learning to study the population life quality: methodological aspects |
title_full | Using machine learning to study the population life quality: methodological aspects |
title_fullStr | Using machine learning to study the population life quality: methodological aspects |
title_full_unstemmed | Using machine learning to study the population life quality: methodological aspects |
title_short | Using machine learning to study the population life quality: methodological aspects |
title_sort | using machine learning to study the population life quality methodological aspects |
topic | life quality well-being digital methods non-reactive methods digital traces social networks vkontakte machine learning text classifications |
url | https://digitalsociology.guu.ru/jour/article/view/132 |
work_keys_str_mv | AT evshchekotin usingmachinelearningtostudythepopulationlifequalitymethodologicalaspects AT vlgojko usingmachinelearningtostudythepopulationlifequalitymethodologicalaspects AT pabasina usingmachinelearningtostudythepopulationlifequalitymethodologicalaspects AT bbbakulin usingmachinelearningtostudythepopulationlifequalitymethodologicalaspects |