The structure of the local detector of the reprint model of the object in the image

Currently, methods for recognizing objects in images work poorly and use intellectually unsatisfactory methods. The existing identification systems and methods do not completely solve the problem of identification, namely, identification in difficult conditions: interference, lighting, various chang...

Full description

Saved in:

Bibliographic Details
Main Author:	A. A. Kulikov
Format:	Article
Language:	Russian
Published:	MIREA - Russian Technological University 2021-10-01
Series:	Российский технологический журнал
Subjects:	neural network image recognition pattern recognition identification model
Online Access:	https://www.rtj-mirea.ru/jour/article/view/363
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832543410228035584
author	A. A. Kulikov
author_facet	A. A. Kulikov
author_sort	A. A. Kulikov
collection	DOAJ
description	Currently, methods for recognizing objects in images work poorly and use intellectually unsatisfactory methods. The existing identification systems and methods do not completely solve the problem of identification, namely, identification in difficult conditions: interference, lighting, various changes on the face, etc. To solve these problems, a local detector for a reprint model of an object in an image was developed and described. A transforming autocoder (TA), a model of a neural network, was developed for the local detector. This neural network model is a subspecies of the general class of neural networks of reduced dimension. The local detector is able, in addition to determining the modified object, to determine the original shape of the object as well. A special feature of TA is the representation of image sections in a compact form and the evaluation of the parameters of the affine transformation. The transforming autocoder is a heterogeneous network (HS) consisting of a set of networks of smaller dimension. These networks are called capsules. Artificial neural networks should use local capsules that perform some rather complex internal calculations on their inputs, and then encapsulate the results of these calculations in a small vector of highly informative outputs. Each capsule learns to recognize an implicitly defined visual object in a limited area of viewing conditions and deformations. It outputs both the probability that the object is present in its limited area and a set of “instance parameters” that can include the exact pose, lighting, and deformation of the visual object relative to an implicitly defined canonical version of this object. The main advantage of capsules that output instance parameters is a simple way to recognize entire objects by recognizing their parts. The capsule can learn to display the pose of its visual object in a vector that is linearly related to the “natural” representations of the pose that are used in computer graphics. There is a simple and highly selective test for whether visual objects represented by two active capsules A and B have the correct spatial relationships for activating a higher-level capsule C. The transforming autoencoder solves the problem of identifying facial images in conditions of interference (noise), changes in illumination and angle.
format	Article
id	doaj-art-2a31a4f27a3e4e05a5cdf20768248282
institution	Kabale University
issn	2500-316X
language	Russian
publishDate	2021-10-01
publisher	MIREA - Russian Technological University
record_format	Article
series	Российский технологический журнал
spelling	doaj-art-2a31a4f27a3e4e05a5cdf207682482822025-02-03T11:45:50ZrusMIREA - Russian Technological UniversityРоссийский технологический журнал2500-316X2021-10-019571310.32362/2500-316X-2021-9-5-7-13278The structure of the local detector of the reprint model of the object in the imageA. A. Kulikov0IREA – Russian Technological UniversityCurrently, methods for recognizing objects in images work poorly and use intellectually unsatisfactory methods. The existing identification systems and methods do not completely solve the problem of identification, namely, identification in difficult conditions: interference, lighting, various changes on the face, etc. To solve these problems, a local detector for a reprint model of an object in an image was developed and described. A transforming autocoder (TA), a model of a neural network, was developed for the local detector. This neural network model is a subspecies of the general class of neural networks of reduced dimension. The local detector is able, in addition to determining the modified object, to determine the original shape of the object as well. A special feature of TA is the representation of image sections in a compact form and the evaluation of the parameters of the affine transformation. The transforming autocoder is a heterogeneous network (HS) consisting of a set of networks of smaller dimension. These networks are called capsules. Artificial neural networks should use local capsules that perform some rather complex internal calculations on their inputs, and then encapsulate the results of these calculations in a small vector of highly informative outputs. Each capsule learns to recognize an implicitly defined visual object in a limited area of viewing conditions and deformations. It outputs both the probability that the object is present in its limited area and a set of “instance parameters” that can include the exact pose, lighting, and deformation of the visual object relative to an implicitly defined canonical version of this object. The main advantage of capsules that output instance parameters is a simple way to recognize entire objects by recognizing their parts. The capsule can learn to display the pose of its visual object in a vector that is linearly related to the “natural” representations of the pose that are used in computer graphics. There is a simple and highly selective test for whether visual objects represented by two active capsules A and B have the correct spatial relationships for activating a higher-level capsule C. The transforming autoencoder solves the problem of identifying facial images in conditions of interference (noise), changes in illumination and angle.https://www.rtj-mirea.ru/jour/article/view/363neural networkimage recognitionpattern recognitionidentification model
spellingShingle	A. A. Kulikov The structure of the local detector of the reprint model of the object in the image Российский технологический журнал neural network image recognition pattern recognition identification model
title	The structure of the local detector of the reprint model of the object in the image
title_full	The structure of the local detector of the reprint model of the object in the image
title_fullStr	The structure of the local detector of the reprint model of the object in the image
title_full_unstemmed	The structure of the local detector of the reprint model of the object in the image
title_short	The structure of the local detector of the reprint model of the object in the image
title_sort	structure of the local detector of the reprint model of the object in the image
topic	neural network image recognition pattern recognition identification model
url	https://www.rtj-mirea.ru/jour/article/view/363
work_keys_str_mv	AT aakulikov thestructureofthelocaldetectorofthereprintmodeloftheobjectintheimage AT aakulikov structureofthelocaldetectorofthereprintmodeloftheobjectintheimage

The structure of the local detector of the reprint model of the object in the image

Similar Items