Graphic Perception System for Visually Impaired Groups

In the age of the internet, demand among visually impaired groups to perceive graphic images through the sense of touch is growing steadily. Image object recognition is a basic task in the field of computer vision. In recent years, deep neural networks have promoted the development of image ob...

Bibliographic Details
Main Author:	Jingzi Wen
Format:	Article
Language:	English
Published:	Wiley 2022-01-01
Series:	Advances in Multimedia
Online Access:	http://dx.doi.org/10.1155/2022/8437979

_version_	1832545857844543488
author	Jingzi Wen
author_facet	Jingzi Wen
author_sort	Jingzi Wen
collection	DOAJ
description	In the age of the internet, demand among visually impaired groups to perceive graphic images through the sense of touch is growing steadily. Image object recognition is a basic task in the field of computer vision. In recent years, deep neural networks have promoted the development of image object recognition. However, existing methods generally suffer from loss of image detail and poor edge refinement, which limits the accuracy of object recognition for visually impaired groups. To solve this problem, this study proposes a graphic perception system that improves the attention mechanism. The system consists mainly of three modules: a mixing attention module (MAM), an enhanced receptive field module (ERFM), and a multilevel fusion module (MLAM). MAM generates better semantic features, which are used to guide feature fusion in the decoding process so that the aggregated features can better locate significant objects. ERFM enriches the context information of low-level features and passes the enhanced features to MLAM. MLAM uses the semantic information generated by MAM to guide the fusion of the current decoded features with the low-level features output by ERFM, and gradually recovers boundary details in a cascading manner. Finally, the proposed algorithm is compared with other algorithms on the PASCAL VOC and MS-COCO datasets. Experimental results show that the proposed method can effectively improve the accuracy of graphic object recognition.
format	Article
id	doaj-art-6a43f83bcc97406f8ddb7b385c2b661e
institution	Kabale University
issn	1687-5699
language	English
publishDate	2022-01-01
publisher	Wiley
record_format	Article
series	Advances in Multimedia
spelling	doaj-art-6a43f83bcc97406f8ddb7b385c2b661e | 2025-02-03T07:24:26Z | eng | Wiley | Advances in Multimedia | 1687-5699 | 2022-01-01 | 2022 | 10.1155/2022/8437979 | Graphic Perception System for Visually Impaired Groups | Jingzi Wen (School of Visual Arts) | http://dx.doi.org/10.1155/2022/8437979
spellingShingle	Jingzi Wen Graphic Perception System for Visually Impaired Groups Advances in Multimedia
title	Graphic Perception System for Visually Impaired Groups
title_sort	graphic perception system for visually impaired groups
url	http://dx.doi.org/10.1155/2022/8437979
work_keys_str_mv	AT jingziwen graphicperceptionsystemforvisuallyimpairedgroups

Similar Items