Graphic Perception System for Visually Impaired Groups

Bibliographic Details
Main Author: Jingzi Wen
Format: Article
Language: English
Published: Wiley 2022-01-01
Series: Advances in Multimedia
Online Access: http://dx.doi.org/10.1155/2022/8437979
collection DOAJ
description In the internet age, visually impaired groups increasingly need to perceive graphic images through touch. Image object recognition is a basic task in computer vision, and in recent years deep neural networks have advanced it considerably. However, existing methods generally lose image detail and refine edges poorly, which limits the accuracy of object recognition for visually impaired groups. To address this problem, this study proposes a graphic perception system that improves the attention mechanism. The system consists mainly of three modules: a mixing attention module (MAM), an enhanced receptive field module (ERFM), and a multilevel fusion module (MLAM). MAM generates better semantic features, which guide feature fusion in the decoding process so that the aggregated features can better locate significant objects. ERFM enriches the context information of low-level features and passes the enhanced features to MLAM. MLAM uses the semantic information generated by MAM to guide the fusion of the current decoded features with the low-level features output by ERFM, and gradually recovers boundary details in a cascading manner. Finally, the proposed algorithm is compared with other algorithms on the PASCAL VOC and MS-COCO datasets. Experimental results show that the proposed method can effectively improve the accuracy of graphic object recognition.
id doaj-art-6a43f83bcc97406f8ddb7b385c2b661e
institution Kabale University
issn 1687-5699
spelling Jingzi Wen (School of Visual Arts). Graphic Perception System for Visually Impaired Groups. Advances in Multimedia 2022. Wiley, 2022-01-01. ISSN 1687-5699. http://dx.doi.org/10.1155/2022/8437979