Face super-resolution via iterative collaboration between multi-attention mechanism and landmark estimation

Abstract Face super-resolution technology can significantly enhance the resolution and quality of face images, which is crucial for applications such as surveillance, forensics, and face recognition. However, existing methods often fail to fully utilize multi-scale information and facial priors, res...

Full description

Saved in:
Bibliographic Details
Main Authors: Chang-Teng Shi, Meng-Jun Li, Zhi Yong An
Format: Article
Language:English
Published: Springer 2024-12-01
Series:Complex & Intelligent Systems
Subjects:
Online Access:https://doi.org/10.1007/s40747-024-01673-z
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832571182268809216
author Chang-Teng Shi
Meng-Jun Li
Zhi Yong An
author_facet Chang-Teng Shi
Meng-Jun Li
Zhi Yong An
author_sort Chang-Teng Shi
collection DOAJ
description Abstract Face super-resolution technology can significantly enhance the resolution and quality of face images, which is crucial for applications such as surveillance, forensics, and face recognition. However, existing methods often fail to fully utilize multi-scale information and facial priors, resulting in poor recovery of facial structures in complex images. To address this issue, we propose a face super-resolution method based on iterative collaboration between a facial reconstruction network and a landmark estimation network. This method employs a Multi-Convolutional Attention Block for multi-scale feature extraction, and an Attention Fusion Block is introduced to enhance features using facial priors. Subsequently, features are further refined using a Residual Window Attention Group. Furthermore, the method involves iterative collaboration between the facial reconstruction network and the landmark estimation network. At each step, landmark priors are used to generate higher quality images, which are then utilized for improved landmark estimation, thereby gradually enhancing performance. Through evaluation of the standard 4 $$\times $$ × , 8 $$\times $$ × , and 16 $$\times $$ × super-resolution tasks on the CelebA and Helen datasets, This method demonstrates strong performance and achieves competitive scores on SSIM, PSNR, and LPIPS metrics. Specifically, in the 8 $$\times $$ × super-resolution experiment, the PSNR/SSIM/LPIPS on CelebA dataset is 27.68dB/ 0.8112/0.0866, outperforming existing state-of-the-art methods in terms of both accuracy and visual quality.
format Article
id doaj-art-8ed74e4b84194d40b6492725cec296c4
institution Kabale University
issn 2199-4536
2198-6053
language English
publishDate 2024-12-01
publisher Springer
record_format Article
series Complex & Intelligent Systems
spelling doaj-art-8ed74e4b84194d40b6492725cec296c42025-02-02T12:48:42ZengSpringerComplex & Intelligent Systems2199-45362198-60532024-12-0111111910.1007/s40747-024-01673-zFace super-resolution via iterative collaboration between multi-attention mechanism and landmark estimationChang-Teng Shi0Meng-Jun Li1Zhi Yong An2College of Computer Science and Technology, Shandong Technology and Business UniversityCollege of Computer Science and Technology, Shandong Technology and Business UniversityCollege of Computer Science and Technology, Shandong Technology and Business UniversityAbstract Face super-resolution technology can significantly enhance the resolution and quality of face images, which is crucial for applications such as surveillance, forensics, and face recognition. However, existing methods often fail to fully utilize multi-scale information and facial priors, resulting in poor recovery of facial structures in complex images. To address this issue, we propose a face super-resolution method based on iterative collaboration between a facial reconstruction network and a landmark estimation network. This method employs a Multi-Convolutional Attention Block for multi-scale feature extraction, and an Attention Fusion Block is introduced to enhance features using facial priors. Subsequently, features are further refined using a Residual Window Attention Group. Furthermore, the method involves iterative collaboration between the facial reconstruction network and the landmark estimation network. At each step, landmark priors are used to generate higher quality images, which are then utilized for improved landmark estimation, thereby gradually enhancing performance. Through evaluation of the standard 4 $$\times $$ × , 8 $$\times $$ × , and 16 $$\times $$ × super-resolution tasks on the CelebA and Helen datasets, This method demonstrates strong performance and achieves competitive scores on SSIM, PSNR, and LPIPS metrics. Specifically, in the 8 $$\times $$ × super-resolution experiment, the PSNR/SSIM/LPIPS on CelebA dataset is 27.68dB/ 0.8112/0.0866, outperforming existing state-of-the-art methods in terms of both accuracy and visual quality.https://doi.org/10.1007/s40747-024-01673-zMulti-scale informationPrior knowledgeMulti-attentionIterative collaboration
spellingShingle Chang-Teng Shi
Meng-Jun Li
Zhi Yong An
Face super-resolution via iterative collaboration between multi-attention mechanism and landmark estimation
Complex & Intelligent Systems
Multi-scale information
Prior knowledge
Multi-attention
Iterative collaboration
title Face super-resolution via iterative collaboration between multi-attention mechanism and landmark estimation
title_full Face super-resolution via iterative collaboration between multi-attention mechanism and landmark estimation
title_fullStr Face super-resolution via iterative collaboration between multi-attention mechanism and landmark estimation
title_full_unstemmed Face super-resolution via iterative collaboration between multi-attention mechanism and landmark estimation
title_short Face super-resolution via iterative collaboration between multi-attention mechanism and landmark estimation
title_sort face super resolution via iterative collaboration between multi attention mechanism and landmark estimation
topic Multi-scale information
Prior knowledge
Multi-attention
Iterative collaboration
url https://doi.org/10.1007/s40747-024-01673-z
work_keys_str_mv AT changtengshi facesuperresolutionviaiterativecollaborationbetweenmultiattentionmechanismandlandmarkestimation
AT mengjunli facesuperresolutionviaiterativecollaborationbetweenmultiattentionmechanismandlandmarkestimation
AT zhiyongan facesuperresolutionviaiterativecollaborationbetweenmultiattentionmechanismandlandmarkestimation