ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images

The reflective characteristics of remote sensing image information depend on the scale of the observed area, with high-resolution images providing more detailed feature information. Currently, monitoring refined industries and extracting regional information necessitate higher-resolution remote sens...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yifeng Yang, Hengqian Zhao, Xiadan Huangfu, Zihan Li, Pan Wang
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:	Downscaling remote sensing super-resolution (SR) transformer vision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model
Online Access:	https://ieeexplore.ieee.org/document/10836746/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1832582364111306752
author	Yifeng Yang Hengqian Zhao Xiadan Huangfu Zihan Li Pan Wang
author_facet	Yifeng Yang Hengqian Zhao Xiadan Huangfu Zihan Li Pan Wang
author_sort	Yifeng Yang
collection	DOAJ
description	The reflective characteristics of remote sensing image information depend on the scale of the observed area, with high-resolution images providing more detailed feature information. Currently, monitoring refined industries and extracting regional information necessitate higher-resolution remote sensing images. Super-resolution reconstruction of remote sensing multispectral images not only enhances the spatial resolution of these images but also preserves and improves the spectral information of multispectral data, thereby providing richer ground object information and more accurate environmental monitoring data. To improve the effectiveness of feature extraction in the generator network while maintaining model efficiency, this article proposes the vision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model. This model is an improvement upon the original SRGAN super-resolution image reconstruction method, incorporating lightweight network modules, channel attention modules, spatial-spectral residual attention, and the vision transformer structure. The ViT-ISRGAN model focuses on reconstructing four types of typical ground objects based on Sentinel-2 images: urban, water, farmland, and forest. Results indicate that the ViT-ISRGAN model excels in capturing texture details and color restoration, effectively extracting spectral and texture information from multispectral remote sensing images across various scenes. Compared to other super-resolution (SR) models, this approach demonstrates superior effectiveness and performance in the SR tasks of remote sensing multispectral images.
format	Article
id	doaj-art-31e59eff5cd448898cf9c5723194b881
institution	Kabale University
issn	1939-1404 2151-1535
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling	doaj-art-31e59eff5cd448898cf9c5723194b8812025-01-30T00:00:16ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing1939-14042151-15352025-01-01183973398810.1109/JSTARS.2025.352722610836746ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing ImagesYifeng Yang0https://orcid.org/0009-0001-6018-6690Hengqian Zhao1https://orcid.org/0000-0003-0776-142XXiadan Huangfu2Zihan Li3Pan Wang4https://orcid.org/0000-0003-2846-9882College of Geoscience and Surveying Engineering, China University of Mining and Technology, Beijing, ChinaCollege of Geoscience and Surveying Engineering, China University of Mining and Technology, Beijing, ChinaCollege of Geoscience and Surveying Engineering, China University of Mining and Technology, Beijing, ChinaCollege of Geoscience and Surveying Engineering, China University of Mining and Technology, Beijing, ChinaCollege of Geoscience and Surveying Engineering, China University of Mining and Technology, Beijing, ChinaThe reflective characteristics of remote sensing image information depend on the scale of the observed area, with high-resolution images providing more detailed feature information. Currently, monitoring refined industries and extracting regional information necessitate higher-resolution remote sensing images. Super-resolution reconstruction of remote sensing multispectral images not only enhances the spatial resolution of these images but also preserves and improves the spectral information of multispectral data, thereby providing richer ground object information and more accurate environmental monitoring data. To improve the effectiveness of feature extraction in the generator network while maintaining model efficiency, this article proposes the vision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model. This model is an improvement upon the original SRGAN super-resolution image reconstruction method, incorporating lightweight network modules, channel attention modules, spatial-spectral residual attention, and the vision transformer structure. The ViT-ISRGAN model focuses on reconstructing four types of typical ground objects based on Sentinel-2 images: urban, water, farmland, and forest. Results indicate that the ViT-ISRGAN model excels in capturing texture details and color restoration, effectively extracting spectral and texture information from multispectral remote sensing images across various scenes. Compared to other super-resolution (SR) models, this approach demonstrates superior effectiveness and performance in the SR tasks of remote sensing multispectral images.https://ieeexplore.ieee.org/document/10836746/Downscalingremote sensingsuper-resolution (SR)transformervision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model
spellingShingle	Yifeng Yang Hengqian Zhao Xiadan Huangfu Zihan Li Pan Wang ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing Downscaling remote sensing super-resolution (SR) transformer vision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model
title	ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images
title_full	ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images
title_fullStr	ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images
title_full_unstemmed	ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images
title_short	ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images
title_sort	vit isrgan a high quality super resolution reconstruction method for multispectral remote sensing images
topic	Downscaling remote sensing super-resolution (SR) transformer vision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model
url	https://ieeexplore.ieee.org/document/10836746/
work_keys_str_mv	AT yifengyang vitisrganahighqualitysuperresolutionreconstructionmethodformultispectralremotesensingimages AT hengqianzhao vitisrganahighqualitysuperresolutionreconstructionmethodformultispectralremotesensingimages AT xiadanhuangfu vitisrganahighqualitysuperresolutionreconstructionmethodformultispectralremotesensingimages AT zihanli vitisrganahighqualitysuperresolutionreconstructionmethodformultispectralremotesensingimages AT panwang vitisrganahighqualitysuperresolutionreconstructionmethodformultispectralremotesensingimages

ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images

Similar Items