ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images

The reflective characteristics of remote sensing image information depend on the scale of the observed area, with high-resolution images providing more detailed feature information. Currently, monitoring refined industries and extracting regional information necessitate higher-resolution remote sens...

Full description

Saved in:
Bibliographic Details
Main Authors: Yifeng Yang, Hengqian Zhao, Xiadan Huangfu, Zihan Li, Pan Wang
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10836746/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832582364111306752
author Yifeng Yang
Hengqian Zhao
Xiadan Huangfu
Zihan Li
Pan Wang
author_facet Yifeng Yang
Hengqian Zhao
Xiadan Huangfu
Zihan Li
Pan Wang
author_sort Yifeng Yang
collection DOAJ
description The reflective characteristics of remote sensing image information depend on the scale of the observed area, with high-resolution images providing more detailed feature information. Currently, monitoring refined industries and extracting regional information necessitate higher-resolution remote sensing images. Super-resolution reconstruction of remote sensing multispectral images not only enhances the spatial resolution of these images but also preserves and improves the spectral information of multispectral data, thereby providing richer ground object information and more accurate environmental monitoring data. To improve the effectiveness of feature extraction in the generator network while maintaining model efficiency, this article proposes the vision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model. This model is an improvement upon the original SRGAN super-resolution image reconstruction method, incorporating lightweight network modules, channel attention modules, spatial-spectral residual attention, and the vision transformer structure. The ViT-ISRGAN model focuses on reconstructing four types of typical ground objects based on Sentinel-2 images: urban, water, farmland, and forest. Results indicate that the ViT-ISRGAN model excels in capturing texture details and color restoration, effectively extracting spectral and texture information from multispectral remote sensing images across various scenes. Compared to other super-resolution (SR) models, this approach demonstrates superior effectiveness and performance in the SR tasks of remote sensing multispectral images.
format Article
id doaj-art-31e59eff5cd448898cf9c5723194b881
institution Kabale University
issn 1939-1404
2151-1535
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj-art-31e59eff5cd448898cf9c5723194b8812025-01-30T00:00:16ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing1939-14042151-15352025-01-01183973398810.1109/JSTARS.2025.352722610836746ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing ImagesYifeng Yang0https://orcid.org/0009-0001-6018-6690Hengqian Zhao1https://orcid.org/0000-0003-0776-142XXiadan Huangfu2Zihan Li3Pan Wang4https://orcid.org/0000-0003-2846-9882College of Geoscience and Surveying Engineering, China University of Mining and Technology, Beijing, ChinaCollege of Geoscience and Surveying Engineering, China University of Mining and Technology, Beijing, ChinaCollege of Geoscience and Surveying Engineering, China University of Mining and Technology, Beijing, ChinaCollege of Geoscience and Surveying Engineering, China University of Mining and Technology, Beijing, ChinaCollege of Geoscience and Surveying Engineering, China University of Mining and Technology, Beijing, ChinaThe reflective characteristics of remote sensing image information depend on the scale of the observed area, with high-resolution images providing more detailed feature information. Currently, monitoring refined industries and extracting regional information necessitate higher-resolution remote sensing images. Super-resolution reconstruction of remote sensing multispectral images not only enhances the spatial resolution of these images but also preserves and improves the spectral information of multispectral data, thereby providing richer ground object information and more accurate environmental monitoring data. To improve the effectiveness of feature extraction in the generator network while maintaining model efficiency, this article proposes the vision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model. This model is an improvement upon the original SRGAN super-resolution image reconstruction method, incorporating lightweight network modules, channel attention modules, spatial-spectral residual attention, and the vision transformer structure. The ViT-ISRGAN model focuses on reconstructing four types of typical ground objects based on Sentinel-2 images: urban, water, farmland, and forest. Results indicate that the ViT-ISRGAN model excels in capturing texture details and color restoration, effectively extracting spectral and texture information from multispectral remote sensing images across various scenes. Compared to other super-resolution (SR) models, this approach demonstrates superior effectiveness and performance in the SR tasks of remote sensing multispectral images.https://ieeexplore.ieee.org/document/10836746/Downscalingremote sensingsuper-resolution (SR)transformervision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model
spellingShingle Yifeng Yang
Hengqian Zhao
Xiadan Huangfu
Zihan Li
Pan Wang
ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Downscaling
remote sensing
super-resolution (SR)
transformer
vision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model
title ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images
title_full ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images
title_fullStr ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images
title_full_unstemmed ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images
title_short ViT-ISRGAN: A High-Quality Super-Resolution Reconstruction Method for Multispectral Remote Sensing Images
title_sort vit isrgan a high quality super resolution reconstruction method for multispectral remote sensing images
topic Downscaling
remote sensing
super-resolution (SR)
transformer
vision transformer improved super-resolution generative adversarial network (ViT-ISRGAN) model
url https://ieeexplore.ieee.org/document/10836746/
work_keys_str_mv AT yifengyang vitisrganahighqualitysuperresolutionreconstructionmethodformultispectralremotesensingimages
AT hengqianzhao vitisrganahighqualitysuperresolutionreconstructionmethodformultispectralremotesensingimages
AT xiadanhuangfu vitisrganahighqualitysuperresolutionreconstructionmethodformultispectralremotesensingimages
AT zihanli vitisrganahighqualitysuperresolutionreconstructionmethodformultispectralremotesensingimages
AT panwang vitisrganahighqualitysuperresolutionreconstructionmethodformultispectralremotesensingimages