GFTT: Geographical Feature Tokenization Transformer for SAR-to-Optical Image Translation

Synthetic aperture radar (SAR) image to optical image translation not only assists information interpretability, but also fills the gaps in optical applications due to weather and light limitations. However, several studies have pointed out that specialized methods heavily struggle to deliver images...

Full description

Saved in:
Bibliographic Details
Main Authors: Hongbo Liang, Xuezhi Yang, Xiangyu Yang, Jinjin Luo, Jiajia Zhu
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10816574/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832592935194984448
author Hongbo Liang
Xuezhi Yang
Xiangyu Yang
Jinjin Luo
Jiajia Zhu
author_facet Hongbo Liang
Xuezhi Yang
Xiangyu Yang
Jinjin Luo
Jiajia Zhu
author_sort Hongbo Liang
collection DOAJ
description Synthetic aperture radar (SAR) image to optical image translation not only assists information interpretability, but also fills the gaps in optical applications due to weather and light limitations. However, several studies have pointed out that specialized methods heavily struggle to deliver images with widely varying optical imaging styles, thus, resulting in poor image translation with disharmonious and repetitive artifacts. Another critical issue attributes to the scarcity of geographical prior knowledge. The generator always attempts to produce images within a narrow scope of the data space, which severely restricts the semantic correspondence between SAR content and optical styles. In this article, we introduce a novel tokenization, namely geographical imaging tokenizer (GIT), which captures imaging style of ground materials in the optical image. Based on the GIT, we propose a geographical feature tokenization transformer framework (GFTT) that discovers the consensus between SAR and optical images. In addition, we leverage a self-supervisory task to encourage the transformer to learn meaningful semantic correspondence from local and global style patterns. Finally, we utilize the noise-contrastive estimation loss to maximize mutual information between the input and translated image. Through qualitative and quantitative experimental evaluations, we verify the reliability of the proposed GIT that aligns with authentic expressions of the optical observation scenario, and indicates the superiority of GFTT in contrast to the state-of-the-art algorithms.
format Article
id doaj-art-7eb47e082054478187543e04c818f3bc
institution Kabale University
issn 1939-1404
2151-1535
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj-art-7eb47e082054478187543e04c818f3bc2025-01-21T00:00:40ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing1939-14042151-15352025-01-01182975298910.1109/JSTARS.2024.352327410816574GFTT: Geographical Feature Tokenization Transformer for SAR-to-Optical Image TranslationHongbo Liang0https://orcid.org/0000-0002-6046-9340Xuezhi Yang1https://orcid.org/0000-0002-2303-4858Xiangyu Yang2Jinjin Luo3Jiajia Zhu4https://orcid.org/0009-0008-9914-275XAnhui Key Laboratory of Industry Safety and Emergency Technology, Hefei, ChinaAnhui Key Laboratory of Industry Safety and Emergency Technology, Hefei, ChinaAnhui Key Laboratory of Industry Safety and Emergency Technology, Hefei, ChinaAnhui Key Laboratory of Industry Safety and Emergency Technology, Hefei, ChinaAnhui Key Laboratory of Industry Safety and Emergency Technology, Hefei, ChinaSynthetic aperture radar (SAR) image to optical image translation not only assists information interpretability, but also fills the gaps in optical applications due to weather and light limitations. However, several studies have pointed out that specialized methods heavily struggle to deliver images with widely varying optical imaging styles, thus, resulting in poor image translation with disharmonious and repetitive artifacts. Another critical issue attributes to the scarcity of geographical prior knowledge. The generator always attempts to produce images within a narrow scope of the data space, which severely restricts the semantic correspondence between SAR content and optical styles. In this article, we introduce a novel tokenization, namely geographical imaging tokenizer (GIT), which captures imaging style of ground materials in the optical image. Based on the GIT, we propose a geographical feature tokenization transformer framework (GFTT) that discovers the consensus between SAR and optical images. In addition, we leverage a self-supervisory task to encourage the transformer to learn meaningful semantic correspondence from local and global style patterns. Finally, we utilize the noise-contrastive estimation loss to maximize mutual information between the input and translated image. Through qualitative and quantitative experimental evaluations, we verify the reliability of the proposed GIT that aligns with authentic expressions of the optical observation scenario, and indicates the superiority of GFTT in contrast to the state-of-the-art algorithms.https://ieeexplore.ieee.org/document/10816574/Geographical imaging tokenizer (GIT)noise-contrastive estimation (NCE)self-supervisory tasksynthetic aperture radar (SAR)-to-optical (S2O) image translationtransformer
spellingShingle Hongbo Liang
Xuezhi Yang
Xiangyu Yang
Jinjin Luo
Jiajia Zhu
GFTT: Geographical Feature Tokenization Transformer for SAR-to-Optical Image Translation
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Geographical imaging tokenizer (GIT)
noise-contrastive estimation (NCE)
self-supervisory task
synthetic aperture radar (SAR)-to-optical (S2O) image translation
transformer
title GFTT: Geographical Feature Tokenization Transformer for SAR-to-Optical Image Translation
title_full GFTT: Geographical Feature Tokenization Transformer for SAR-to-Optical Image Translation
title_fullStr GFTT: Geographical Feature Tokenization Transformer for SAR-to-Optical Image Translation
title_full_unstemmed GFTT: Geographical Feature Tokenization Transformer for SAR-to-Optical Image Translation
title_short GFTT: Geographical Feature Tokenization Transformer for SAR-to-Optical Image Translation
title_sort gftt geographical feature tokenization transformer for sar to optical image translation
topic Geographical imaging tokenizer (GIT)
noise-contrastive estimation (NCE)
self-supervisory task
synthetic aperture radar (SAR)-to-optical (S2O) image translation
transformer
url https://ieeexplore.ieee.org/document/10816574/
work_keys_str_mv AT hongboliang gfttgeographicalfeaturetokenizationtransformerforsartoopticalimagetranslation
AT xuezhiyang gfttgeographicalfeaturetokenizationtransformerforsartoopticalimagetranslation
AT xiangyuyang gfttgeographicalfeaturetokenizationtransformerforsartoopticalimagetranslation
AT jinjinluo gfttgeographicalfeaturetokenizationtransformerforsartoopticalimagetranslation
AT jiajiazhu gfttgeographicalfeaturetokenizationtransformerforsartoopticalimagetranslation