STeInFormer: Spatial–Temporal Interaction Transformer Architecture for Remote Sensing Change Detection

Convolutional neural networks and attention mechanisms have greatly benefited remote sensing change detection (RSCD) because of their outstanding discriminative ability. Existent RSCD methods often follow a paradigm of using a noninteractive Siamese neural network for multitemporal feature extractio...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiaowen Ma, Zhenkai Wu, Mengting Ma, Mengjiao Zhao, Fan Yang, Zhenhong Du, Wei Zhang
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10815617/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832590426369949696
author Xiaowen Ma
Zhenkai Wu
Mengting Ma
Mengjiao Zhao
Fan Yang
Zhenhong Du
Wei Zhang
author_facet Xiaowen Ma
Zhenkai Wu
Mengting Ma
Mengjiao Zhao
Fan Yang
Zhenhong Du
Wei Zhang
author_sort Xiaowen Ma
collection DOAJ
description Convolutional neural networks and attention mechanisms have greatly benefited remote sensing change detection (RSCD) because of their outstanding discriminative ability. Existent RSCD methods often follow a paradigm of using a noninteractive Siamese neural network for multitemporal feature extraction and change detection heads for feature fusion and change representation. However, this paradigm lacks the contemplation of the characteristics of RSCD in temporal and spatial dimensions, and causes the drawback on spatial–temporal interaction that hinders high-quality feature extraction. To address this problem, we present a spatial–temporal interaction Transformer architecture for multitemporal feature extraction, which is the first general backbone network specifically designed for RSCD. In addition, we propose a parameter-free multifrequency token mixer to integrate frequency-domain features that provide spectral information for RSCD. Experimental results on three datasets validate the effectiveness of the proposed method, which can outperform the state-of-the-art methods and achieve the most satisfactory efficiency-accuracy tradeoff.
format Article
id doaj-art-cb88eafcfe2044a7b96e19a49e56aad4
institution Kabale University
issn 1939-1404
2151-1535
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj-art-cb88eafcfe2044a7b96e19a49e56aad42025-01-24T00:00:57ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing1939-14042151-15352025-01-01183735374510.1109/JSTARS.2024.352232910815617STeInFormer: Spatial–Temporal Interaction Transformer Architecture for Remote Sensing Change DetectionXiaowen Ma0https://orcid.org/0000-0001-5031-2641Zhenkai Wu1https://orcid.org/0009-0000-0613-0584Mengting Ma2https://orcid.org/0000-0002-6897-3576Mengjiao Zhao3https://orcid.org/0009-0006-1814-2404Fan Yang4https://orcid.org/0009-0002-1292-0894Zhenhong Du5https://orcid.org/0000-0001-9449-0415Wei Zhang6https://orcid.org/0000-0002-4424-079XSchool of Software Technology, Zhejiang University, Hangzhou, ChinaSchool of Software Technology, Zhejiang University, Hangzhou, ChinaSchool of Computer Science and Technology, Zhejiang University, Hangzhou, ChinaSchool of Software Technology, Zhejiang University, Hangzhou, ChinaSchool of Software Technology, Zhejiang University, Hangzhou, ChinaSchool of Earth Sciences, Zhejiang University, Hangzhou, ChinaSchool of Software Technology, Zhejiang University, Hangzhou, ChinaConvolutional neural networks and attention mechanisms have greatly benefited remote sensing change detection (RSCD) because of their outstanding discriminative ability. Existent RSCD methods often follow a paradigm of using a noninteractive Siamese neural network for multitemporal feature extraction and change detection heads for feature fusion and change representation. However, this paradigm lacks the contemplation of the characteristics of RSCD in temporal and spatial dimensions, and causes the drawback on spatial–temporal interaction that hinders high-quality feature extraction. To address this problem, we present a spatial–temporal interaction Transformer architecture for multitemporal feature extraction, which is the first general backbone network specifically designed for RSCD. In addition, we propose a parameter-free multifrequency token mixer to integrate frequency-domain features that provide spectral information for RSCD. Experimental results on three datasets validate the effectiveness of the proposed method, which can outperform the state-of-the-art methods and achieve the most satisfactory efficiency-accuracy tradeoff.https://ieeexplore.ieee.org/document/10815617/Change detectioncross-spatial interactioncross-temporal interactionmultifrequency
spellingShingle Xiaowen Ma
Zhenkai Wu
Mengting Ma
Mengjiao Zhao
Fan Yang
Zhenhong Du
Wei Zhang
STeInFormer: Spatial–Temporal Interaction Transformer Architecture for Remote Sensing Change Detection
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Change detection
cross-spatial interaction
cross-temporal interaction
multifrequency
title STeInFormer: Spatial–Temporal Interaction Transformer Architecture for Remote Sensing Change Detection
title_full STeInFormer: Spatial–Temporal Interaction Transformer Architecture for Remote Sensing Change Detection
title_fullStr STeInFormer: Spatial–Temporal Interaction Transformer Architecture for Remote Sensing Change Detection
title_full_unstemmed STeInFormer: Spatial–Temporal Interaction Transformer Architecture for Remote Sensing Change Detection
title_short STeInFormer: Spatial–Temporal Interaction Transformer Architecture for Remote Sensing Change Detection
title_sort steinformer spatial x2013 temporal interaction transformer architecture for remote sensing change detection
topic Change detection
cross-spatial interaction
cross-temporal interaction
multifrequency
url https://ieeexplore.ieee.org/document/10815617/
work_keys_str_mv AT xiaowenma steinformerspatialx2013temporalinteractiontransformerarchitectureforremotesensingchangedetection
AT zhenkaiwu steinformerspatialx2013temporalinteractiontransformerarchitectureforremotesensingchangedetection
AT mengtingma steinformerspatialx2013temporalinteractiontransformerarchitectureforremotesensingchangedetection
AT mengjiaozhao steinformerspatialx2013temporalinteractiontransformerarchitectureforremotesensingchangedetection
AT fanyang steinformerspatialx2013temporalinteractiontransformerarchitectureforremotesensingchangedetection
AT zhenhongdu steinformerspatialx2013temporalinteractiontransformerarchitectureforremotesensingchangedetection
AT weizhang steinformerspatialx2013temporalinteractiontransformerarchitectureforremotesensingchangedetection