WinMRSI: Feature Matching With Window Attention for Multimodal Remote Sensing Image

Multimodal remote sensing image matching is a crucial task with broad application potential. However, substantial nonlinear radiometric differences between multimodal images pose significant challenges, often leading to mismatches. To tackle these challenges, this article introduces WinMRSI, a windo...

Full description

Saved in:
Bibliographic Details
Main Authors: Yide Di, Yun Liao, Yunan Liu, Hao Zhou, Kaijun Zhu, Mingyu Lu, Qing Duan, Junhui Liu
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/11022727/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Multimodal remote sensing image matching is a crucial task with broad application potential. However, substantial nonlinear radiometric differences between multimodal images pose significant challenges, often leading to mismatches. To tackle these challenges, this article introduces WinMRSI, a window attention-based multimodal remote sensing image matching method designed to enhance cross-modal feature extraction and information interaction. For feature extraction, a siamese network with discrete cosine transform is employed to model inter-channel dependencies and extract multiscale features from cross-modal images. In addition, a dual-branch network is designed to capture contextual dependencies while refining local feature representations. For information interaction, WinMRSI integrates a window attention mechanism to strengthen fine-grained feature fusion within highly relevant windows, enabling the model to focus on discriminative regions. Furthermore, a multilevel matching module progressively refines matching accuracy in a coarse-to-fine manner across window, patch, and pixel levels. Extensive evaluations on benchmark datasets demonstrate that WinMRSI achieves state-of-the-art performance in multimodal remote sensing image matching. Ablation studies further validate the effectiveness of each component in WinMRSI.
ISSN:1939-1404
2151-1535