A Novel Change Detection Method Based on Visual Language From High-Resolution Remote Sensing Images

Recently, the release of “all-in-one” foundation models has sparked rapid developments in artificial intelligence. However, due to the fact that these models are typically trained on natural images, their potential in remote sensing remains largely untapped. To address this gap...

Full description

Saved in:
Bibliographic Details
Main Authors: Junlong Qiu, Wei Liu, Hui Zhang, Erzhu Li, Lianpeng Zhang, Xing Li
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10818767/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832540566348365824
author Junlong Qiu
Wei Liu
Hui Zhang
Erzhu Li
Lianpeng Zhang
Xing Li
author_facet Junlong Qiu
Wei Liu
Hui Zhang
Erzhu Li
Lianpeng Zhang
Xing Li
author_sort Junlong Qiu
collection DOAJ
description Recently, the release of &#x201C;all-in-one&#x201D; foundation models has sparked rapid developments in artificial intelligence. However, due to the fact that these models are typically trained on natural images, their potential in remote sensing remains largely untapped. To address this gap, this article proposes a novel change detection method based on visual language from high-resolution remote sensing images, named VLCD. Specifically, on the text side, we use context optimization to align text&#x2013;image semantics. On the image side, we construct a side fusion network, which integrates universal features from the foundation model with domain-specific features from remote sensing through a bridging module. In addition, we introduce a change feature computation module to integrate global features, difference features, and textual information. To validate the effectiveness of the proposed method, we conducted comparative experiments on three public datasets. The results show that the proposed VLCD achieved state-of-the-art <italic>F</italic>1-scores and IoUs on these three datasets: LEVIR-CD (90.99&#x0025;, 83.46&#x0025;), SYSU-CD (83.05&#x0025;, 71.01&#x0025;), and S2Looking (62.75&#x0025;, 45.89&#x0025;), outperforming the results obtained through full fine-tuning while using less than one-tenth of the number of parameters.
format Article
id doaj-art-e0d746c3bba34f2a9743e759dcf717fe
institution Kabale University
issn 1939-1404
2151-1535
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
spelling doaj-art-e0d746c3bba34f2a9743e759dcf717fe2025-02-05T00:00:24ZengIEEEIEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing1939-14042151-15352025-01-01184554456710.1109/JSTARS.2024.352438210818767A Novel Change Detection Method Based on Visual Language From High-Resolution Remote Sensing ImagesJunlong Qiu0https://orcid.org/0009-0001-2139-4550Wei Liu1https://orcid.org/0000-0001-8808-7961Hui Zhang2https://orcid.org/0009-0001-2138-2656Erzhu Li3https://orcid.org/0000-0002-5881-618XLianpeng Zhang4https://orcid.org/0000-0001-9765-9730Xing Li5https://orcid.org/0000-0003-3793-1581School of Geography, Geomatics, and Planning, Jiangsu Normal University, Xuzhou, ChinaSchool of Geography, Geomatics, and Planning, Jiangsu Normal University, Xuzhou, ChinaSchool of Geography, Geomatics, and Planning, Jiangsu Normal University, Xuzhou, ChinaSchool of Geography, Geomatics, and Planning, Jiangsu Normal University, Xuzhou, ChinaSchool of Geography, Geomatics, and Planning, Jiangsu Normal University, Xuzhou, ChinaSchool of Geography, Geomatics, and Planning, Jiangsu Normal University, Xuzhou, ChinaRecently, the release of &#x201C;all-in-one&#x201D; foundation models has sparked rapid developments in artificial intelligence. However, due to the fact that these models are typically trained on natural images, their potential in remote sensing remains largely untapped. To address this gap, this article proposes a novel change detection method based on visual language from high-resolution remote sensing images, named VLCD. Specifically, on the text side, we use context optimization to align text&#x2013;image semantics. On the image side, we construct a side fusion network, which integrates universal features from the foundation model with domain-specific features from remote sensing through a bridging module. In addition, we introduce a change feature computation module to integrate global features, difference features, and textual information. To validate the effectiveness of the proposed method, we conducted comparative experiments on three public datasets. The results show that the proposed VLCD achieved state-of-the-art <italic>F</italic>1-scores and IoUs on these three datasets: LEVIR-CD (90.99&#x0025;, 83.46&#x0025;), SYSU-CD (83.05&#x0025;, 71.01&#x0025;), and S2Looking (62.75&#x0025;, 45.89&#x0025;), outperforming the results obtained through full fine-tuning while using less than one-tenth of the number of parameters.https://ieeexplore.ieee.org/document/10818767/Change detectionfoundation modelprompt learningremote sensing (RS)
spellingShingle Junlong Qiu
Wei Liu
Hui Zhang
Erzhu Li
Lianpeng Zhang
Xing Li
A Novel Change Detection Method Based on Visual Language From High-Resolution Remote Sensing Images
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Change detection
foundation model
prompt learning
remote sensing (RS)
title A Novel Change Detection Method Based on Visual Language From High-Resolution Remote Sensing Images
title_full A Novel Change Detection Method Based on Visual Language From High-Resolution Remote Sensing Images
title_fullStr A Novel Change Detection Method Based on Visual Language From High-Resolution Remote Sensing Images
title_full_unstemmed A Novel Change Detection Method Based on Visual Language From High-Resolution Remote Sensing Images
title_short A Novel Change Detection Method Based on Visual Language From High-Resolution Remote Sensing Images
title_sort novel change detection method based on visual language from high resolution remote sensing images
topic Change detection
foundation model
prompt learning
remote sensing (RS)
url https://ieeexplore.ieee.org/document/10818767/
work_keys_str_mv AT junlongqiu anovelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages
AT weiliu anovelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages
AT huizhang anovelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages
AT erzhuli anovelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages
AT lianpengzhang anovelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages
AT xingli anovelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages
AT junlongqiu novelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages
AT weiliu novelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages
AT huizhang novelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages
AT erzhuli novelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages
AT lianpengzhang novelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages
AT xingli novelchangedetectionmethodbasedonvisuallanguagefromhighresolutionremotesensingimages