Enhanced CLIP-GPT Framework for Cross-Lingual Remote Sensing Image Captioning

Remote Sensing Image Captioning (RSIC) aims to generate precise and informative descriptive text for remote sensing images using computational algorithms. Traditional “encoder-decoder” approaches face limitations due to their high training costs and heavy reliance on large-scal...

Full description

Saved in:
Bibliographic Details
Main Authors: Rui Song, Beigeng Zhao, Lizhi Yu
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10816156/
Tags: Add Tag
No Tags, Be the first to tag this record!