Enhanced CLIP-GPT Framework for Cross-Lingual Remote Sensing Image Captioning

Remote Sensing Image Captioning (RSIC) aims to generate precise and informative descriptive text for remote sensing images using computational algorithms. Traditional “encoder-decoder” approaches face limitations due to their high training costs and heavy reliance on large-scal...

Full description

Saved in:

Bibliographic Details
Main Authors:	Rui Song, Beigeng Zhao, Lizhi Yu
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Remote sensing image captioning CLIP GPT deep learning multimodal
Online Access:	https://ieeexplore.ieee.org/document/10816156/
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://ieeexplore.ieee.org/document/10816156/

Enhanced CLIP-GPT Framework for Cross-Lingual Remote Sensing Image Captioning

Internet

Similar Items