CuTCP: Custom Text Generation-based Class-aware Prompt Tuning for visual-language models

Abstract: Visual-language models (VLMs) excel at cross-modal reasoning by synthesizing visual and linguistic features. Recent VLMs use prompt learning for fine-tuning, allowing adaptation to various downstream tasks. TCP applies class-aware prompt tuning to improve VLM generalization, yet its relian...

Bibliographic Details
Main Authors: Min Huang, Chen Yang, Xiaoyan Yu
Format: Article
Language: English
Published: Nature Portfolio 2025-01-01
Series: Scientific Reports
Online Access: https://doi.org/10.1038/s41598-025-85838-x