-
1
Real-world super-resolution with VLM-based degradation prior learning
Published 2025-08-01
Article -
2
In-Context Vision-Pattern-Language Model for Enhancing Vessel Activity Explanation
Published 2025-01-01
Article -
3
Coherent Interpretation of Entire Visual Field Test Reports Using a Multimodal Large Language Model (ChatGPT)
Published 2025-04-01
Article -
4
An Image Grid Can Be Worth a Video: Zero-Shot Video Question Answering Using a VLM
Published 2024-01-01
Article -
5
FireCLIP: Enhancing Forest Fire Detection with Multimodal Prompt Tuning and Vision-Language Understanding
Published 2025-06-01
Article -
6
RelVid: Relational Learning with Vision-Language Models for Weakly Video Anomaly Detection
Published 2025-03-01 Subjects: “…vision-language model…”
Article -
7
CLIP-Llama: A New Approach for Scene Text Recognition with a Pre-Trained Vision-Language Model and a Pre-Trained Language Model
Published 2024-11-01
Article -
8
Estimating Age and Sex from Dental Panoramic Radiographs Using Neural Networks and Vision–Language Models
Published 2025-01-01
Article -
9
LARE: Latent augmentation using regional embedding with vision-language model
Published 2025-06-01
Article -
10
Performance of vision language models for optic disc swelling identification on fundus photographs
Published 2025-08-01 Subjects: “…vision language model…”
Article -
11
Review and emerging trends of embodied agent based on multimodal large language models
Published 2025-05-01
Article -
12
Class Concept Representation From Contextual Texts for Training-Free Multi-Label Recognition
Published 2025-01-01
Article -
13
Mixture of prompts learning for vision-language models
Published 2025-06-01
Article -
14
Cross-Modal Data Fusion via Vision-Language Model for Crop Disease Recognition
Published 2025-06-01
Article -
15
Open challenges and opportunities in federated foundation models towards biomedical healthcare
Published 2025-01-01
Article -
16
Enhanced BLIP-2 Optimization Using LoRA for Generating Dashcam Captions
Published 2025-03-01
Article -
17
An Exploratory Study on Workover Scenario Understanding Using Prompt-Enhanced Vision-Language Models
Published 2025-05-01
Article -
18
Alzheimer’s disease recognition using graph neural network by leveraging image-text similarity from vision language model
Published 2025-01-01
Article -
19
Chart Accessibility: A Review of Current Alt Text Generation
Published 2025-01-01
Article -
20
AITtrack: Attention-Based Image-Text Alignment for Visual Tracking
Published 2025-01-01
Article