BVQA: Connecting Language and Vision Through Multimodal Attention for Open-Ended Question Answering
Visual Question Answering (VQA) is a challenging problem of Artificial Intelligence (AI) that requires an understanding of natural language and computer vision to respond to inquiries based on visual content within images. Research on VQA has gained immense traction due to its wide range of applicat...
Saved in:
| Main Authors: | Md. Shalha Mucha Bhuyan, Eftekhar Hossain, Khaleda Akhter Sathi, Md. Azad Hossain, M. Ali Akber Dewan |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10878995/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Multimodal representative answer extraction in community question answering
by: Ming Li, et al.
Published: (2023-10-01) -
Enhancing Visual Question Answering for Multiple Choice Questions
by: Rashi Goel, et al.
Published: (2025-01-01) -
Designing and Evaluating a Dual-Stream Transformer-Based Architecture for Visual Question Answering
by: Faheem Shehzad, et al.
Published: (2024-01-01) -
Cross-Encoder-Based Semantic Evaluation of Extractive and Generative Question Answering in Low-Resourced African Languages
by: Funebi Francis Ijebu, et al.
Published: (2025-03-01) -
ReceiptQA: A Question-Answering Dataset for Receipt Understanding
by: Mahmoud Abdalla, et al.
Published: (2025-05-01)