BVQA: Connecting Language and Vision Through Multimodal Attention for Open-Ended Question Answering
Visual Question Answering (VQA) is a challenging problem of Artificial Intelligence (AI) that requires an understanding of natural language and computer vision to respond to inquiries based on visual content within images. Research on VQA has gained immense traction due to its wide range of applicat...
Saved in:
| Main Authors: | Md. Shalha Mucha Bhuyan, Eftekhar Hossain, Khaleda Akhter Sathi, Md. Azad Hossain, M. Ali Akber Dewan |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10878995/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Cross-Encoder-Based Semantic Evaluation of Extractive and Generative Question Answering in Low-Resourced African Languages
by: Funebi Francis Ijebu, et al.
Published: (2025-03-01) -
Multimodal representative answer extraction in community question answering
by: Ming Li, et al.
Published: (2023-10-01) -
Enhancing Visual Question Answering for Multiple Choice Questions
by: Rashi Goel, et al.
Published: (2025-01-01) -
SHIFA: SBERT-Based Healthcare Information Focused Arabic Question Answering
by: Rahaf Alruwaithi, et al.
Published: (2025-01-01) -
Envisioning Answers: Unleashing Deep Learning for Visual Question Answering in Artistic Images
by: Erfan Zolghadriha, et al.
Published: (2024-03-01)