Vision-language models for medical report generation and visual question answering: a review

Medical vision-language models (VLMs) combine computer vision (CV) and natural language processing (NLP) to analyze visual and textual medical data. Our paper reviews recent advancements in developing VLMs specialized for healthcare, focusing on publicly available models designed for medical report...

Bibliographic Details
Main Authors: Iryna Hartsock, Ghulam Rasool
Format: Article
Language: English
Published: Frontiers Media S.A. 2024-11-01
Series: Frontiers in Artificial Intelligence
Online Access: https://www.frontiersin.org/articles/10.3389/frai.2024.1430984/full