A systematic evaluation of GPT-4V's multimodal capability for chest X-ray image analysis
This work evaluates GPT-4V's multimodal capability for medical image analysis, focusing on three representative tasks radiology report generation, medical visual question answering, and medical visual grounding. For the evaluation, a set of prompts is designed for each task to induce the corres...
Saved in:
Main Authors: | Yunyi Liu, Yingshu Li, Zhanyu Wang, Xinyu Liang, Lingqiao Liu, Lei Wang, Leyang Cui, Zhaopeng Tu, Longyue Wang, Luping Zhou |
---|---|
Format: | Article |
Language: | English |
Published: |
KeAi Communications Co., Ltd.
2024-12-01
|
Series: | Meta-Radiology |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2950162824000535 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Visual Question Answering in Robotic Surgery: A Comprehensive Review
by: Di Ding, et al.
Published: (2025-01-01) -
Answer Distillation Network With Bi-Text-Image Attention for Medical Visual Question Answering
by: Hongfang Gong, et al.
Published: (2025-01-01) -
ChestX-Transcribe: a multimodal transformer for automated radiology report generation from chest x-rays
by: Prateek Singh, et al.
Published: (2025-01-01) -
Prompting Large Language Models with Knowledge-Injection for Knowledge-Based Visual Question Answering
by: Zhongjian Hu, et al.
Published: (2024-09-01) -
SumGPT: A Multimodal Framework for Radiology Report Summarization to Improve Clinical Performance
by: Tipu Sultan, et al.
Published: (2025-01-01)