Visual-textual integration in LLMs for medical diagnosis: A preliminary quantitative analysis
Background and aim: Visual data from images is essential for many medical diagnoses. This study evaluates the performance of multimodal Large Language Models (LLMs) in integrating textual and visual information for diagnostic purposes. Methods: We tested GPT-4o and Claude Sonnet 3.5 on 120 clinical...
Saved in:
Main Authors: | Reem Agbareia, Mahmud Omar, Shelly Soffer, Benjamin S. Glicksberg, Girish N. Nadkarni, Eyal Klang |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2025-01-01
|
Series: | Computational and Structural Biotechnology Journal |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S2001037024004379 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Emerging applications of NLP and large language models in gastroenterology and hepatology: a systematic review
by: Mahmud Omar, et al.
Published: (2025-01-01) -
SumGPT: A Multimodal Framework for Radiology Report Summarization to Improve Clinical Performance
by: Tipu Sultan, et al.
Published: (2025-01-01) -
Visual Question Answering in Robotic Surgery: A Comprehensive Review
by: Di Ding, et al.
Published: (2025-01-01) -
When LLMs meet cybersecurity: a systematic literature review
by: Jie Zhang, et al.
Published: (2025-02-01) -
NDEExplorer: Visual Analytics for Exploring Damage Modes via Multimodal Data in the Non-Destructive Examination of Composite Materials
by: Dongliang Guo, et al.
Published: (2025-01-01)