Qualitative metrics from the biomedical literature for evaluating large language models in clinical decision-making: a narrative review

Abstract Background The large language models (LLMs), most notably ChatGPT, released since November 30, 2022, have prompted shifting attention to their use in medicine, particularly for supporting clinical decision-making. However, there is little consensus in the medical community on how LLM perfor...

Full description

Saved in:

Bibliographic Details
Main Authors:	Cindy N. Ho, Tiffany Tian, Alessandra T. Ayers, Rachel E. Aaron, Vidith Phillips, Risa M. Wolf, Nestoras Mathioudakis, Tinglong Dai, David C. Klonoff
Format:	Article
Language:	English
Published:	BMC 2024-11-01
Series:	BMC Medical Informatics and Decision Making
Subjects:	Artificial Intelligence ChatGPT Clinical decision-making Large Language Model Machine learning
Online Access:	https://doi.org/10.1186/s12911-024-02757-z
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://doi.org/10.1186/s12911-024-02757-z

Qualitative metrics from the biomedical literature for evaluating large language models in clinical decision-making: a narrative review

Internet

Similar Items