ADAGENT: Anomaly Detection Agent With Multimodal Large Models in Adverse Environments

Multimodal Language Models (MMLMs), such as LLaVA and GPT-4V, have shown zero-shot generalization capabilities for understanding images and text across various domains. However, their effectiveness in open-world visual tasks, particularly anomaly detection under challenging conditions, such as low l...

Full description

Saved in:

Bibliographic Details
Main Authors:	Miao Zhang, Yiqing Shen, Jun Yin, Shuai Lu, Xueqian Wang
Format:	Article
Language:	English
Published:	IEEE 2024-01-01
Series:	IEEE Access
Subjects:	Multimodal language model anomaly detection prompt engineering AI agent
Online Access:	https://ieeexplore.ieee.org/document/10716620/
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://ieeexplore.ieee.org/document/10716620/

ADAGENT: Anomaly Detection Agent With Multimodal Large Models in Adverse Environments

Internet

Similar Items