ADAGENT: Anomaly Detection Agent With Multimodal Large Models in Adverse Environments

Multimodal Language Models (MMLMs), such as LLaVA and GPT-4V, have shown zero-shot generalization capabilities for understanding images and text across various domains. However, their effectiveness in open-world visual tasks, particularly anomaly detection under challenging conditions, such as low l...

Bibliographic Details
Main Authors: Miao Zhang, Yiqing Shen, Jun Yin, Shuai Lu, Xueqian Wang
Format: Article
Language: English
Published: IEEE 2024-01-01
Series: IEEE Access
Online Access: https://ieeexplore.ieee.org/document/10716620/