Enhancing human-centered dynamic scene understanding via multiple LLMs collaborated reasoning

Abstract Human-centered dynamic scene understanding plays a pivotal role in enhancing the capability of robotic and autonomous systems, where video-based human-object interaction (V-HOI) detection is a crucial task in semantic scene understanding, which aims to comprehensively understand HOI relatio...

Full description

Saved in:
Bibliographic Details
Main Authors: Hang Zhang, Wenxiao Zhang, Haoxuan Qu, Jun Liu
Format: Article
Language:English
Published: Springer 2025-03-01
Series:Visual Intelligence
Subjects:
Online Access:https://doi.org/10.1007/s44267-025-00074-1
Tags: Add Tag
No Tags, Be the first to tag this record!