Enhancing human-centered dynamic scene understanding via multiple LLMs collaborated reasoning

Abstract Human-centered dynamic scene understanding plays a pivotal role in enhancing the capability of robotic and autonomous systems, where video-based human-object interaction (V-HOI) detection is a crucial task in semantic scene understanding, which aims to comprehensively understand HOI relatio...

Bibliographic Details
Main Authors:	Hang Zhang, Wenxiao Zhang, Haoxuan Qu, Jun Liu
Format:	Article
Language:	English
Published:	Springer 2025-03-01
Series:	Visual Intelligence
Subjects:	Scene understanding; Large language models; Knowledge-based reasoning
Online Access:	https://doi.org/10.1007/s44267-025-00074-1

Similar Items

MuRelSGG: Multimodal Relationship Prediction for Neurosymbolic Scene Graph Generation
by: Muhammad Junaid Khan, et al.
Published: (2025-01-01)

Domain-Incremental Learning Paradigm for scene understanding via Pseudo-Replay Generation
by: Zhifeng Xie, et al.
Published: (2025-09-01)

From object to context: Scene knowledge enhanced visual grounding for geospatial understanding
by: Ke She, et al.
Published: (2025-08-01)

Knowledge reasoning with multiple relational paths
by: Hang Su, et al.
Published: (2023-12-01)

A Proactive Agent Collaborative Framework for Zero‐Shot Multimodal Medical Reasoning
by: Zishan Gu, et al.
Published: (2025-08-01)

Enabling High-Level Worker-Centric Semantic Understanding of Onsite Images Using Visual Language Models with Attention Mechanism and Beam Search Strategy
by: Hui Deng, et al.
Published: (2025-03-01)

A text clarification and deep relational reasoning method for Mongolian-Chinese bilingual arbitrary-shaped scene text detection
by: Yuefeng Liu, et al.
Published: (2025-07-01)

Cascaded Dual-Inpainting Network for Scene Text
by: Chunmei Liu
Published: (2025-07-01)

Block-PSPGOF: high-quality mesh reconstruction of large scenes based on progressive self-planarized Gaussian opacity fields
by: Yi Chen, et al.
Published: (2025-08-01)

Visual Commonsense Causal Reasoning From a Still Image
by: Xiaojing Wu, et al.
Published: (2025-01-01)

Design of electrocatalysts based on knowledge enhanced LLMs
by: WANG Ludi, et al.
Published: (2025-03-01)

Text Understanding as a Special Kind of Understanding
by: Nataliia Akimova, et al.
Published: (2018-10-01)

Semantic Scene Completion in Autonomous Driving: A Two-Stream Multi-Vehicle Collaboration Approach
by: Junxuan Li, et al.
Published: (2024-12-01)

Large Language Models (LLMs) as Traffic Control Systems at Urban Intersections: A New Paradigm
by: Sari Masri, et al.
Published: (2025-01-01)

MYSTERY AND THE POSTMODERN SCENE: PYNCHONEAN VIEW
by: Amira Halim
Published: (2016-06-01)

Vision-Degree-Driven Loading Strategy for Real-Time Large-Scale Scene Rendering
by: Yu Ding, et al.
Published: (2025-07-01)

Nav2Scene: Navigation-driven fine-tuning for robot-friendly scene generation
by: Bowei Jiang, et al.
Published: (2025-09-01)

Dynamic convolutional model based on distribution-collaboration strategy for remote sensing scene classification
by: Chenjun Xu, et al.
Published: (2025-08-01)

Commonsense Spatial Reasoning: an Informational Perspective
by: Stefania Bandini, et al.
Published: (2008-07-01)

TOSD: A Hierarchical Object-Centric Descriptor Integrating Shape, Color, and Topology
by: Jun-Hyeon Choi, et al.
Published: (2025-07-01)

An automated construction method of 3D knowledge graph based on multi-agent systems in virtual geographic scene
by: Yukun Guo, et al.
Published: (2025-08-01)

Understanding of medical students regarding the clinical scope of the plastic surgery specialty
by: Pedro Freire Guerra Boldrin, et al.
Published: (2020-12-01)

Road scene map for autonomous driving and modeling method
by: Juan Lei, et al.
Published: (2025-08-01)

Apvit: ViT with adaptive patches for scene text recognition
by: Ning Zhang, et al.
Published: (2025-03-01)

Artificial reason and artificial intelligence: The legal reasoning capabilities of GPT-4
by: Spaić Bojan, et al.
Published: (2024-01-01)

Mediation as Problem-Solving Scene in the Light of PTC
by: Viktor Németh
Published: (2014-05-01)

Not an Illusion but a Manifestation: Understanding Large Language Model Reasoning Limitations Through Dual-Process Theory
by: Boris Gorelik
Published: (2025-07-01)

Legal Reason: the use of analogy in legal argument
by: Weinreb, Lloyd L
Published: (2016)

Domain Knowledge Decomposition for Cross-Domain Few-Shot Scene Classification From Remote Sensing Imagery
by: Can Li, et al.
Published: (2025-01-01)

Exploring teachers’ pedagogical reasoning in mathematics education using the TPACK framework
by: Roni Priyanda, et al.
Published: (2025-05-01)

MFSM-Net: Multimodal Feature Fusion for the Semantic Segmentation of Urban-Scale Textured 3D Meshes
by: Xinjie Hao, et al.
Published: (2025-04-01)

The audio description of sex scenes in Élite
by: Margherita Dore, et al.
Published: (2025-07-01)

MSER Fast Skewed Scene-text Location Algorithm
by: ZHANG Kai-yu, et al.
Published: (2019-04-01)

From knowledge silos to integrated insights: building a cardiovascular medication knowledge graph for enhanced medication knowledge retrieval, relationship discovery, and reasoning
by: Hongzhen Cui, et al.
Published: (2025-04-01)

Using e-scaffolding to develop students’ scientific reasoning through inquiry-based learning
by: S. Koes Handayanto, et al.
Published: (2023-11-01)

Analysis of Adaptive Reasoning Ability in Middle School Students Mathematics Learning
by: Dela Amelia Putri, et al.
Published: (2024-07-01)

LFEN: A language feature enhanced network for scene text recognition
by: Hui Chen, et al.
Published: (2025-01-01)

SceneDiffusion: Scene Generation Model Embedded with Spatial Constraints
by: Shanshan Yu, et al.
Published: (2025-06-01)

Examining the authority of reason from the perspective of religious sources
by: Morteza Barati, et al.
Published: (2021-08-01)