Enabling High-Level Worker-Centric Semantic Understanding of Onsite Images Using Visual Language Models with Attention Mechanism and Beam Search Strategy

Visual information is becoming increasingly essential in construction management. However, a significant portion of this information remains underutilized by construction managers due to the limitations of existing image processing algorithms. These algorithms primarily rely on low-level visual feat...

Full description

Saved in:
Bibliographic Details
Main Authors: Hui Deng, Kejie Fu, Binglin Yu, Huimin Li, Rui Duan, Yichuan Deng, Jia-rui Lin
Format: Article
Language:English
Published: MDPI AG 2025-03-01
Series:Buildings
Subjects:
Online Access:https://www.mdpi.com/2075-5309/15/6/959
Tags: Add Tag
No Tags, Be the first to tag this record!