Spatial Position Reasoning of Image Entities Based on Location Words

The endeavor of spatial position reasoning effectively simulates the sensory and comprehension faculties of artificial intelligence, especially within the purview of multimodal modeling that fuses imagery with linguistic data. Recent progress in visual image–language models has marked significant ad...

Full description

Saved in:
Bibliographic Details
Main Authors: Xingguo Qin, Ya Zhou, Jun Li
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/12/24/3940
Tags: Add Tag
No Tags, Be the first to tag this record!