Transferring CLIP for visual grounding in remote sensing images

The Remote Sensing Visual Grounding (RSVG) task aims to localize specific objects in remote sensing (RS) images based on natural language queries and holds considerable potential for various applications. Existing approaches primarily rely on unimodal pre-trained encoders, leading to insufficient cross-...

Bibliographic Details
Main Authors: Linlin Liang, Yizhuo Quan, Chengbo Wang, Yuanfei Chang, Yanyou Qiao
Format: Article
Language: English
Published: Taylor & Francis Group 2025-08-01
Series: International Journal of Digital Earth
Online Access: https://www.tandfonline.com/doi/10.1080/17538947.2025.2512059