Transferring CLIP for visual grounding in remote sensing images
Remote Sensing Visual Grounding (RSVG) task aims to localize specific objects in remote sensing (RS) images based on natural language queries and holds considerable potential for various applications. Existing approaches primarily rely on unimodal pre-trained encoders, leading to insufficient cross-...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Taylor & Francis Group
2025-08-01
|
| Series: | International Journal of Digital Earth |
| Subjects: | |
| Online Access: | https://www.tandfonline.com/doi/10.1080/17538947.2025.2512059 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|