Multimodal Contrastive Learning for Remote Sensing Image Feature Extraction Based on Relaxed Positive Samples
Traditional multimodal contrastive learning brings text and its corresponding image closer together as a positive pair, where the text typically consists of fixed sentence structures or specific descriptive statements, and the image features are generally global features (with some fine-grained work...
Saved in:
| Main Authors: | Zhenshi Zhang, Qiujun Li, Wenxuan Jing, Guangjun He, Lili Zhu, Shijuan Gao |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2024-12-01
|
| Series: | Sensors |
| Subjects: | |
| Online Access: | https://www.mdpi.com/1424-8220/24/23/7719 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Contrastive learning based remote sensing text-to-image generation for few-shot remote sensing image captioning
by: Haonan Zhou, et al.
Published: (2025-08-01) -
A facial structure sampling contrastive learning method for sketch facial synthesis
by: Kangning Du, et al.
Published: (2025-05-01) -
ModFus-PD: synergizing cross-modal attention and contrastive learning for enhanced multimodal diagnosis of Parkinson’s disease
by: Xiangze Teng, et al.
Published: (2025-07-01) -
Relaxation phenomena in polyvinyl alcohol
by: А. А. Kukhtenkova, et al.
Published: (2015-02-01) -
American graphic Novel: Multimodality and Identity
by: M. A. Dubovitskaya
Published: (2022-04-01)