Clustering swap prediction for image-text pre-training

Abstract It is essential to delve into the strategy of multimodal model pre-training, which is an obvious impact on downstream tasks. Currently, clustering learning has achieved noteworthy benefits in multiple methods. However, due to the availability of open image-text pairs, it is challenging for...

Full description

Saved in:
Bibliographic Details
Main Authors: Sun Fayou, Hea Choon Ngo, Yong Wee Sek, Zuqiang Meng
Format: Article
Language:English
Published: Nature Portfolio 2024-05-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-024-60832-x
Tags: Add Tag
No Tags, Be the first to tag this record!