Tag‐inferring and tag‐guided Transformer for image captioning

Abstract: Image captioning is an important task for understanding images. Recently, many studies have used tags to build alignments between image information and language information. However, existing methods ignore the problem that simple semantic tags have difficulty expressing the detailed semant...

Bibliographic Details
Main Authors: Yaohua Yi, Yinkai Liang, Dezhu Kong, Ziwei Tang, Jibing Peng
Format: Article
Language: English
Published: Wiley 2024-09-01
Series: IET Computer Vision
Online Access: https://doi.org/10.1049/cvi2.12280