Tag‐inferring and tag‐guided Transformer for image captioning

Abstract: Image captioning is an important task for understanding images. Recently, many studies have used tags to build alignments between image information and language information. However, existing methods ignore the problem that simple semantic tags have difficulty expressing the detailed semant...

Bibliographic Details
Main Authors:	Yaohua Yi, Yinkai Liang, Dezhu Kong, Ziwei Tang, Jibing Peng
Format:	Article
Language:	English
Published:	Wiley 2024-09-01
Series:	IET Computer Vision
Subjects:	computer vision; image recognition; learning (artificial intelligence)
Online Access:	https://doi.org/10.1049/cvi2.12280
