Combining Region-Guided Attention and Attribute Prediction for Thangka Image Captioning Method
To enhance the understanding of the core regions in Thangka images and improve the richness of generated content during decoding, we propose a Thangka image captioning method based on Region-Guided Feature Enhancement and Attribute Prediction (RGFEAP). The image feature enhancement encoder, guided b...
Main Authors: | Fujun Zhang, Wendong Kang, Wenjin Hu |
---|---|
Format: | Article |
Language: | English |
Published: | IEEE, 2025-01-01 |
Series: | IEEE Access |
Online Access: | https://ieeexplore.ieee.org/document/10833628/ |
Similar Items
- KE-RSIC: Remote Sensing Image Captioning Based on Knowledge Embedding
  by: Kangda Cheng, et al.
  Published: (2025-01-01)
- Enhanced CLIP-GPT Framework for Cross-Lingual Remote Sensing Image Captioning
  by: Rui Song, et al.
  Published: (2025-01-01)
- Offline visual aid system for the blind based on image captioning
  by: Yue CHEN, et al.
  Published: (2022-01-01)
- Undergraduate students’ perceptions toward writing Instagram captions in English
  by: Nahda Nafisah Hutasuhut, et al.
  Published: (2024-05-01)
- Incidental vocabulary recognition effects of subtitled, captioned and reverse subtitled audiovisual input
  by: Jana van der Kolk, et al.
  Published: (2024-07-01)