IDTransformer: Infrared image denoising method based on convolutional transposed self-attention

Image denoising is a quintessential challenge in computer vision, intending to produce high-quality, clean images from degraded, noisy counterparts. Infrared imaging holds a pivotal position across many research domains, attributed to its inherent benefits such as concealment and noninvasiveness. De...

Full description

Saved in:

Bibliographic Details
Main Authors:	Zhengwei Shen, Feiwei Qin, Ruiquan Ge, Changmiao Wang, Kai Zhang, Jie Huang
Format:	Article
Language:	English
Published:	Elsevier 2025-01-01
Series:	Alexandria Engineering Journal
Subjects:	Image denoising Infrared image Self-attention Feature fusion
Online Access:	http://www.sciencedirect.com/science/article/pii/S1110016824011256
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841553823778209792
author	Zhengwei Shen Feiwei Qin Ruiquan Ge Changmiao Wang Kai Zhang Jie Huang
author_facet	Zhengwei Shen Feiwei Qin Ruiquan Ge Changmiao Wang Kai Zhang Jie Huang
author_sort	Zhengwei Shen
collection	DOAJ
description	Image denoising is a quintessential challenge in computer vision, intending to produce high-quality, clean images from degraded, noisy counterparts. Infrared imaging holds a pivotal position across many research domains, attributed to its inherent benefits such as concealment and noninvasiveness. Despite these advantages, infrared images are often plagued by hardware-related imperfections resulting in poor contrast, diminished quality, and noise contamination. Extracting and characterizing features amidst these unique feature patterns in infrared imagery are taxing tasks. To surmount these obstacles, we introduce the Infrared image Denoising Transformer (IDTransformer), encapsulated in a symmetrical encoder–decoder architecture. Central to our approach is the Convolutional Transposed Self-Attention Block (CTSAB), which is ingeniously conceived to capture long-range dependencies via channel-wise self-attention, while simultaneously encapsulating local context through depth-wise convolution. In addition, we refine the conventional feed-forward network by integrating Convolutional Gated Linear Units (CGLU) and deploy the Channel Coordinate Attention Block (CCAB) during the feature fusion phase to dynamically apportion weights across the feature map, thereby facilitating a more nuanced representation of pattern features endemic to infrared images. Through rigorous experimentation, we establish that our IDTransformer attains superior visual enhancement across five infrared image datasets, compared with the state-of-the-art methods. The source codes are available at https://github.com/szw811/IDTransformer.
format	Article
id	doaj-art-745c450d4d5545248610665b18cbc389
institution	Kabale University
issn	1110-0168
language	English
publishDate	2025-01-01
publisher	Elsevier
record_format	Article
series	Alexandria Engineering Journal
spelling	doaj-art-745c450d4d5545248610665b18cbc3892025-01-09T06:13:18ZengElsevierAlexandria Engineering Journal1110-01682025-01-01110310321IDTransformer: Infrared image denoising method based on convolutional transposed self-attentionZhengwei Shen0Feiwei Qin1Ruiquan Ge2Changmiao Wang3Kai Zhang4Jie Huang5School of Computer Science and Technology, Hangzhou Dianzi University, ChinaSchool of Computer Science and Technology, Hangzhou Dianzi University, China; Corresponding author.School of Computer Science and Technology, Hangzhou Dianzi University, ChinaShenzhen Research Institute of Big Data, ChinaComputer Vision Lab, ETH Zurich, Zurich, SwitzerlandSchool of Information and Electronic Engineering, Zhejiang University of Science and Technology, ChinaImage denoising is a quintessential challenge in computer vision, intending to produce high-quality, clean images from degraded, noisy counterparts. Infrared imaging holds a pivotal position across many research domains, attributed to its inherent benefits such as concealment and noninvasiveness. Despite these advantages, infrared images are often plagued by hardware-related imperfections resulting in poor contrast, diminished quality, and noise contamination. Extracting and characterizing features amidst these unique feature patterns in infrared imagery are taxing tasks. To surmount these obstacles, we introduce the Infrared image Denoising Transformer (IDTransformer), encapsulated in a symmetrical encoder–decoder architecture. Central to our approach is the Convolutional Transposed Self-Attention Block (CTSAB), which is ingeniously conceived to capture long-range dependencies via channel-wise self-attention, while simultaneously encapsulating local context through depth-wise convolution. In addition, we refine the conventional feed-forward network by integrating Convolutional Gated Linear Units (CGLU) and deploy the Channel Coordinate Attention Block (CCAB) during the feature fusion phase to dynamically apportion weights across the feature map, thereby facilitating a more nuanced representation of pattern features endemic to infrared images. Through rigorous experimentation, we establish that our IDTransformer attains superior visual enhancement across five infrared image datasets, compared with the state-of-the-art methods. The source codes are available at https://github.com/szw811/IDTransformer.http://www.sciencedirect.com/science/article/pii/S1110016824011256Image denoisingInfrared imageSelf-attentionFeature fusion
spellingShingle	Zhengwei Shen Feiwei Qin Ruiquan Ge Changmiao Wang Kai Zhang Jie Huang IDTransformer: Infrared image denoising method based on convolutional transposed self-attention Alexandria Engineering Journal Image denoising Infrared image Self-attention Feature fusion
title	IDTransformer: Infrared image denoising method based on convolutional transposed self-attention
title_full	IDTransformer: Infrared image denoising method based on convolutional transposed self-attention
title_fullStr	IDTransformer: Infrared image denoising method based on convolutional transposed self-attention
title_full_unstemmed	IDTransformer: Infrared image denoising method based on convolutional transposed self-attention
title_short	IDTransformer: Infrared image denoising method based on convolutional transposed self-attention
title_sort	idtransformer infrared image denoising method based on convolutional transposed self attention
topic	Image denoising Infrared image Self-attention Feature fusion
url	http://www.sciencedirect.com/science/article/pii/S1110016824011256
work_keys_str_mv	AT zhengweishen idtransformerinfraredimagedenoisingmethodbasedonconvolutionaltransposedselfattention AT feiweiqin idtransformerinfraredimagedenoisingmethodbasedonconvolutionaltransposedselfattention AT ruiquange idtransformerinfraredimagedenoisingmethodbasedonconvolutionaltransposedselfattention AT changmiaowang idtransformerinfraredimagedenoisingmethodbasedonconvolutionaltransposedselfattention AT kaizhang idtransformerinfraredimagedenoisingmethodbasedonconvolutionaltransposedselfattention AT jiehuang idtransformerinfraredimagedenoisingmethodbasedonconvolutionaltransposedselfattention

IDTransformer: Infrared image denoising method based on convolutional transposed self-attention

Similar Items