MDIGCNet: Multidirectional Information-Guided Contextual Network for Infrared Small Target Detection

Infrared small target detection (ISTD) technology has extensive applications in the military field. Due to the quality of imaging equipment and environmental interference, infrared small target images lack texture and structural information. Deep learning-based algorithms have achieved superior accu...

Full description

Saved in:
Bibliographic Details
Main Authors: Luping Zhang, Junhai Luo, Yian Huang, Fengyi Wu, Xingye Cui, Zhenming Peng
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10770559/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Infrared small target detection (ISTD) technology has extensive applications in the military field. Due to the quality of imaging equipment and environmental interference, infrared small target images lack texture and structural information. Deep learning-based algorithms have achieved superior accuracy in this field compared to traditional algorithms; however, these methods are often not designed with domain knowledge integration. In this article, we propose a multidirectional information-guided contextual network (MDIGCNet) for ISTD. The primary structure of this network adopts the U-Net architecture. To address the issue of lacking texture and structural information in the target images, we employ an integrated differential convolution (IDConv) module to extract richer image features during both the encoding and decoding stages. Skip connections in the network utilize a multidirectional gradient information extraction block (MGIEB) to obtain gradient features of infrared small targets. Our domain-inspired multidirectional Gaussian differential convolution (MGDC) module is employed to extract features of Gaussian-distributed small targets, enhancing the distinction between targets and backgrounds. Additionally, we designed a local-global feature fusion (LGFF) module incorporating an attention mechanism to merge shallow and deep features, thereby improving the efficiency of feature utilization within the model. Furthermore, since both IDConv and MGDC are parallel multiconvolutional kernel structures, reparameterization techniques are used to avoid excessive parameters and computational load. Experimental results on public datasets NUDT-SIRST, IRSTD-1k, and SIRST-Aug demonstrate that our algorithm outperforms other state-of-the-art methods in detection performance.
ISSN:1939-1404
2151-1535