CGDINet: A Deep Learning-Based Salient Object Detection Algorithm

Salient object detection (SOD) is a key preprocessing step in computer vision, widely used in object tracking, action recognition, and image retrieval, among other fields. However, traditional SOD algorithms often face issues such as rough object boundaries, incomplete extraction of global image fea...

Full description

Saved in:

Bibliographic Details
Main Authors:	Chengyu Hu, Jianxin Guo, Hanfei Xie, Qing Zhu, Baoxi Yuan, Yujie Gao, Xiangyang Ma, Jialu Chen, Juan Tian
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Salient object detection deep learning InSPyReNet CAGM DAM
Online Access:	https://ieeexplore.ieee.org/document/10820319/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841542525308895232
author	Chengyu Hu Jianxin Guo Hanfei Xie Qing Zhu Baoxi Yuan Yujie Gao Xiangyang Ma Jialu Chen Juan Tian
author_facet	Chengyu Hu Jianxin Guo Hanfei Xie Qing Zhu Baoxi Yuan Yujie Gao Xiangyang Ma Jialu Chen Juan Tian
author_sort	Chengyu Hu
collection	DOAJ
description	Salient object detection (SOD) is a key preprocessing step in computer vision, widely used in object tracking, action recognition, and image retrieval, among other fields. However, traditional SOD algorithms often face issues such as rough object boundaries, incomplete extraction of global image features, and insufficient attention to key areas. To address these problems, an improved significance object detection network—CGDINet (Coordinate Attention-Group Consensus Aggregation Module-Depth Auxiliary Module-Inverse Saliency Pyramid Reconstruction Network)—is proposed. CGDINet introduces the Group Consensus Aggregation Module (GCAM) embedded with the Coordinate Attention (CA) mechanism, named CAGM (Coordinate Attention-Group Consensus Aggregation Module), to enhance global feature capture capabilities and improve the processing of directional features. Additionally, the Depth Auxiliary Module (DAM) is incorporated to enhance the focus on important regions. Experiments were conducted on five public datasets (DUT-OMRON, ECSSD, PASCAL-S, DUTS-TE, and HKU-IS). The results show that CGDINet outperforms other mainstream significance object detection models in evaluation metrics such as <inline-formula> <tex-math notation="LaTeX">${\mathrm {maxF}}_{\mathrm {\beta }}$ </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">$\mathrm {S}_{\mathrm {\alpha }}$ </tex-math></inline-formula>, and MAE, with almost no increase in computational cost (FLOPs) and parameters. The experimental results validate that CGDINet can effectively address the issues of incomplete global feature extraction and insufficient attention to key areas, thereby significantly enhancing the performance of significance object detection.
format	Article
id	doaj-art-17c677a409494e6f8d98f69333cb478a
institution	Kabale University
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-17c677a409494e6f8d98f69333cb478a2025-01-14T00:02:28ZengIEEEIEEE Access2169-35362025-01-01134697472310.1109/ACCESS.2024.352530310820319CGDINet: A Deep Learning-Based Salient Object Detection AlgorithmChengyu Hu0https://orcid.org/0009-0004-6838-2428Jianxin Guo1https://orcid.org/0000-0003-0490-7649Hanfei Xie2https://orcid.org/0009-0000-7252-4039Qing Zhu3Baoxi Yuan4https://orcid.org/0000-0002-5220-879XYujie Gao5Xiangyang Ma6https://orcid.org/0009-0006-2767-6424Jialu Chen7Juan Tian8https://orcid.org/0009-0004-4478-1758School of Electronic Information, Xijing University, Xi’an, ChinaSchool of Electronic Information, Xijing University, Xi’an, ChinaSchool of Electronic Information, Xijing University, Xi’an, ChinaBeijing Hengyue Intelligent Information Technology Company Ltd., Xi’an, ChinaSchool of Electronic Information, Xijing University, Xi’an, ChinaSchool of Electronic Information, Xijing University, Xi’an, ChinaSchool of Electronic Information, Xijing University, Xi’an, ChinaSchool of Electronic Information, Xijing University, Xi’an, ChinaSchool of Humanities and Education, Xijing University, Xi’an, ChinaSalient object detection (SOD) is a key preprocessing step in computer vision, widely used in object tracking, action recognition, and image retrieval, among other fields. However, traditional SOD algorithms often face issues such as rough object boundaries, incomplete extraction of global image features, and insufficient attention to key areas. To address these problems, an improved significance object detection network—CGDINet (Coordinate Attention-Group Consensus Aggregation Module-Depth Auxiliary Module-Inverse Saliency Pyramid Reconstruction Network)—is proposed. CGDINet introduces the Group Consensus Aggregation Module (GCAM) embedded with the Coordinate Attention (CA) mechanism, named CAGM (Coordinate Attention-Group Consensus Aggregation Module), to enhance global feature capture capabilities and improve the processing of directional features. Additionally, the Depth Auxiliary Module (DAM) is incorporated to enhance the focus on important regions. Experiments were conducted on five public datasets (DUT-OMRON, ECSSD, PASCAL-S, DUTS-TE, and HKU-IS). The results show that CGDINet outperforms other mainstream significance object detection models in evaluation metrics such as <inline-formula> <tex-math notation="LaTeX">${\mathrm {maxF}}_{\mathrm {\beta }}$ </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">$\mathrm {S}_{\mathrm {\alpha }}$ </tex-math></inline-formula>, and MAE, with almost no increase in computational cost (FLOPs) and parameters. The experimental results validate that CGDINet can effectively address the issues of incomplete global feature extraction and insufficient attention to key areas, thereby significantly enhancing the performance of significance object detection.https://ieeexplore.ieee.org/document/10820319/Salient object detectiondeep learningInSPyReNetCAGMDAM
spellingShingle	Chengyu Hu Jianxin Guo Hanfei Xie Qing Zhu Baoxi Yuan Yujie Gao Xiangyang Ma Jialu Chen Juan Tian CGDINet: A Deep Learning-Based Salient Object Detection Algorithm IEEE Access Salient object detection deep learning InSPyReNet CAGM DAM
title	CGDINet: A Deep Learning-Based Salient Object Detection Algorithm
title_full	CGDINet: A Deep Learning-Based Salient Object Detection Algorithm
title_fullStr	CGDINet: A Deep Learning-Based Salient Object Detection Algorithm
title_full_unstemmed	CGDINet: A Deep Learning-Based Salient Object Detection Algorithm
title_short	CGDINet: A Deep Learning-Based Salient Object Detection Algorithm
title_sort	cgdinet a deep learning based salient object detection algorithm
topic	Salient object detection deep learning InSPyReNet CAGM DAM
url	https://ieeexplore.ieee.org/document/10820319/
work_keys_str_mv	AT chengyuhu cgdinetadeeplearningbasedsalientobjectdetectionalgorithm AT jianxinguo cgdinetadeeplearningbasedsalientobjectdetectionalgorithm AT hanfeixie cgdinetadeeplearningbasedsalientobjectdetectionalgorithm AT qingzhu cgdinetadeeplearningbasedsalientobjectdetectionalgorithm AT baoxiyuan cgdinetadeeplearningbasedsalientobjectdetectionalgorithm AT yujiegao cgdinetadeeplearningbasedsalientobjectdetectionalgorithm AT xiangyangma cgdinetadeeplearningbasedsalientobjectdetectionalgorithm AT jialuchen cgdinetadeeplearningbasedsalientobjectdetectionalgorithm AT juantian cgdinetadeeplearningbasedsalientobjectdetectionalgorithm

CGDINet: A Deep Learning-Based Salient Object Detection Algorithm

Similar Items