Scene categorization by Hessian-regularized active perceptual feature selection

Abstract Decoding the semantic categories of complex sceneries is fundamental to numerous artificial intelligence (AI) infrastructures. This work presents an advanced selection of multi-channel perceptual visual features for recognizing scenic images with elaborate spatial structures, focusing on de...

Full description

Saved in:

Bibliographic Details
Main Authors:	Junwu Zhou, Fuji Ren
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-01-01
Series:	Scientific Reports
Online Access:	https://doi.org/10.1038/s41598-024-84181-x
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1841559716295081984
author	Junwu Zhou Fuji Ren
author_facet	Junwu Zhou Fuji Ren
author_sort	Junwu Zhou
collection	DOAJ
description	Abstract Decoding the semantic categories of complex sceneries is fundamental to numerous artificial intelligence (AI) infrastructures. This work presents an advanced selection of multi-channel perceptual visual features for recognizing scenic images with elaborate spatial structures, focusing on developing a deep hierarchical model dedicated to learning human gaze behavior. Utilizing the BING objectness measure, we efficiently localize objects or their details across varying scales within scenes. To emulate humans observing semantically or visually significant areas within scenes, we propose a robust deep active learning (RDAL) strategy. This strategy progressively generates gaze shifting paths (GSP) and calculates deep GSP representations within a unified architecture. A notable advantage of RDAL is the robustness to label noise, which is implemented by a carefully-designed sparse penalty term. This mechanism ensures that irrelevant or misleading deep GSP features are intelligently discarded. Afterward, a novel Hessian-regularized Feature Selector (HFS) is proposed to select high-quality features from the deep GSP features, wherein (i) the spatial composition of scenic patches can be optimally maintained, and (ii) a linear SVM is learned simultaneously. Empirical evaluations across six standard scenic datasets demonstrated our method’s superior performance, highlighting its exceptional ability to differentiate various sophisticated scenery categories.
format	Article
id	doaj-art-1d9b9adcccc74b088623da4c97592036
institution	Kabale University
issn	2045-2322
language	English
publishDate	2025-01-01
publisher	Nature Portfolio
record_format	Article
series	Scientific Reports
spelling	doaj-art-1d9b9adcccc74b088623da4c975920362025-01-05T12:15:06ZengNature PortfolioScientific Reports2045-23222025-01-0115111410.1038/s41598-024-84181-xScene categorization by Hessian-regularized active perceptual feature selectionJunwu Zhou0Fuji Ren1School of Higher Vocational and Technical College, Shanghai Dianji UniversityCollege of Computer Sciences, Anhui UniversityAbstract Decoding the semantic categories of complex sceneries is fundamental to numerous artificial intelligence (AI) infrastructures. This work presents an advanced selection of multi-channel perceptual visual features for recognizing scenic images with elaborate spatial structures, focusing on developing a deep hierarchical model dedicated to learning human gaze behavior. Utilizing the BING objectness measure, we efficiently localize objects or their details across varying scales within scenes. To emulate humans observing semantically or visually significant areas within scenes, we propose a robust deep active learning (RDAL) strategy. This strategy progressively generates gaze shifting paths (GSP) and calculates deep GSP representations within a unified architecture. A notable advantage of RDAL is the robustness to label noise, which is implemented by a carefully-designed sparse penalty term. This mechanism ensures that irrelevant or misleading deep GSP features are intelligently discarded. Afterward, a novel Hessian-regularized Feature Selector (HFS) is proposed to select high-quality features from the deep GSP features, wherein (i) the spatial composition of scenic patches can be optimally maintained, and (ii) a linear SVM is learned simultaneously. Empirical evaluations across six standard scenic datasets demonstrated our method’s superior performance, highlighting its exceptional ability to differentiate various sophisticated scenery categories.https://doi.org/10.1038/s41598-024-84181-x
spellingShingle	Junwu Zhou Fuji Ren Scene categorization by Hessian-regularized active perceptual feature selection Scientific Reports
title	Scene categorization by Hessian-regularized active perceptual feature selection
title_full	Scene categorization by Hessian-regularized active perceptual feature selection
title_fullStr	Scene categorization by Hessian-regularized active perceptual feature selection
title_full_unstemmed	Scene categorization by Hessian-regularized active perceptual feature selection
title_short	Scene categorization by Hessian-regularized active perceptual feature selection
title_sort	scene categorization by hessian regularized active perceptual feature selection
url	https://doi.org/10.1038/s41598-024-84181-x
work_keys_str_mv	AT junwuzhou scenecategorizationbyhessianregularizedactiveperceptualfeatureselection AT fujiren scenecategorizationbyhessianregularizedactiveperceptualfeatureselection

Scene categorization by Hessian-regularized active perceptual feature selection

Similar Items