Scene categorization by Hessian-regularized active perceptual feature selection

Abstract Decoding the semantic categories of complex sceneries is fundamental to numerous artificial intelligence (AI) infrastructures. This work presents an advanced selection of multi-channel perceptual visual features for recognizing scenic images with elaborate spatial structures, focusing on de...

Full description

Saved in:
Bibliographic Details
Main Authors: Junwu Zhou, Fuji Ren
Format: Article
Language:English
Published: Nature Portfolio 2025-01-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-024-84181-x
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841559716295081984
author Junwu Zhou
Fuji Ren
author_facet Junwu Zhou
Fuji Ren
author_sort Junwu Zhou
collection DOAJ
description Abstract Decoding the semantic categories of complex sceneries is fundamental to numerous artificial intelligence (AI) infrastructures. This work presents an advanced selection of multi-channel perceptual visual features for recognizing scenic images with elaborate spatial structures, focusing on developing a deep hierarchical model dedicated to learning human gaze behavior. Utilizing the BING objectness measure, we efficiently localize objects or their details across varying scales within scenes. To emulate humans observing semantically or visually significant areas within scenes, we propose a robust deep active learning (RDAL) strategy. This strategy progressively generates gaze shifting paths (GSP) and calculates deep GSP representations within a unified architecture. A notable advantage of RDAL is the robustness to label noise, which is implemented by a carefully-designed sparse penalty term. This mechanism ensures that irrelevant or misleading deep GSP features are intelligently discarded. Afterward, a novel Hessian-regularized Feature Selector (HFS) is proposed to select high-quality features from the deep GSP features, wherein (i) the spatial composition of scenic patches can be optimally maintained, and (ii) a linear SVM is learned simultaneously. Empirical evaluations across six standard scenic datasets demonstrated our method’s superior performance, highlighting its exceptional ability to differentiate various sophisticated scenery categories.
format Article
id doaj-art-1d9b9adcccc74b088623da4c97592036
institution Kabale University
issn 2045-2322
language English
publishDate 2025-01-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-1d9b9adcccc74b088623da4c975920362025-01-05T12:15:06ZengNature PortfolioScientific Reports2045-23222025-01-0115111410.1038/s41598-024-84181-xScene categorization by Hessian-regularized active perceptual feature selectionJunwu Zhou0Fuji Ren1School of Higher Vocational and Technical College, Shanghai Dianji UniversityCollege of Computer Sciences, Anhui UniversityAbstract Decoding the semantic categories of complex sceneries is fundamental to numerous artificial intelligence (AI) infrastructures. This work presents an advanced selection of multi-channel perceptual visual features for recognizing scenic images with elaborate spatial structures, focusing on developing a deep hierarchical model dedicated to learning human gaze behavior. Utilizing the BING objectness measure, we efficiently localize objects or their details across varying scales within scenes. To emulate humans observing semantically or visually significant areas within scenes, we propose a robust deep active learning (RDAL) strategy. This strategy progressively generates gaze shifting paths (GSP) and calculates deep GSP representations within a unified architecture. A notable advantage of RDAL is the robustness to label noise, which is implemented by a carefully-designed sparse penalty term. This mechanism ensures that irrelevant or misleading deep GSP features are intelligently discarded. Afterward, a novel Hessian-regularized Feature Selector (HFS) is proposed to select high-quality features from the deep GSP features, wherein (i) the spatial composition of scenic patches can be optimally maintained, and (ii) a linear SVM is learned simultaneously. Empirical evaluations across six standard scenic datasets demonstrated our method’s superior performance, highlighting its exceptional ability to differentiate various sophisticated scenery categories.https://doi.org/10.1038/s41598-024-84181-x
spellingShingle Junwu Zhou
Fuji Ren
Scene categorization by Hessian-regularized active perceptual feature selection
Scientific Reports
title Scene categorization by Hessian-regularized active perceptual feature selection
title_full Scene categorization by Hessian-regularized active perceptual feature selection
title_fullStr Scene categorization by Hessian-regularized active perceptual feature selection
title_full_unstemmed Scene categorization by Hessian-regularized active perceptual feature selection
title_short Scene categorization by Hessian-regularized active perceptual feature selection
title_sort scene categorization by hessian regularized active perceptual feature selection
url https://doi.org/10.1038/s41598-024-84181-x
work_keys_str_mv AT junwuzhou scenecategorizationbyhessianregularizedactiveperceptualfeatureselection
AT fujiren scenecategorizationbyhessianregularizedactiveperceptualfeatureselection