SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification

Hyperspectral image (HSI) classification is a crucial technique that assigns each pixel in an image to a specific land cover category by leveraging both spectral and spatial information. In recent years, HSI classification methods based on convolutional neural networks (CNNs) and Transformers have s...

Full description

Saved in:

Bibliographic Details
Main Authors:	Chunyang Wang, Chao Zhan, Bibo Lu, Wei Yang, Yingjie Zhang, Gaige Wang, Zongze Zhao
Format:	Article
Language:	English
Published:	MDPI AG 2024-11-01
Series:	Remote Sensing
Subjects:	deep learning hyperspectral image classification attention mechanisms convolutional neural networks spectral-spatial learning loss function
Online Access:	https://www.mdpi.com/2072-4292/16/22/4202
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1846152557499514880
author	Chunyang Wang Chao Zhan Bibo Lu Wei Yang Yingjie Zhang Gaige Wang Zongze Zhao
author_facet	Chunyang Wang Chao Zhan Bibo Lu Wei Yang Yingjie Zhang Gaige Wang Zongze Zhao
author_sort	Chunyang Wang
collection	DOAJ
description	Hyperspectral image (HSI) classification is a crucial technique that assigns each pixel in an image to a specific land cover category by leveraging both spectral and spatial information. In recent years, HSI classification methods based on convolutional neural networks (CNNs) and Transformers have significantly improved performance due to their strong feature extraction capabilities. However, these improvements often come with increased model complexity, leading to higher computational costs. To address this, we propose a compact and efficient spectral-spatial feature extraction and attention-based neural network (SSFAN) for HSI classification. The SSFAN model consists of three core modules: the Parallel Spectral-Spatial Feature Extraction Block (PSSB), the Scan Block, and the Squeeze-and-Excitation MLP Block (SEMB). After preprocessing the HSI data, it is fed into the PSSB module, which contains two parallel streams, each comprising a 3D convolutional layer and a 2D convolutional layer. The 3D convolutional layer extracts spectral and spatial features from the input hyperspectral data, while the 2D convolutional layer further enhances the spatial feature representation. Next, the Scan Block module employs a layered scanning strategy to extract spatial information at different scales from the central pixel outward, enabling the model to capture both local and global spatial relationships. The SEMB module combines the Spectral-Spatial Recurrent Block (SSRB) and the MLP Block. The SSRB, with its adaptive weight assignment mechanism in the SToken Module, flexibly handles time steps and feature dimensions, performing deep spectral and spatial feature extraction through multiple state updates. Finally, the MLP Block processes the input features through a series of linear transformations, GELU activation functions, and Dropout layers, capturing complex patterns and relationships within the data, and concludes with an argmax layer for classification. Experimental results show that the proposed SSFAN model delivers superior classification performance, outperforming the second-best method by <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1.72</mn><mo>%</mo></mrow></semantics></math></inline-formula>, <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>5.19</mn><mo>%</mo></mrow></semantics></math></inline-formula>, and <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1.94</mn><mo>%</mo></mrow></semantics></math></inline-formula> in OA, AA, and Kappa coefficient, respectively, on the Indian Pines dataset. Additionally, it requires less training and testing time compared to other state-of-the-art deep learning methods.
format	Article
id	doaj-art-3ece5c8d861f4a2c93a461c738d16cc2
institution	Kabale University
issn	2072-4292
language	English
publishDate	2024-11-01
publisher	MDPI AG
record_format	Article
series	Remote Sensing
spelling	doaj-art-3ece5c8d861f4a2c93a461c738d16cc22024-11-26T18:19:58ZengMDPI AGRemote Sensing2072-42922024-11-011622420210.3390/rs16224202SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image ClassificationChunyang Wang0Chao Zhan1Bibo Lu2Wei Yang3Yingjie Zhang4Gaige Wang5Zongze Zhao6School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo 454000, ChinaSchool of Computer Science and Technology, Henan Polytechnic University, Jiaozuo 454000, ChinaSchool of Computer Science and Technology, Henan Polytechnic University, Jiaozuo 454000, ChinaCenter for Environmental Remote Sensing, Chiba University, Chiba 2638522, JapanState Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Luoyu Road No.129, Wuhan 430079, ChinaSchool of Computer Science and Technology, Ocean University of China, Qingdao 266100, ChinaSchool of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, ChinaHyperspectral image (HSI) classification is a crucial technique that assigns each pixel in an image to a specific land cover category by leveraging both spectral and spatial information. In recent years, HSI classification methods based on convolutional neural networks (CNNs) and Transformers have significantly improved performance due to their strong feature extraction capabilities. However, these improvements often come with increased model complexity, leading to higher computational costs. To address this, we propose a compact and efficient spectral-spatial feature extraction and attention-based neural network (SSFAN) for HSI classification. The SSFAN model consists of three core modules: the Parallel Spectral-Spatial Feature Extraction Block (PSSB), the Scan Block, and the Squeeze-and-Excitation MLP Block (SEMB). After preprocessing the HSI data, it is fed into the PSSB module, which contains two parallel streams, each comprising a 3D convolutional layer and a 2D convolutional layer. The 3D convolutional layer extracts spectral and spatial features from the input hyperspectral data, while the 2D convolutional layer further enhances the spatial feature representation. Next, the Scan Block module employs a layered scanning strategy to extract spatial information at different scales from the central pixel outward, enabling the model to capture both local and global spatial relationships. The SEMB module combines the Spectral-Spatial Recurrent Block (SSRB) and the MLP Block. The SSRB, with its adaptive weight assignment mechanism in the SToken Module, flexibly handles time steps and feature dimensions, performing deep spectral and spatial feature extraction through multiple state updates. Finally, the MLP Block processes the input features through a series of linear transformations, GELU activation functions, and Dropout layers, capturing complex patterns and relationships within the data, and concludes with an argmax layer for classification. Experimental results show that the proposed SSFAN model delivers superior classification performance, outperforming the second-best method by <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1.72</mn><mo>%</mo></mrow></semantics></math></inline-formula>, <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>5.19</mn><mo>%</mo></mrow></semantics></math></inline-formula>, and <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1.94</mn><mo>%</mo></mrow></semantics></math></inline-formula> in OA, AA, and Kappa coefficient, respectively, on the Indian Pines dataset. Additionally, it requires less training and testing time compared to other state-of-the-art deep learning methods.https://www.mdpi.com/2072-4292/16/22/4202deep learninghyperspectral image classificationattention mechanismsconvolutional neural networksspectral-spatial learningloss function
spellingShingle	Chunyang Wang Chao Zhan Bibo Lu Wei Yang Yingjie Zhang Gaige Wang Zongze Zhao SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification Remote Sensing deep learning hyperspectral image classification attention mechanisms convolutional neural networks spectral-spatial learning loss function
title	SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification
title_full	SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification
title_fullStr	SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification
title_full_unstemmed	SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification
title_short	SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification
title_sort	ssfan a compact and efficient spectral spatial feature extraction and attention based neural network for hyperspectral image classification
topic	deep learning hyperspectral image classification attention mechanisms convolutional neural networks spectral-spatial learning loss function
url	https://www.mdpi.com/2072-4292/16/22/4202
work_keys_str_mv	AT chunyangwang ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification AT chaozhan ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification AT bibolu ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification AT weiyang ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification AT yingjiezhang ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification AT gaigewang ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification AT zongzezhao ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification

SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification

Similar Items