SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification

Hyperspectral image (HSI) classification is a crucial technique that assigns each pixel in an image to a specific land cover category by leveraging both spectral and spatial information. In recent years, HSI classification methods based on convolutional neural networks (CNNs) and Transformers have s...

Full description

Saved in:
Bibliographic Details
Main Authors: Chunyang Wang, Chao Zhan, Bibo Lu, Wei Yang, Yingjie Zhang, Gaige Wang, Zongze Zhao
Format: Article
Language:English
Published: MDPI AG 2024-11-01
Series:Remote Sensing
Subjects:
Online Access:https://www.mdpi.com/2072-4292/16/22/4202
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846152557499514880
author Chunyang Wang
Chao Zhan
Bibo Lu
Wei Yang
Yingjie Zhang
Gaige Wang
Zongze Zhao
author_facet Chunyang Wang
Chao Zhan
Bibo Lu
Wei Yang
Yingjie Zhang
Gaige Wang
Zongze Zhao
author_sort Chunyang Wang
collection DOAJ
description Hyperspectral image (HSI) classification is a crucial technique that assigns each pixel in an image to a specific land cover category by leveraging both spectral and spatial information. In recent years, HSI classification methods based on convolutional neural networks (CNNs) and Transformers have significantly improved performance due to their strong feature extraction capabilities. However, these improvements often come with increased model complexity, leading to higher computational costs. To address this, we propose a compact and efficient spectral-spatial feature extraction and attention-based neural network (SSFAN) for HSI classification. The SSFAN model consists of three core modules: the Parallel Spectral-Spatial Feature Extraction Block (PSSB), the Scan Block, and the Squeeze-and-Excitation MLP Block (SEMB). After preprocessing the HSI data, it is fed into the PSSB module, which contains two parallel streams, each comprising a 3D convolutional layer and a 2D convolutional layer. The 3D convolutional layer extracts spectral and spatial features from the input hyperspectral data, while the 2D convolutional layer further enhances the spatial feature representation. Next, the Scan Block module employs a layered scanning strategy to extract spatial information at different scales from the central pixel outward, enabling the model to capture both local and global spatial relationships. The SEMB module combines the Spectral-Spatial Recurrent Block (SSRB) and the MLP Block. The SSRB, with its adaptive weight assignment mechanism in the SToken Module, flexibly handles time steps and feature dimensions, performing deep spectral and spatial feature extraction through multiple state updates. Finally, the MLP Block processes the input features through a series of linear transformations, GELU activation functions, and Dropout layers, capturing complex patterns and relationships within the data, and concludes with an argmax layer for classification. Experimental results show that the proposed SSFAN model delivers superior classification performance, outperforming the second-best method by <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1.72</mn><mo>%</mo></mrow></semantics></math></inline-formula>, <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>5.19</mn><mo>%</mo></mrow></semantics></math></inline-formula>, and <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1.94</mn><mo>%</mo></mrow></semantics></math></inline-formula> in OA, AA, and Kappa coefficient, respectively, on the Indian Pines dataset. Additionally, it requires less training and testing time compared to other state-of-the-art deep learning methods.
format Article
id doaj-art-3ece5c8d861f4a2c93a461c738d16cc2
institution Kabale University
issn 2072-4292
language English
publishDate 2024-11-01
publisher MDPI AG
record_format Article
series Remote Sensing
spelling doaj-art-3ece5c8d861f4a2c93a461c738d16cc22024-11-26T18:19:58ZengMDPI AGRemote Sensing2072-42922024-11-011622420210.3390/rs16224202SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image ClassificationChunyang Wang0Chao Zhan1Bibo Lu2Wei Yang3Yingjie Zhang4Gaige Wang5Zongze Zhao6School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo 454000, ChinaSchool of Computer Science and Technology, Henan Polytechnic University, Jiaozuo 454000, ChinaSchool of Computer Science and Technology, Henan Polytechnic University, Jiaozuo 454000, ChinaCenter for Environmental Remote Sensing, Chiba University, Chiba 2638522, JapanState Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Luoyu Road No.129, Wuhan 430079, ChinaSchool of Computer Science and Technology, Ocean University of China, Qingdao 266100, ChinaSchool of Surveying and Land Information Engineering, Henan Polytechnic University, Jiaozuo 454000, ChinaHyperspectral image (HSI) classification is a crucial technique that assigns each pixel in an image to a specific land cover category by leveraging both spectral and spatial information. In recent years, HSI classification methods based on convolutional neural networks (CNNs) and Transformers have significantly improved performance due to their strong feature extraction capabilities. However, these improvements often come with increased model complexity, leading to higher computational costs. To address this, we propose a compact and efficient spectral-spatial feature extraction and attention-based neural network (SSFAN) for HSI classification. The SSFAN model consists of three core modules: the Parallel Spectral-Spatial Feature Extraction Block (PSSB), the Scan Block, and the Squeeze-and-Excitation MLP Block (SEMB). After preprocessing the HSI data, it is fed into the PSSB module, which contains two parallel streams, each comprising a 3D convolutional layer and a 2D convolutional layer. The 3D convolutional layer extracts spectral and spatial features from the input hyperspectral data, while the 2D convolutional layer further enhances the spatial feature representation. Next, the Scan Block module employs a layered scanning strategy to extract spatial information at different scales from the central pixel outward, enabling the model to capture both local and global spatial relationships. The SEMB module combines the Spectral-Spatial Recurrent Block (SSRB) and the MLP Block. The SSRB, with its adaptive weight assignment mechanism in the SToken Module, flexibly handles time steps and feature dimensions, performing deep spectral and spatial feature extraction through multiple state updates. Finally, the MLP Block processes the input features through a series of linear transformations, GELU activation functions, and Dropout layers, capturing complex patterns and relationships within the data, and concludes with an argmax layer for classification. Experimental results show that the proposed SSFAN model delivers superior classification performance, outperforming the second-best method by <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1.72</mn><mo>%</mo></mrow></semantics></math></inline-formula>, <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>5.19</mn><mo>%</mo></mrow></semantics></math></inline-formula>, and <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><mrow><mn>1.94</mn><mo>%</mo></mrow></semantics></math></inline-formula> in OA, AA, and Kappa coefficient, respectively, on the Indian Pines dataset. Additionally, it requires less training and testing time compared to other state-of-the-art deep learning methods.https://www.mdpi.com/2072-4292/16/22/4202deep learninghyperspectral image classificationattention mechanismsconvolutional neural networksspectral-spatial learningloss function
spellingShingle Chunyang Wang
Chao Zhan
Bibo Lu
Wei Yang
Yingjie Zhang
Gaige Wang
Zongze Zhao
SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification
Remote Sensing
deep learning
hyperspectral image classification
attention mechanisms
convolutional neural networks
spectral-spatial learning
loss function
title SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification
title_full SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification
title_fullStr SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification
title_full_unstemmed SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification
title_short SSFAN: A Compact and Efficient Spectral-Spatial Feature Extraction and Attention-Based Neural Network for Hyperspectral Image Classification
title_sort ssfan a compact and efficient spectral spatial feature extraction and attention based neural network for hyperspectral image classification
topic deep learning
hyperspectral image classification
attention mechanisms
convolutional neural networks
spectral-spatial learning
loss function
url https://www.mdpi.com/2072-4292/16/22/4202
work_keys_str_mv AT chunyangwang ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification
AT chaozhan ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification
AT bibolu ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification
AT weiyang ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification
AT yingjiezhang ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification
AT gaigewang ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification
AT zongzezhao ssfanacompactandefficientspectralspatialfeatureextractionandattentionbasedneuralnetworkforhyperspectralimageclassification