Hyperspectral image classification based on manifold spectral dimensionality reduction and deep learning

师芸; 马东晖; 吕杰; 李杰; 史经俭

doi:10.11975/j.issn.1002-6819.2020.06.018

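Abstract: The Hughes phenomenon in hyperspectral images is a major factor limiting hyperspectral image classification accuracy. To improve classification accuracy, a hyperspectral image classification algorithm based on manifold spectral features is proposed. First, the t-distributed stochastic neighbor embedding (t-SNE) algorithm is used to reduce the dimensionality of the hyperspectral image. Second, the dimension-reduced hyperspectral data are used as the input layer, and a convolutional neural network extracts deep spatial features. Finally, the extracted deep spatial-spectral features are mapped from the hidden-layer feature space to the sample label space and classified. The results show that, compared with other algorithms, the proposed algorithm achieved the highest overall accuracy and Kappa coefficient: the overall accuracy on the three datasets was 99.05%, 99.43%, and 98.90%, and the Kappa coefficients were 98.78%, 98.97%, and 98.34%, respectively. The algorithm significantly improved hyperspectral image classification accuracy, reduced classification time, and effectively overcame the tendency of traditional dimensionality reduction methods to overlook local features. Combining manifold-learning dimensionality reduction with deep-learning classification offers an approach for hyperspectral remote sensing image classification and land-use research.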
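
The two metrics reported above, overall accuracy and the Kappa coefficient, are both derived from a confusion matrix. The following is a minimal sketch of the standard definitions (Cohen's kappa), not the authors' evaluation code; the function name and the toy two-class matrix are illustrative assumptions.

```python
def overall_accuracy_and_kappa(confusion):
    """Return (overall accuracy, Cohen's kappa) for a square confusion matrix.

    confusion[i][j] counts samples of true class i predicted as class j.
    """
    n_classes = len(confusion)
    total = sum(sum(row) for row in confusion)
    if total == 0:
        raise ValueError("confusion matrix contains no samples")

    # Observed agreement: fraction of samples on the diagonal.
    observed = sum(confusion[i][i] for i in range(n_classes)) / total

    # Chance agreement: product of row and column marginals, summed over classes.
    row_totals = [sum(row) for row in confusion]
    col_totals = [sum(confusion[i][j] for i in range(n_classes))
                  for j in range(n_classes)]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / (total * total)

    if expected == 1.0:
        # Kappa is undefined when chance agreement is already perfect.
        return observed, float("nan")
    kappa = (observed - expected) / (1.0 - expected)
    return observed, kappa


# Toy example (made-up counts, not data from the paper).
oa, kappa = overall_accuracy_and_kappa([[45, 5], [10, 40]])
print(round(oa, 4), round(kappa, 4))  # prints 0.85 0.7
```

Kappa discounts the agreement expected by chance, so it is lower than the overall accuracy computed from the same matrix whenever the classifier is imperfect, which matches the pattern in the reported figures.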

Abstract: Hyperspectral remote sensing images are rich in spectral information and have great application potential in forestry, agriculture, geosciences, and other fields. To address the problems of small sample sizes, high dimensionality, correlation, and nonlinearity, and to improve the accuracy of hyperspectral remote sensing image classification, this study proposed a hyperspectral image dimensionality reduction method based on t-distributed stochastic neighbor embedding (t-SNE). A convolutional neural network (CNN) was then used to extract features and classify the images. Building on the original SNE, t-SNE replaced the Gaussian distribution with the t-distribution and defined a symmetric joint probability distribution, which simplified the gradient calculation. Because of its long tail, the t-distribution was more sensitive to local features. Using the t-distribution instead of the Gaussian distribution ensured that points mapped from the high-dimensional space to the low-dimensional space were almost unaffected by spatial changes, so that intra-class points aggregated closely while inter-class points dispersed. The method could also exploit the local features of high-dimensional data and preserve the nonlinear features of the original dataset. To improve the accuracy of hyperspectral remote sensing classification, a novel method based on manifold learning and CNN was proposed. First, the data points in the original high-dimensional space were mapped into the low-dimensional space. The target dimensionality was important for the classification results; to find the best value, an experiment with dimensions ranging from 5 to 30 was conducted. The dimensionality was set to 20 for the Indian Pines dataset, 16 for the Pavia Center dataset, and 18 for the Pavia University dataset. Perplexity was another important parameter, and it was set to 30 based on testing.
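
The long-tail argument above can be made concrete with a small sketch. It follows the standard t-SNE formulation, in which the low-dimensional similarities use a Student t kernel with one degree of freedom where the original SNE used a Gaussian; it is not the authors' code, and the function names and the three toy points are illustrative assumptions.

```python
import math


def gaussian_kernel(sq_dist):
    """Unnormalized low-dimensional similarity used by the original SNE."""
    return math.exp(-sq_dist)


def student_t_kernel(sq_dist):
    """Unnormalized low-dimensional similarity used by t-SNE (1 degree of freedom)."""
    return 1.0 / (1.0 + sq_dist)


def student_t_joint(points):
    """Symmetric joint probabilities q_ij over all pairs of embedded points."""
    n = len(points)
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:  # q_ii is defined as zero
                sq_dist = sum((a - b) ** 2 for a, b in zip(points[i], points[j]))
                weights[i][j] = student_t_kernel(sq_dist)
    normalizer = sum(sum(row) for row in weights)
    return [[w / normalizer for w in row] for row in weights]


# Toy embedding: two nearby points and one distant point.
q = student_t_joint([(0.0, 0.0), (1.0, 0.0), (5.0, 0.0)])

# At squared distance 25 the Student t kernel keeps far more weight than
# the Gaussian, which is the "long tail" referred to in the abstract.
print(student_t_kernel(25.0) > gaussian_kernel(25.0))  # prints True
```

For experiments at the reported settings, scikit-learn's `TSNE` exposes `n_components` and `perplexity`; note that its default Barnes-Hut solver only accepts fewer than 4 output dimensions, so targets such as 16 to 20 dimensions would need `method="exact"`. Whether the authors used that library is not stated.
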
The topological relations among the data points were preserved after dimensionality reduction. Second, a CNN with a seven-layer structure was designed. It consisted of two convolutional layers, two pooling layers, two fully connected layers, and one further fully connected layer. The convolutional and pooling layers alternated, and the network ended with a fully connected layer. A Softmax function was used as the classifier, and the AdaGrad algorithm was used for network optimization. As optimization progressed, the learning rate was reduced for variables that had already been updated substantially. The rectified linear unit (ReLU) was used as the activation function. ReLU is efficient in gradient descent and backpropagation: it helps avoid exploding and vanishing gradients, simplifies the calculation, and reduces the overall computational cost of the CNN. The dimension-reduced hyperspectral data were used as the input layer, from which the CNN extracted deep features. Finally, the spatial-spectral features of the hyperspectral images were classified. The robustness of the proposed algorithm was verified on three open datasets: (i) Indian Pines, (ii) Pavia Center, and (iii) Pavia University. The overall classification accuracy on the three datasets reached 99.05%, 99.43%, and 98.90%, respectively. With dimensionality reduction, the proposed algorithm performed better than the original CNN. Since t-SNE was more sensitive to local features and considered inter-class differences, remarkable results were achieved for ground-object classes with few samples. Compared with the original CNN, the "salt and pepper noise" problem in hyperspectral image classification was effectively resolved and the overall classification accuracy was significantly improved. Combining manifold learning with convolutional neural networks could also provide a new approach for hyperspectral remote sensing image classification.
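
The AdaGrad behavior mentioned above, where the learning rate shrinks for variables that have accumulated large updates, can be illustrated with a minimal sketch of the standard update rule. This is not the authors' training code; the quadratic toy loss, the base learning rate of 0.1, and all names are illustrative assumptions.

```python
import math


def adagrad_step(params, grads, accum, lr=0.1, eps=1e-8):
    """Apply one in-place AdaGrad update.

    accum holds the running sum of squared gradients per variable; the
    effective step for each variable is lr / (sqrt(accum) + eps).
    """
    for k in range(len(params)):
        accum[k] += grads[k] ** 2
        params[k] -= lr * grads[k] / (math.sqrt(accum[k]) + eps)


def loss(w):
    # Toy quadratic with one steep and one shallow direction.
    return 0.5 * (10.0 * w[0] ** 2 + 0.1 * w[1] ** 2)


def grad(w):
    return [10.0 * w[0], 0.1 * w[1]]


w = [1.0, 1.0]
accum = [0.0, 0.0]
initial_loss = loss(w)
for _ in range(50):
    adagrad_step(w, grad(w), accum, lr=0.1)

# The steep variable has accumulated larger squared gradients, so its
# effective learning rate has been reduced more than the shallow one's.
effective_lr = [0.1 / (math.sqrt(a) + 1e-8) for a in accum]
print(effective_lr[0] < effective_lr[1])  # prints True
print(loss(w) < initial_loss)             # prints True
```

Because the accumulated sum only grows, the effective learning rate never increases during training, which is the decay the abstract describes.
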
Labeled sample data for hyperspectral images were usually difficult to obtain, while the performance of deep learning models depended on many labeled samples. Future work will consider how to construct the classification model with limited labeled samples to obtain better classification results.

Hyperspectral image classification based on manifold spectral dimensionality reduction and deep learning method