基于条件生成对抗网络数据增强的土壤总氮高光谱反演

夏俊芳; 苑宏文; 张浩; 汪波; 杜俊; 魏薇

doi:10.11975/j.issn.1002-6819.202506058

基于条件生成对抗网络数据增强的土壤总氮高光谱反演

Improved Inversion of Soil Total Nitrogen Content Using Conditional Generative Adversarial Networks for Hyperspectral Data Augmentation

摘要

摘要: 土壤总氮含量是衡量土壤养分信息的重要指标，针对小样本条件下土壤可见-近红外光谱反演总氮含量精度不高的问题，通过生成对抗网络（generative adversarialnetwork，GAN）对土壤光谱数据集进行数据增强，为提高生成样本数据质量引入条件生成对抗网络（conditional generative adversarial network，CGAN），分别建立基于总氮含量作为条件的LCGAN和基于变量重要性投影指标（variable importance in projection，VIP）分数极值法筛选特征波长作为条件的VIP-CGAN。通过采集农田土壤原位光谱数据及总氮含量作为样本数据集，对不同生成对抗网络获取的生成样本质量进行定性和定量评估，结果显示VIP-CGAN(T9)生成样本的MMD（maximum mean discrepancy）和FID（Fréchet Inception Distance）分别为0.003和0.005；在训练集中加入数量为原始训练集300%比例的VIP-CGAN(T9)生成样本时，PLSR、SVR和1D-CNN三种模型均达到最佳预测性能，其决定系数R²分别为0.86、0.84和0.88，RMSE分别为0.028g/kg、0.009g/kg和0.026g/kg。本研究为小样本条件下提高土壤总氮含量高光谱反演精度提供了有效方法。

Abstract: Soil total nitrogen (TN) content is a critical indicator for assessing soil nutrient status. However, under small sample size conditions, the accuracy of inverting soil TN content using visible and near-infrared (Vis-NIR) spectroscopy is often unsatisfactory. To address this challenge, this study proposes a novel data augmentation framework based on generative sdversarial networks (GANs). Specifically, to improve the quality of the generated spectral data, a conditional generative adversarial network (CGAN) architecture is employed, which guides the generation process through relevant auxiliary information. The study evaluates three types of adversarial generative networks: a standard GAN, a label-conditional generative adversarial network (LCGAN) that uses soil TN content values as conditional labels, and a VIP-CGAN that employs feature wavelength sets selected based on the extremum method of variable importance in projection (VIP) scores as conditional vectors. Building on this, the feature wavelength selection method is refined by using the extremum method to identify appropriate extreme points on the VIP score curve and extending outward, thereby selecting feature wavelengths less affected by noise and external environmental interference. It is validated that this approach is more effective than using wavelengths with higher VIP scores alone.The experimental dataset consists of Vis-NIR spectra collected in situ from agricultural soils and corresponding laboratory-measured TN content. Through a comprehensive evaluation method combining qualitative and quantitative assessments, the fidelity of synthetic samples generated by the standard GAN, LCGAN, and various configurations of VIP-CGAN is compared and analyzed. Quantitative evaluation results show that the VIP-CGAN variant constructed based on 9 extended feature wavelength bands (referred to as VIP-CGAN(T9)) performs the best. The generated samples achieve maximum mean discrepancy (MMD) and fréchet inception distance (FID) scores as low as 0.003 and 0.005, respectively. These values indicate a high statistical consistency between the generated data and the original data distribution, confirming the model's ability to fully learn the relationship between constraints and features and generate realistic and reliable synthetic spectra.To evaluate the effect of data augmentation, an enhanced dataset is constructed by combining real samples with synthetic samples generated by VIP-CGAN. The predictive performance of three regression models—partial least squares regression (PLSR), support vector regression (SVR), and a one-dimensional convolutional neural network (1D-CNN)—is systematically tested. When using synthetic samples generated by VIP-CGAN(T9) at a proportion of 300% (three times the size of the original training set), all established models achieve optimal performance. The PLSR model attains a coefficient of determination (R²) of 0.86 with a root mean square error (RMSE) of 0.028 g/kg; the SVR model achieves an R²of 0.84 and an RMSE of 0.009 g/kg; and the 1D-CNN model performs best, with an R² of 0.88 and an RMSE of 0.026 g/kg. The results demonstrate significant improvement over baseline models trained solely on the original limited dataset. By conditioning the generator on VIP-selected wavelengths, the model is guided to focus on spectral regions associated with chemical information related to nitrogen compounds and organic matter, forming a physically meaningful constraint mechanism. The resulting augmented data creates a more robust training environment for regression models, effectively mitigating overfitting and improving generalization .In conclusion, this study establishes an effective framework for enhancing the hyperspectral inversion accuracy of soil TN content under small sample size conditions. The proposed method provides a solution to the challenge of small samples in soil Vis-NIR spectroscopy analysis, with potential future applications in analyzing other soil properties and exploring more advanced generative architectures.

HTML全文

参考文献(38)

施引文献

资源附件(0)