基于改进灰狼优化XGBoost的陕北黄土高原植被地上生物量估测

    Estimating above-ground biomass of vegetation in the Loess Plateau of northern Shaanxi using improved gray wolf optimizer of XGBoost

    • 摘要: 黄土高原是中国“三区四带”生态屏障的重要组成部分,精准估测植被地上生物量(aboveground biomass,AGB)对于评估生态建设成效具有重要意义。现有研究缺乏大尺度上高精度、多植被的估测方法,为提高AGB估测精度,该研究以陕北黄土高原为研究区,基于Landsat 8、Sentinel-1遥感影像数据和数字高程数据提取变量因子,利用随机森林算法筛选特征变量,结合实测数据,采用多元线性回归(multiple linear regression,MLR)、反向传播神经网络(back propagation neural network,BPNN)、随机森林(random forest,RF)和极端梯度提升(extreme gradient boosting,XGBoost)4种算法,针对森林和灌草构建了8种AGB反演模型,对比分析不同模型的估测精度。引入并改进灰狼优化(improved grey wolf optimizer,IGWO)算法,寻找“最优”参数组合改进XGBoost模型,选择最优模型对陕北黄土高原AGB进行反演。结果表明:纹理特征在AGB建模中发挥着重要作用,XGBoost模型较传统机器学习算法BPNN和RF表现出更高的估测精度;利用改进灰狼算法优化模型参数可以进一步提高估测精度,森林和灌草AGB最佳模型(IGWO-XGBoost)的决定系数(determination coefficient,R2)分别为0.823和0.735;森林AGB平均值为47.26 t/hm2,主要分布在30~60 t/hm2,灌草AGB平均值为1.96 t/hm2,主要分布在1.00~2.25 t/hm2,AGB表现出较强的空间异质性,总体空间分布呈“南高-中碎-北低”的格局,并随地形(海拔、坡向)变化呈现出不同的分异规律。该研究为生态脆弱区AGB精准估测提供了技术支撑,并为植被恢复措施的精确制定提供了重要参考。

       

      Abstract: Aboveground biomass (AGB) of vegetation is one of the most important indicators to evaluate the ecosystem health and sustainable utilization. Among them, the Loess Plateau in northern Shaanxi is characterized by the multi-dimensional attributes of an agricultural and pastoral intertwined zone, an ecological barrier area, and a carbon sink potential area. It is of great significance to accurately estimate the vegetation AGB for the ecological balance and biodiversity. The regional carbon sink potential can also be quantified to guide sustainable vegetation. Fortunately, the active and passive remote sensing data can be combined with machine learning for the broad application prospects in AGB estimation. This study aims to estimate the above-ground biomass of vegetation in the Loess Plateau of northern Shaanxi using improved gray wolf optimization of XGBoost. The variable parameters were also extracted from the Landsat 8 multispectral remote sensing data, Sentinel-1 radar remote sensing image data, and digital elevation data. The importance of feature variables was calculated using the random forest. The top 15 high-importance feature variables were selected as the modeling factors, because the higher IncNodePurity indicated the stronger variable importance. According to the optimal features and field survey data, four empirical models were established, including the multiple linear regression (MLR), back propagation neural network (BPNN), random forest (RF), and extreme gradient boosting (XGBoost). The AGB values of forest and shrub-grass land were estimated to compare the accuracy of different models. The improved gray wolf optimization (IGWO) was introduced to determine the optimal parameter combinations. The XGBoost model was enhanced for the best performance of the improved model. Finally, the optimal model was obtained to invert the vegetation AGB. The results showed that the texture features shared the outstanding advantages in the AGB modeling. The near infrared band (B5) and cross-polarization (VH) features were of high importance in the forest AGB, while the short-wave infrared bands (B6 and B7) and co-polarization (VV) features were more important in the shrub-grass AGB. The XGBoost model was adjusted for the minimal fitting errors after the residual correction and regularized optimization. Thereby, the higher predictive accuracy was achieved after optimization. The IGWO-XGBoost model that was constructed by the IGWO algorithm performed the best to invert both forest and shrub-grass AGB, where the determination coefficients (R2) were 0.823 and 0.735, respectively, with the increase by 0.070 and 0.053, respectively. And the root mean square error (RMSE) decreased by 12.1% and 9.9%, respectively. The average forest AGB was 47.26 t/hm2, which was distributed mainly between 30-60 t/hm2. While the average shrub-grass AGB was 1.96 t/hm2, which was distributed mainly between 1.00-2.25 t/hm2. The overall spatial distribution of AGB shared a pattern of "high in the south - broken in the middle - low in the north". There were some patterns of differentiation with the variation in the topography (elevation and altitude). The average AGB of forests showed a gradual increase in the distribution with elevation, which was consistent with the anthropogenic disturbances. The average AGB of shrubs and grasses showed the opposite pattern, due mainly to the decrease of the soil nutrients at high altitude and the air temperature, as well as the harsh conditions for the vegetation growth. There were some differences in the biomass on the different slopes, with the highest distribution of AGB in the forest on half-shady slopes and the highest distribution of AGB in the shrub-grass on the sunny slopes. There were the highest average AGB values of the forest and shrub-grass on the shady slopes, with the better standing conditions. This finding can provide the technical support for the accurate estimation of the AGB on the Loess Plateau in northern Shaanxi. A scientific basis can also offer for subsequent studies, such as ecosystem function evaluation.

       

    /

    返回文章
    返回