基于树冠精准分割和多源特征融合的无人机单木材积估测

龙开源; 龙江平; 林辉; 孙华; 徐川; 黄子加

doi:10.11975/j.issn.1002-6819.202503210

基于树冠精准分割和多源特征融合的无人机单木材积估测

Single-tree volume estimation using UAV-based crown segmentation and multi-source feature fusion

摘要

摘要: 随着森林资源管理逐步迈向精准化与数字化，无人机技术为智能化与自动化的森林资源样地调查提供了一种解决方案。然而，当前树冠分割边界刻画不够精细、单木材积估测精度较低的问题仍然突出，同时高精度激光雷达数据的获取成本较高，限制了其在实际应用中的广泛推广。为提高单木材积估测的精度与效率，克服现有方法中树冠分割不精细和高精度激光雷达数据成本高的问题，该研究提出了一种基于树冠精准分割和多源特征融合的无人机单木估测方法。在此方法中，基于YOLOv11算法，结合引入 ScaleEdgeExtractor（SEE）、DilatedFusion（DF）、C2BRA 和 GatedFPN 等模块，增强了树冠边界的感知能力和多尺度特征表达能力，并构建了高精度树冠分割网络 CrownSeg。在此基础上，基于树冠形态、光谱及纹理特征的多维特征融合策略，结合递进特征组合方法和加权集成学习模型构建了单木材积估测模型。结果表明，CrownSeg 树冠分割算法提升了树冠边界的刻画精度，交并比（intersection over union, IoU）阈值为0.5时的平均精度（AP50）达到94.9%，较基准模型提升1.5个百分点；IoU阈值从0.5到0.95区间的平均精度（AP50-95）达到66.2%，较基准模型提升3.8个百分点。此外，多源特征融合有效强化了单木材积的预测能力，最终加权集成模型表现优异，其决定系数（R²）达到0.9215，平均绝对误差（MAE）为0.022 8 m³，平均绝对百分比误差（MAPE）为17.00%，均优于单一模型，展现出良好的模型稳定性和泛化能力，可为无人机遥感技术在精准林业中的应用提供技术参考。

Abstract: Forest resource management is increasingly required for high precision and digitization in recent years. Alternatively, the unmanned aerial vehicle (UAV) technology has emerged as a promising potential for the intelligent and automated forest inventory. However, some challenges have hindered its adoption. For instance, there are the imprecise crown segmentation, limited accuracy in the single-tree volume estimation, and the high cost of the high-precision light detection and ranging (LiDAR) point cloud data. In this study, the novel single-tree volume estimation was developed using UAV-derived visible light imagery and low-density point cloud data. The precision of the crown segmentation was improved to integrate the multi-source features during volume estimation. A crown segmentation network (called CrownSeg) was introduced to utilize the UAV visible light imagery. The YOLOv11 framework was established with several specialized modules. Among them, the ScaleEdgeExtractor (SEE) module employed a three-stage mechanism—shallow filtering, edge enhancement, and cross-layer fusion—combining directional Sobel convolution, multi-scale downsampling, and adaptive edge-feature fusion, in order to effectively preserve and enhance crown boundary information. The gated feature pyramid Network (GatedFPN) adopted a bi-directional hierarchical structure with the spatial-channel dual-attention gating. The closed-loop multi-scale optimization and more refined crown segmentation were realized over different canopy densities. The C2BRA module introduced the bi-level routing attention and a channel-spatial dual-attention mechanism, in order to enhance the boundary perception while suppressing background interference from complex forest environments. Meanwhile, the DilatedFusion (DF) module was utilized to integrate the parallel dilated convolutions with the shared kernels, in order to extract the multi-granularity contextual information suitable for the trees with the various shapes and sizes. These modules worked collaboratively to enhance the spatial detail retention and semantic feature extraction, resulting in high-quality segmentation outputs. In volume estimation, the crown morphological, spectral, and textural features were extracted from the UAV imagery with the tree height data from the low-density LiDAR point clouds. A progressive feature combination and a weighted ensemble learning were employed to integrate these multi-source inputs for the robust prediction. The CrownSeg network was achieved in an Average Precision at an Intersection over Union threshold of 0.5 (AP50) of 94.9% and an AP50-95 of 66.2% over the baseline model 1.6 and 3.8 percentage points, respectively, indicating the boundary delineation and multi-scale feature representation. The weighted ensemble model of volume estimation yielded a coefficient of determination (R²) of 0.921 5, a mean absolute error (MAE) of 0.022 8 cubic meters, and a mean absolute percentage error (MAPE) of 17.00%, compared with the standalone models. Comparative analysis showed that the morphological, spectral, and textural features significantly reduced the estimation errors, demonstrating the superior stability and generalization across diverse forest conditions. A series of experiments was carried out to validate the improved model. The experimental data were collected from 749 single trees in a plantation forest. The error metrics were consistently lower than those of individual algorithms, like Random Forest or Neural Networks. Visual inspections confirmed that the CrownSeg shared excellent performance on the complex canopy structures and segmentation in the dense or heterogeneous stands. Ultimately, a high-precision crown segmentation network and an accurate single-tree volume estimation model were established using UAV-based data. A cost-effective alternative approach can be expected to replace the traditional ground-based surveys. The finding can also provide a practical technical framework for the UAV remote sensing applications in precision forestry. Future efforts are suggested to explore the multi-modal data integration. The LiDAR and optical imagery can further refine the segmentation and estimation accuracy in varied forest environments.

HTML全文

参考文献(36)

施引文献

资源附件(0)