Monocular Depth Estimation-Based Method for Winter Wheat Plant Height Extraction

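    • Summary: To meet the need to measure winter wheat plant height with imaging technology, this study proposes a winter wheat plant height extraction method based on monocular depth estimation (monocular height regression method, MHRM). MHRM takes winter wheat images captured by a camera as input, obtains the effective crop information through target region localization, generates pixel-level depth results, and maps the depth information to the true plant height. During training, pixel-level constraints and scale consistency constraints are combined for joint supervision, improving the accuracy of depth estimation and the reliability of plant height extraction. Winter wheat image data were collected and experiments carried out at the Agricultural Meteorology Experimental Station in Taian, Shandong, with BTS, FCRN, DORN, and DPT selected as comparison models. The experimental results show that the depth estimation network outperforms the comparison models on root mean square error (2.759), logarithmic root mean square error (0.157), relative error (0.152), and squared relative error (0.907). The depth estimation results were further converted to plant height and compared with measured values: MHRM reached an accuracy of 97.97%, higher than BTS (86.46%), FCRN (92.40%), DORN (94.35%), and DPT (96.52%), demonstrating the method's effectiveness and reliability for monitoring winter wheat growth and its suitability for research and production practice.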

       

      Abstract: To meet the demand for accurate and non-destructive measurement of winter wheat plant height using imaging technology, this paper proposes a monocular height regression method (MHRM) for winter wheat plant height extraction. Plant height is an important agronomic parameter for assessing crop growth status, biomass accumulation, lodging risk, and yield potential, and it plays a key role in crop management and yield estimation. Conventional plant height measurement methods are typically labor-intensive, time-consuming, and difficult to apply efficiently in large-scale field environments. In contrast, monocular image-based approaches provide a low-cost and flexible alternative, but they still face challenges such as scale ambiguity, background interference, and complex illumination conditions in natural agricultural scenes.

The proposed MHRM framework takes RGB images of winter wheat acquired by a conventional camera as input and integrates target region localization, monocular depth estimation, and depth-to-height mapping to achieve accurate plant height extraction. First, effective crop regions are identified through target region localization to suppress background noise and reduce the influence of soil, shadows, and non-crop objects. This preprocessing step ensures that subsequent depth estimation focuses on relevant crop structures and improves the robustness and stability of the overall method.

Following crop region localization, a monocular depth estimation network is employed to generate dense, pixel-level depth maps from single-view images. Unlike geometry-based methods that rely on stereo or multi-view inputs, the proposed approach learns depth-related visual cues directly from monocular images by exploiting semantic and structural information. During network training, pixel-level supervision and scale consistency constraints are jointly introduced to enhance depth estimation accuracy and alleviate the scale ambiguity inherent in monocular depth prediction. These constraints promote structural coherence in the predicted depth maps and improve the reliability of depth-based plant height estimation under varying field conditions.

The estimated depth information is subsequently transformed into actual winter wheat plant height using a depth-to-height mapping strategy. This process enables quantitative plant height estimation at the pixel level and provides detailed spatial information on height distribution within crop canopies, which is beneficial for fine-grained crop growth analysis and phenotypic assessment.

To evaluate the effectiveness of the proposed method, winter wheat image data were collected at the Agricultural Meteorology Experimental Station in Taian, Shandong Province, under real field conditions. The dataset covers the heading stage of winter wheat and includes variations in illumination, planting density, and background complexity. Comparative experiments were conducted using several representative monocular depth estimation models, including BTS, FCRN, DORN, and DPT, which are widely used benchmarks in depth estimation research.

Experimental results demonstrate that the proposed depth estimation network outperforms the comparative models in terms of root mean square error (RMSE = 2.759), logarithmic root mean square error (LogRMSE = 0.157), relative error (REL = 0.152), and squared relative error (SqREL = 0.907), indicating superior depth estimation performance. Furthermore, the predicted depth maps were converted into winter wheat plant height and compared with ground-truth measurements. The proposed MHRM achieved a plant height estimation accuracy of 97.97%, which exceeds that of BTS (86.46%), FCRN (92.40%), DORN (94.35%), and DPT (96.52%).

Overall, these results demonstrate that the proposed MHRM provides an effective and reliable solution for winter wheat plant height estimation using monocular imagery and shows strong potential for practical applications in crop growth monitoring, precision agriculture, and agricultural scientific research.
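The abstract reports four depth error metrics (RMSE, LogRMSE, REL, SqREL) without stating their formulas. The sketch below computes them under the definitions commonly used in monocular depth estimation benchmarks; it is an assumption that the paper uses these same definitions. The function name `depth_error_metrics` and the flat-list input format are hypothetical, chosen only for illustration.

```python
import math
from typing import Dict, Sequence


def depth_error_metrics(pred: Sequence[float], gt: Sequence[float]) -> Dict[str, float]:
    """Compute common depth error metrics over paired depth values.

    Assumed definitions (not confirmed by the source):
      RMSE    = sqrt(mean((pred - gt)^2))
      LogRMSE = sqrt(mean((ln pred - ln gt)^2))
      REL     = mean(|pred - gt| / gt)
      SqREL   = mean((pred - gt)^2 / gt)
    """
    # Keep only pixels where both depths are positive, so that the
    # logarithm and the division by ground truth are well defined.
    pairs = [(p, g) for p, g in zip(pred, gt) if p > 0 and g > 0]
    if not pairs:
        raise ValueError("no valid (positive) depth pairs to evaluate")

    n = len(pairs)
    rmse = math.sqrt(sum((p - g) ** 2 for p, g in pairs) / n)
    log_rmse = math.sqrt(sum((math.log(p) - math.log(g)) ** 2 for p, g in pairs) / n)
    rel = sum(abs(p - g) / g for p, g in pairs) / n
    sq_rel = sum((p - g) ** 2 / g for p, g in pairs) / n

    return {"RMSE": rmse, "LogRMSE": log_rmse, "REL": rel, "SqREL": sq_rel}


if __name__ == "__main__":
    # Toy values only; these are not data from the study.
    predicted = [2.0, 4.0]
    measured = [1.0, 4.0]
    for name, value in depth_error_metrics(predicted, measured).items():
        print(f"{name}: {value:.4f}")
```

Lower values are better for all four metrics. In practice the inputs would be flattened depth maps restricted to the localized crop region, and a NumPy implementation would be more efficient for full-resolution images.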

       
