基于YOLO-SDCG和椭圆傅里叶描述子的番茄苗表型检测

    Phenotypic detection of tomato seedlings based on YOLO-SDCG and elliptic Fourier descriptors

    • 摘要: 番茄苗的表型特征是判断其是否适宜移栽的重要依据,为快速准确识别番茄苗表型特征,该研究提出了一种基于YOLO-SDCG和椭圆傅里叶描述子的番茄苗表型检测方法。针对在穴盘育苗期,番茄穴盘苗(35~40 d)生长密集、遮挡情况复杂、难以检测其直径和直立度等表型参数的问题,首先搭建番茄苗图像采集系统,融合正视与侧视视角获取图像数据;其次,改进YOLOv8s-seg模型为YOLO-SDCG,将动态蛇卷积(dynamic snake convolution, DySConv)模块添加到C2f模块(cross-stage partial-connection with 2 convolutions)以增强茎秆分割能力;采用内容感知特征重组模块(content-aware reassembly of features, CARAFE)替代原有的卷积上采样模块以提升特征重建与融合;在骨干网络和颈部网络中加入幻影卷积(grouped hybrid one-shot tensor, GHOST)以减少模型参数量和计算量。最后,融合图像分割、椭圆傅里叶描述子(elliptic Fourier descriptors, EFDs)、最大内切圆法、弦弧比与分段拟合法,实现番茄苗茎秆直径和直立度等表型参数的检测。结果表明,YOLO-SDCG在自建番茄苗数据集上掩码水平的精确率、召回率和平均精度均值分别为93.1%、93.9%、94.9%,较基线模型(YOLOv8s-seg)分别提高了4.6、2.7和2.4个百分点,参数量与运算时间小幅增加0.32 M和0.4 ms,但满足部署要求。最大内切圆法在正视图、侧视图下茎秆直径的平均绝对误差均为0.03 mm,平均绝对百分比误差均为1.04%;弦弧比与分段拟合法在正视图、侧视图下直立度的平均绝对误差分别为1.60°、1.80°,平均绝对百分比误差分别为2.00%、2.14%;决定系数均大于0.96,验证了该方法可有效估测番茄苗表型参数。该研究可为其他穴盘苗表型特征检测提供方法参考。

       

      Abstract: Phenotypic parameters can be detected, such as the diameter and erectness, under dense growth and complex occlusion at the mature tomato seedlings (35-40 d) during the plug seedling stage. In this study, a dual-view detection was proposed to combine a variable-pitch stepped manipulator and the YOLO-SDCG model. Three aspects were employed, including the hardware design, the improved YOLOv8s-seg model, and phenotypic parameter extraction. Firstly, in terms of the hardware design, an image acquisition was constructed for the tomato seedlings. The top- and side-view perspectives were integrated to capture the image data. Secondly, the YOLOv8s-seg model was enhanced into the YOLO-SDCG model, in terms of the vision algorithm. Dynamic snake convolution (DySConv) was introduced to extract the slender tubular features of the main stem during the initial feature extraction. The content-aware reassembly of the features (CARAFE) module was adopted to enhance the resolution for the high-level semantic features. The tomato seedling stems were extracted from the deep layers of the backbone network using content-adaptive upsampling. The high-resolution stem feature maps were then concatenated or weighted-fused from the shallow layers of the backbone network. Stem semantic information was integrated with the seedling spatial details. Grouped hybrid one-shot tensor (GHOST) convolution was incorporated to extract the intrinsic features using a small number of standard convolutions. The ghost features were generated after a linear transformation. Thereby, the parameters and computational cost were reduced for the parameter extraction. Finally, the image segmentation, elliptic Fourier descriptors (EFDs), the maximum inscribed circle, chord-to-arc ratio, and piecewise fitting were integrated to detect the phenotypic parameters, such as the stem diameter and erectness, after stem segmentation. Experimental results showed that a stepped arrangement of the multiple plants was achieved to effectively avoid the seedling occlusion using the variable-pitch stepped manipulator. Thereby, a stable input was provided for the subsequent image acquisition and detection after hardware design. In terms of the vision algorithm, compared with the YOLOv8-seg, the YOLO-SDCG model was achieved in the precision, recall, and mean average precision of 93.1%, 93.9%, and 94.9%, respectively, which was improved by 4.6, 2.7, and 2.4 percentage points, respectively, with the parameter count and inference time of 3.58 M and 3.0 ms, respectively. The contour structure of the stems was effectively segmented to maintain a better balance between accuracy and computational efficiency. The contour of the main stem of the tomato seedlings was a closed tubular curve after parameter extraction. The high-quality reconstruction was obtained using elliptic Fourier descriptors with a harmonic order n=16. The maximum inscribed circle yielded the mean absolute errors of 0.03 mm for the stem diameter in both top and side views, with the mean absolute percentage errors of 1.04%. The chord-to-arc ratio and piecewise fitting were achieved in the mean absolute errors for the erectness of 1.60° and 1.80° in the top and side views, respectively, with the mean absolute percentage errors of 2.00% and 2.14%, while the coefficients of determination all exceeded 0.96. Transplanting experiments demonstrated that the success rates were 93.13% and 92.50%, respectively, for the clamping of seedlings from 72-cell and 105-cell trays at a picking frequency of 120 seedlings per minute. The transplanting shared the operational efficiency to maintain the reliable seedling screening and grasping. Only two phenotypic traits—the diameter and erectness—were included in the phenotypic detection of the tomato seedling. The dataset can be expected to incorporate more phenotypic features in the more efficient detection of the tomato seedling phenotypes.

       

    /

    返回文章
    返回