吕佳, 李帅军, 曾梦瑶, 董保森. 基于半监督SPM-YOLOv5的套袋柑橘检测算法[J]. 农业工程学报, 2022, 38(18): 204-211. DOI: 10.11975/j.issn.1002-6819.2022.18.022
    引用本文: 吕佳, 李帅军, 曾梦瑶, 董保森. 基于半监督SPM-YOLOv5的套袋柑橘检测算法[J]. 农业工程学报, 2022, 38(18): 204-211. DOI: 10.11975/j.issn.1002-6819.2022.18.022
    Lyu Jia, Li Shuaijun, Zeng Mengyao, Dong Baosen. Detecting bagged citrus using a Semi-Supervised SPM-YOLOv5[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2022, 38(18): 204-211. DOI: 10.11975/j.issn.1002-6819.2022.18.022
    Citation: Lyu Jia, Li Shuaijun, Zeng Mengyao, Dong Baosen. Detecting bagged citrus using a Semi-Supervised SPM-YOLOv5[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2022, 38(18): 204-211. DOI: 10.11975/j.issn.1002-6819.2022.18.022

    基于半监督SPM-YOLOv5的套袋柑橘检测算法

    Detecting bagged citrus using a Semi-Supervised SPM-YOLOv5

    • 摘要: 为解决柑橘经过套袋后其形状从圆形变为条状且纹理细节急剧减低,导致当前目标检测算法对套袋柑橘检测难度增大,同时目标检测算法性能依赖于有标记样本数量的问题。该研究设计了一种基于教师学生模型的SPM(Strip Pooling Module)-YOLOv5算法,在YOLOv5的骨干网络中加入条带注意力模块使模型更加关注条状的套袋柑橘与树枝,同时教师学生模型为半监督方法,使目标检测算法可利用无标记样本提升模型的性能,降低对有标记样本的依赖。试验结果表明,该文算法在套袋柑橘与树枝检测的平均精度均值分别为77.4%与53.5%,相比YOLOv5分别提升了7.5个百分点与7.6个百分点,套袋柑橘检测的精度与召回率达到94%与76.2%。因此,基于教师学生模型的SPM-YOLOv5算法精度高、速度快,能有效用于套袋柑橘检测。

       

      Abstract: Abstract: Bagged citrus has triggered a dramatic decrease in the texture details, due to the shape evolution from round to stripping during processing. A great challenge has also been posed in citrus object detection, depending mainly on the number of labeled samples. In this study, an accurate and rapid detection method was proposed for the bagged citrus based on the Teacher-student model with a Strip Pooling Module (SPM)-YOLOv5 algorithm. The images of bagged citrus were collected in the Paidengte Agricultural Science and Technology Demonstration Park, Bishan District, Chongqing of China. The data set of bagged citrus was generated by the image cleaning, enhancement, and labelling tags. Firstly, the stripe attention module was added to the backbone network of YOLOv5. Much attention of the model was drawn to the striped bagged citrus and branches, in order to reduce the average pooling focus on a large number of unrelated areas. Besides, the SPM was integrated into the backbone network of YOLOv5. Among them, the horizontal and vertical pooling were focused on the encoding remote context along the horizontal or vertical spatial dimensions. The SPM was also used to solve the overlapping of each spatial position in the feature map. Specifically, the global horizontal and vertical information was encoded to balance the own weight for the feature modification, in order to effectively expand the receptive field of the backbone network. As such, the SPM was different from the global pooling that only focused on one area. The striped pooling was utilized to easily realize the characteristics of discrete distribution in the horizontal and vertical pooling for a long time. The stripe kernel was used for the feature extraction in the horizontal and vertical directions, in order to capture more local details in the stripe pooling. In doing so, the strip pooling was different from the traditional space pooling that depended on the square core. At the same time, the Teacher-student model was semi-supervised to calculate the consistency loss for the unlabeled samples. Two stages were mainly composed of the model. The first stage was Burn-In. The effective pseudo tags were generated for the teacher model to be well initialized. Therefore, the teacher model was then initialized with the labeled samples. The second stage was mutual learning between the teachers and students. The model was trained using the labeled and unlabeled samples. The robustness of the model was enhanced to reduce the consistency loss in the training process. The target detection was performed on the unlabeled samples, in order to improve the performance of the model and reduce the dependence on labeled samples. The experimental results demonstrated that the average precision of SPM-YOLOv5 for the bagged citrus and branch detection was 77.4% and 53.5%, respectively, which was 7.5% and 7.6% higher than that of YOLOv5. The precision and recall rate of bagged citrus detection reached 94% and 76.2%, respectively. More importantly, the precision of SPM-YOLOv5 based on the Teacher-student model reached 82.6% under the condition of occlusion and overlapping. Meanwhile, the best detection was achieved in 1500 unlabeled and 500 labeled samples. Therefore, the SPM-YOLOv5 based on the Teacher-student model can be expected to detect bagged citrus with higher accuracy and faster speed than before.

       

    /

    返回文章
    返回