靳红杰,马顾彧,唐梦圆,等. 复杂环境下黄花菜识别的YOLOv7-MOCA模型[J]. 农业工程学报,2023,39(15):181-188. DOI: 10.11975/j.issn.1002-6819.202305100
    引用本文: 靳红杰,马顾彧,唐梦圆,等. 复杂环境下黄花菜识别的YOLOv7-MOCA模型[J]. 农业工程学报,2023,39(15):181-188. DOI: 10.11975/j.issn.1002-6819.202305100
    JIN Hongjie, MA Guyu, TANG Mengyuan, et al. Identifying daylily in complex environment using YOLOv7-MOCA model [J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2023, 39(15): 181-188. DOI: 10.11975/j.issn.1002-6819.202305100
    Citation: JIN Hongjie, MA Guyu, TANG Mengyuan, et al. Identifying daylily in complex environment using YOLOv7-MOCA model [J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2023, 39(15): 181-188. DOI: 10.11975/j.issn.1002-6819.202305100

    复杂环境下黄花菜识别的YOLOv7-MOCA模型

    Identifying daylily in complex environment using YOLOv7-MOCA model

    • 摘要: 黄花菜是极具营养价值和经济效益的一种农作物,深受人们喜爱。目前黄花菜采摘大都是人工采摘,采摘效率低、人工成本较高,在设计黄花菜自动采摘机器人的过程中,复杂环境下黄花菜的目标识别是实现智能化采摘的核心问题。该研究建立了包含12000幅黄花菜样本的数据库, 比较了You Only Look Once(YOLOv7)、Faster Region Convolutional Neural Networks(Faster R-CNN)和Single Shot MultiBox Detector(SSD)三种模型的检测效果,提出一种基于改进YOLOv7目标检测算法的复杂环境下黄花菜识别的YOLOv7-MOCA模型,使用MobileOne网络作为主干特征提取网络,构建了一种轻量化网络模型,并在颈部网络中融合Coordinate Attention注意力机制改善对样本的检测效果。试验结果表明, YOLOv7-MOCA模型检测准确率为96.1%,召回率为96.6%,F1值为0.96,权重为10 MB,帧速率为58帧/s。较YOLOv7检测速度提高了26.1%,权重减少了86.7%,该研究所提出的YOLOv7-MOCA模型检测准确率等参数均大幅提升。该模型可以实现黄花菜的快速识别,模型权重小,识别速度快,为黄花菜智能化采摘设备研究提供技术支撑。

       

      Abstract: Daylily is one of the most favored crops with the great nutritional and economic value. Manual harvesting cannot fully meet the large-scale production at present, due to the inefficient and costly tasks. Mechanized picking can be expected for the large-scale development of daylily industry. However, it is still lacking on the suitable machinery and equipment for the large-scale automated harvesting, due to the complex environments, such as mutual shielding of daylily, interference of weeds, and changes in lighting. It is a high demand for the target identification of daylily under the complex environment for the intelligent harvesting in an automatic harvesting robot. In this study, an improved YOLOv7-MOCA model was established to identify the daylily under the complex environment. The image data of daylily samples were collected from the real harvesting scene in daylily farms. A daylily dataset was established with 12000 sample images, including 2000 raw sample image data of single-target, multi-target and occlusion type in sunny and cloudy weather, and 10000 image data processed by mirroring, brightness transformation, Gaussian filtering, and Gaussian noise processing. Sample images was then enhanced for the better generalization ability of image training models. The daylily dataset in the experiment was divided into a training set, a test set and a validation set, with 7200, 2400, and 2400 images, respectively. The training and identification experiments were then performed on the daylily dataset. The detection effectiveness was compared among the three classical object detection models YOLOv7(You Only Look Once), Faster R-CNN (Fast Region-based Convolutional Network), and SSD (Single Shot MultiBox Detector). The results show that the comprehensive performance of YOLOv7 was better than those of the other two models. As such, an object identification model (YOLOv7-MOCA) was proposed to recognize the daylily in complex environment using YOLOv7. Firstly, a lightweight network model was constructed using MobileOne network as the backbone feature extraction network. Deep convolution was used to replace the ordinary convolution in MobileOne for the less cost of network computing that caused by feature extraction and object detection. The feature extraction efficiency and model detection speed of the model backbone network were compared with the replacement of two mainstream lightweight neural networks. Secondly, the Coordinate Attention (CoordAtt) attention mechanism was integrated in the neck network, in order to improve the receptive field of the model before the convolution operation. The mechanism was enhanced the attention of the model in the characteristics of small target samples of daylily. The comprehensive performance of intelligent identification was achieved in the real-time detection effectiveness of daylily samples. Finally, a YOLOv7-MOCA lightweight daylily fast identification model was designed by ablation test. The MobileOne was used as the backbone network, whereas, a neck network was utilized the attention mechanism of CoordAtt. The experimental results showed that the model YOLOv7-MOCA detection accuracy reached 96.1% with the recall rate of 96.6% and F1 coefficient of 0.96. The model weight was 10 MB, whereas, 58 frames per second was used to transmit. Compared with the YOLOv7, the detection accuracy of the improved model increased by 3.2 percentage points, the average detection speed increased by 26.1%, whereas the weight was reduced by 86.7%, indicating the small model weight and fast identification speed. The optimal parameters were obtained from the improved lightweight network model. The detection speed and model weight were significantly redesigned to effectively optimize the identification and detection of daylily in the complex environment of the network model, in terms of the mutual occlusion of target samples, weed interference, and light change. This improved model can be expected to intelligently identify some targets under complex environments for the automatic harvesting equipment of daylily. The finding can provide the technical support for the research of intelligent harvesting equipment of daylily.

       

    /

    返回文章
    返回