基于改进YOLOv11n的轻量级多尺度水稻害虫识别模型

    Lightweight multi-scale rice pest recognition model based on improved YOLOv11n

    • 摘要: 为了解决水稻害虫种类繁多、尺寸和形态差异显著所导致的误检、漏检等问题,该文提出了多尺度水稻害虫检测与计数的轻量级模型YOLO-MSLP(multi-scale lightweight pest)。该模型以YOLOv11n为架构基础,首先,为了能更好地处理多尺度害虫的特征信息,在颈部网络中引入多尺度特征融合模块AP_BiFPN(adaptive pooling bidirectional feature pyramid network);其次,为增强模型对关键区域聚焦能力,强调跨维度交互,融合改进的多尺度三元组注意力模块MS-TAM(multi-scale triplet attention module);最后,为满足嵌入式设备部署的需求,利用RepViT(reparameterization vision transformer)和知识蒸馏技术进一步实现模型轻量化。结果显示,YOLO-MSLP的平均精度均值达到94.5%,召回率为91.7%,浮点运算量为6.5G,模型大小为4.5 MB;相较于基线模型YOLOv11n,检测精度提升了3.1个百分点,推理时耗降低了26.8%。结果表明,YOLO-MSLP模型在识别多尺度水稻害虫方面,具有精确度高和轻量化的优点,可为多尺度水稻害虫研究提供算法参考。

       

      Abstract: Rice is one of the most important staple crops worldwide. Its yield and quality can directly influence the global food security and the agricultural economy. Nevertheless, the rice pests have ranked the most among common biological disasters that threaten stable, high yield rice production. The International Rice Research Institute has reported that the rice pests and diseases have cut the yields by up to 37%, with the losses ranging from 24% to 41%. Real-time monitoring and counting of pests are often required for the prevention and control in green agriculture. It is very necessary for the accurate identification of pests at different scales and reliable tracking of their population dynamics Yet the pest community in paddy fields is extraordinarily diverse. Pest sizes are often ranged from the millimeter scale aphids and thrips to stem borers and leaf folder larvae exceeding ten millimeter. All pest can concurrently occur in the same plot, and then simultaneously inhabit leaves, leaf sheaths, stems, or panicles. Conventional manual scouting or simple image processing cannot fully meet the accurate detection and counting, due to the complex spatial distribution, extreme morphological variation, and heavy background clutter. In this study, the YOLO-MSLP (multi-scale lightweight pest), an intelligent lightweight model was proposed for the rice-pest detection and counting, in order to overcome these challenges. The latest YOLOv11n backbone, YOLO-MSLP was introduced three innovations that tailored to the complex scenes in the paddy field. Firstly, an adaptive pooling bidirectional feature pyramid network (AP-BiFPN) was embedded in the neck. The adaptive pooling was dynamically adjusted the receptive field and bidirectional cross scale fusion. Multiscale features were extracted and then aggregated in a stable manner, whether the targets were solitary pests or dense clusters. The small object detection was greatly improved for the accuracy of the large object localization. Secondly, a multi-scale triplet attention module (MS-TAM) was inserted between the backbone and detection heads. The channel, spatial, and scale dimensions were operated in parallel. The discriminative pest features were adaptively highlighted to suppress the redundant background information closely resembled the pests, such as the shape, texture, and color. Experimental results showed that the module was maintained on the high confidence outputs even under back lighting, leaf occlusion, or overlapping rice plants. Finally, the backbone was reengineered with a reparametrized vision transformer (RepViT), in order to lower deployment barriers. Furthermore, knowledge distillation was compressed to transfer the rich representations from a larger teacher network into the lightweight student. The YOLO-MSLP was achieved a mean average precision (mAP) of 94.5% and a recall of 91.7%, after pruning, quantization, and operator fusion. Floating point operations were reduced by 24.4%, and model size was shrunk by 40.7%. Inference latency for a single image on an edge GPU fell below 35 ms. Extensive testing confirmed that the YOLO MSLP can run in real time on embedded devices, thus providing for a low-cost, highly reliable tool for early warning, precise spraying, and green control of rice pests. The model can be expected for the large-scale smart-agriculture deployments to advance the sustainable rice industry. The finding can also provide the data referenec for the scientific interventions, thereby reducing the pesticide use and residue risk.

       

    /

    返回文章
    返回