王泽钧, 马凤英, 张瑜, 张芳芳, 纪鹏, 曹茂永. 基于注意力机制和多尺度轻量型网络的农作物病害识别[J]. 农业工程学报, 2022, 38(Z): 176-183. DOI: 10.11975/j.issn.1002-6819.2022.z.020
    引用本文: 王泽钧, 马凤英, 张瑜, 张芳芳, 纪鹏, 曹茂永. 基于注意力机制和多尺度轻量型网络的农作物病害识别[J]. 农业工程学报, 2022, 38(Z): 176-183. DOI: 10.11975/j.issn.1002-6819.2022.z.020
    Wang Zejun, Ma Fengying, Zhang Yu, Zhang Fangfang, Ji Peng, Cao Maoyong. Crop disease recognition using attention mechanism and multi-scale lightweight network[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2022, 38(Z): 176-183. DOI: 10.11975/j.issn.1002-6819.2022.z.020
    Citation: Wang Zejun, Ma Fengying, Zhang Yu, Zhang Fangfang, Ji Peng, Cao Maoyong. Crop disease recognition using attention mechanism and multi-scale lightweight network[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2022, 38(Z): 176-183. DOI: 10.11975/j.issn.1002-6819.2022.z.020

    基于注意力机制和多尺度轻量型网络的农作物病害识别

    Crop disease recognition using attention mechanism and multi-scale lightweight network

    • 摘要: 准确识别农作物病害并及时防护是保障农作物产量的重要措施。针对传统农作物病害识别模型体积大、准确率不高的问题,该研究提出一种基于注意力机制和多尺度特征融合的轻量型神经网络模型(Lightweight Multi-scale Attention Convolutional Neural Networks,LMA-CNNs)。首先,为减少参数量,使模型轻量化,网络主体结构采用深度可分离卷积;其次,在深度可分离卷积基础上设计出残差注意力模块和多尺度特征融合模块;同时引入Leaky ReLU激活函数增强负值特征的提取。残差注意力模块通过嵌入通道和空间注意力机制,增强有用特征信息的权重并减弱噪声等干扰信息的权重,残差连接能够有效防止网络退化。多尺度特征融合模块利用其不同尺度的卷积核提取多种尺度的病害特征,提高特征的丰富度。试验结果表明,LMA-CNNs模型在59类公开农作物病害图像测试集上的准确率为88.08%,参数量仅为0.14×107,优于ResNet34、ResNeXt、ShuffleNetV2等经典神经网络模型。通过比较不同研究者在同一数据集下所设计的网络模型,进一步验证LMA-CNNs模型不仅拥有更高的识别精度,还具有更少的参数。该研究提出的LMA-CNNs模型较好地平衡模型复杂程度和识别准确率,为移动端农作物病害检测提供参考。

       

      Abstract: Abstract: In recent years, diseases and pests have caused a huge loss in agricultural production. Accurate identification of crop diseases and timely protection are important measures to ensure crop yield. Traditional methods of diagnosing agricultural diseases typically depend on the expertise and judgment of specialists. This approach is dependent on human subjective perception, which is prone to error and cannot ensure timeliness. The optimal time to cure agricultural diseases may be missed by traditional methods, resulting in financial losses. The neural networks and the development of deep learning have brought new technologies to the appraisal of agricultural diseases. However, certain large-scale neural networks cannot be implemented on mobile terminals to accomplish crop disease detection in realistic settings due to the low identification accuracy and a huge number of parameters. To address the problems of large size and low accuracy of traditional crop disease recognition models, we proposed a Lightweight Multi-scale Attention Convolutional Neural Networks (LMA-CNNs) to solve the above problems. First, in order to reduce the number of parameters and make the model lightweight, depthwise separable convolution was adopted as the main structure of the network; secondly, the residual attention module and multi-scale feature fusion module were designed on the basis of depthwise separable convolution; at the same time, the Leaky ReLU activation function was introduced to enhance the extraction of negative-valued features. The residual attention module enhanced the weight of useful feature information and weakened the weight of interference information such as noise by embedding channels and spatial attention mechanisms, and improved the recognition of important features by the network model. Residual connections could effectively prevent network degradation. The multi-scale feature fusion module used its convolution kernels of different scales to extract disease features of multiple scales, which improved the richness of features. The experimental results showed that the accuracy of the LMA-CNNs model on the test set of 59 types of disease images was 88.08%, and the number of parameters was only 0.14×107. Through comparative experiments, the LMA-CNNs model outperformed ResNet34, ResNeXt, ShuffleNetV2, MobileNetV3, and the more popular Vision Transformer recently. This study further verified the effectiveness of the LMA-CNNs model by comparing the network models designed by different researchers under the same dataset. Comparative experiments showed that the LMA-CNNs model reduced the number of model parameters on the premise of improving accuracy. Because of the problem of poor interpretability of the neural network model, this study used Grad-CAM to visualize the features extracted by the middle layer of the model and explained the model through the visualization results to obtain different feature information on different convolutional layers. As the number of layers increased, the LMA-CNNs model paid more attention to the diseased area. In summary, the LMA-CNNs model could extract more disease feature information, better balanc the model complexity and model recognition accuracy, and provid a reference for mobile crop disease recognition. In the future, we will continue to optimize the algorithm, deploy the model to the mobile terminal to detect crop diseases in real-field scenarios, and improve detection accuracy and efficiency.

       

    /

    返回文章
    返回