SUN Shouxin, SUN Yiming, ZHANG Feng, et al. Counting tobacco plants in hilly areas using UAV imagery and MSA-TCN modelJ. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2025, 41(21): 146-154. DOI: 10.11975/j.issn.1002-6819.202504141
    Citation: SUN Shouxin, SUN Yiming, ZHANG Feng, et al. Counting tobacco plants in hilly areas using UAV imagery and MSA-TCN modelJ. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2025, 41(21): 146-154. DOI: 10.11975/j.issn.1002-6819.202504141

    Counting tobacco plants in hilly areas using UAV imagery and MSA-TCN model

    • Accurate plant counting can play a vital role in precision agriculture and digital crop management. Especially, the tobacco cultivation is often required for the plant monitoring in the mountainous areas, due to the complex terrain and irregular planting patterns. Among them, tobacco plants can frequently suffer from severe occlusion, overlap, and morphological variations at the different growth stages. Existing computer vision can cause large deviations during predictions. In this study, the counting framework was proposed using unmanned aerial vehicle (UAV) imagery and a Multi-Scale Attention Temporal Convolutional Network (MSA-TCN). Several specially designed modules were integrated to enhance the feature extraction of the model for robustness under complex field conditions. Firstly, a front-end backbone network was constructed using VGG16 (Visual Geometry Group), in order to extract the primary structural features of the tobacco plants from UAV images. Secondly, a GroupDC (group convolution with feature shift and channel attention) module was introduced to combine the feature shifting with grouped convolution and channel attention. The receptive field was effectively enlarged to reinforce the local spatial interactions. The overlapped plants were then separated from the dense canopies. Thirdly, an Optimized DC (optimized multi-scale feature extraction with attention) module was developed using multi-branch dilated convolution. The information was captured from different receptive fields to incorporate the standard convolution. Thereby, the fine-grained details were refined suitable for the variations in the plant size and growth stage. Furthermore, an attention mechanism was embedded to selectively emphasize the discriminative features, in order to further suppress the background noise and non-target interference. Finally, an UP-Block (upsampling and feature aggregation block) structure was proposed to progressively aggregate the multi-scale features and then refine the density maps. The counting errors were reduced to produce more reliable outputs. The dataset consisted of 390 UAV images covering approximately 140 000 tobacco plants, including intercropped regions and areas heavily affected by weed interference, thereby providing diversity at the growth stages under field conditions. Experimental results demonstrate that the MSA-TCN model achieved a mean absolute error (MAE) of 6.07 plants, a root mean square error (RMSE) of 7.78 plants, a relative error (RE) of 1.69%, and a coefficient of determination (R²) of 0.996 on the test set. Compared with the existing density regression, superior robustness was obtained to overcome the occlusion, overlap, and background interference. The finding can provide an accurate and stable counting performance in complex mountainous environments. The valuable technical support can also offer precision tobacco cultivation, growth monitoring, and decision-making in intelligent agriculture.
    • loading

    Catalog

      /

      DownLoad:  Full-Size Img  PowerPoint
      Return
      Return