Abstract:
Intelligent and automation technologies have been widely applied to agricultural production in recent years. A green apricot is one of the most significant economic fruits in Asian areas. However, traditional identification cannot fully meet large-scale production under complex environmental conditions in precision management. Fruit recognition and detection technologies can be required to enhance the accuracy and efficiency of large-scale production at present. In this study, an efficient and lightweight target recognition model (MCBPnet) was developed to automatically detect the greet apricot fruits using advanced deep learning. The CBAM (convolutional block attention module) was first integrated into the MobileNetV3 architecture. The precision of the model was significantly enhanced to identify the apricot fruits with the most rich image areas. The regions of an image were then prioritized with the most relevant to fruit detection. More computational resources were effectively allocated to the features closely associated with the task. As a result, the model structure, IRCBAM (inverted residual convolutional block attention module) was then developed to incorporate as the backbone network of MCBPnet. BiFPN (Bi-directional feature pyramid network) was also introduced into the neck network, in order to further enhance the performance of the model. A feature pyramid was constructed to capture the multi-scale features from images using the BiFPN model. The fruits with varying sizes were then detected within a single frame. A high detection rate was also maintained in a wide range of fruit sizes and orientations. Moreover, some occlusions were also removed during detection, where the fruits were partially obscured by leaves or other fruits. The PConv (partial convolution) structure was utilized in the detection head. The high accuracy of detection was achieved, even when only a portion of a fruit was visible. As such, reliable recognition was obtained to increase its accuracy and robustness under conditions of partial visibility. The results demonstrated that the MCBPnet model achieved significant advantages across key performance metrics, including precision and mAP
50. Specifically, the high accuracy of detection was obtained with a precision of 0.988, an mAP
50 of 0.994. Furthermore, the detection speed of the MCBPnet model was 109.890 frames/s, which was 70.33% higher than that of the YOLOv8n model, and the model arithmetic was 6.1 G, which was 75.31% of that of the original one. This lightweight model was also deployed on edge devices, such as drones or handheld scanners, without any energy-intensive computational resources. In summary, the high-performance metrics were achieved by the MCBPnet model. The effectiveness of the integrated techniques was validated to maintain high accuracy during agricultural automation. Various indicators represented the significant advancement in the MCBPnet model. Advanced deep learning techniques and lightweight can be expected to improve the efficiency and accuracy of apricot detection during harvesting. The findings can provide technological support to the automated recognition and picking of green apricots. This MCBPnet model can greatly contribute to modern, intelligent, and sustainable crop management, thus driving the transition toward more efficient and intelligent agricultural practices.