Ripeness detection of lotus seedpods in natural environments based on an improved YOLOv10n
Abstract
The seedpod is one of the most important parts of the lotus. Accurate and efficient detection of lotus seedpod maturity in natural environments is required for intelligent harvesting and precision agriculture. However, detection is hampered by variable lighting conditions, frequent occlusion by stems and leaves, and the small size of the seedpods against complex backgrounds. Conventional object detection models struggle to maintain high accuracy in such environments, especially under strong light, weak light, or severe occlusion. This study proposes the LotusM-YOLO model, which enhances the YOLOv10n architecture through three targeted improvements. Firstly, a dynamic convolution (DynamicConv) module was integrated into the backbone of YOLOv10n to improve its adaptability to varying lighting conditions. Multiple convolutional kernels were dynamically combined to extract robust features from images captured under strong or low light. Irrelevant background noise was suppressed while the essential features of the lotus seedpods were preserved, which significantly improved detection accuracy and stability in natural paddy field scenes with complex illumination. Secondly, the multi-scale efficient attention module (MultiSEAM) was used to improve the detection of small and partially occluded lotus seedpods by capturing contextual information across multiple feature scales. It also further suppressed interference from complex backgrounds, which improved robustness in visually cluttered environments. Finally, the convolutional block attention module (CBAM) applied channel attention followed by spatial attention to refine the feature representation, which improved the detection precision for lotus seedpods.
The attention mechanisms significantly reduced the rate of missed detections. Together, these attention modules strengthened the sensitivity of the model to occluded and small targets in natural environments while maintaining high detection accuracy. The LotusM-YOLO model was evaluated on a high-quality dataset of 2 411 manually annotated images of lotus seedpods captured under natural conditions. The dataset was randomly divided into training, validation, and test sets at a 7:2:1 ratio. The experimental results showed that LotusM-YOLO achieved a precision of 84.3%, a recall of 81.7%, and a mean average precision at an IoU threshold of 0.5 (mAP0.5) of 86.7%, which were 2.7, 2.5, and 3.9 percentage points higher, respectively, than those of the YOLOv10n baseline. Comparative experiments were then conducted against multiple detection models, including Faster R-CNN, YOLOv5n, YOLOv8n, YOLOv9, and YOLOv10n. The results demonstrated that LotusM-YOLO achieved higher detection precision and recall under strong light, low light, and partial occlusion, with significantly fewer missed detections. The model also exhibited stronger robustness in lotus seedpod detection under natural environmental conditions. Additionally, heatmaps generated with Gradient-weighted Class Activation Mapping (Grad-CAM) showed that, compared with YOLOv10n, the improved model focused more on the actual target areas and paid less attention to background clutter. Beyond detection performance, the LotusM-YOLO model has strong potential for real-world applications. Its detections can be combined with depth data from RGB-D or LiDAR sensors to localize seedpods accurately in 3D and guide robotic arms during picking.
Consequently, the LotusM-YOLO model can provide a theoretical basis for monitoring the growth status of lotus seedpods and for intelligent harvesting equipment operating in natural environments.
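The figures quoted in the abstract can be cross-checked with a short calculation. The following Python sketch is illustrative only and is not the authors' code: it shows one way a random 7:2:1 split of 2 411 images could be carried out, and it recovers the YOLOv10n baseline scores implied by the reported gains. The function names, the random seed, and the rounding rule (training and validation sizes rounded to the nearest integer, test set taking the remainder) are assumptions, since the abstract does not state how fractional set sizes were handled.

```python
import random

# Figures quoted in the abstract.
TOTAL_IMAGES = 2411
REPORTED = {"precision": 84.3, "recall": 81.7, "mAP0.5": 86.7}  # LotusM-YOLO, %
GAINS = {"precision": 2.7, "recall": 2.5, "mAP0.5": 3.9}        # percentage points


def split_7_2_1(items, seed=0):
    """Shuffle items and split them 7:2:1 into train/val/test lists.

    Assumption: train and val sizes are rounded to the nearest integer
    and the test set takes the remainder; the seed is arbitrary.
    """
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    n = len(shuffled)
    n_train = round(0.7 * n)
    n_val = round(0.2 * n)
    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]
    return train, val, test


def implied_baseline(reported, gains):
    """Subtract the percentage-point gains to recover the baseline scores."""
    return {name: round(reported[name] - gains[name], 1) for name in reported}


# Image indices stand in for the annotated image files.
train, val, test = split_7_2_1(range(TOTAL_IMAGES))
baseline = implied_baseline(REPORTED, GAINS)
print(len(train), len(val), len(test))
print(baseline)
```

Under this rounding rule the split contains 1 688 training, 482 validation, and 241 test images, and the implied YOLOv10n baseline is a precision of 81.6%, a recall of 79.2%, and an mAP0.5 of 82.8%.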