安志强, 李智军, 刘硕, 赵永刚, 陈启俊, 左然涛, 林远山. 融合YOLOv7和BYTE多目标跟踪的多类别海珍品计数方法[J]. 农业工程学报, 2023, 39(9): 183-189. DOI: 10.11975/j.issn.1002-6819.202301076
    引用本文: 安志强, 李智军, 刘硕, 赵永刚, 陈启俊, 左然涛, 林远山. 融合YOLOv7和BYTE多目标跟踪的多类别海珍品计数方法[J]. 农业工程学报, 2023, 39(9): 183-189. DOI: 10.11975/j.issn.1002-6819.202301076
    AN Zhiqiang, LI Zhijun, LIU Shuo, ZHAO Yonggang, CHEN Qijun, ZUO Rantao, LIN Yuanshan. Multi-category sea foods counting method integrating YOLOv7 and BYTE multi-target tracking[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2023, 39(9): 183-189. DOI: 10.11975/j.issn.1002-6819.202301076
    Citation: AN Zhiqiang, LI Zhijun, LIU Shuo, ZHAO Yonggang, CHEN Qijun, ZUO Rantao, LIN Yuanshan. Multi-category sea foods counting method integrating YOLOv7 and BYTE multi-target tracking[J]. Transactions of the Chinese Society of Agricultural Engineering (Transactions of the CSAE), 2023, 39(9): 183-189. DOI: 10.11975/j.issn.1002-6819.202301076

    融合YOLOv7和BYTE多目标跟踪的多类别海珍品计数方法

    Multi-category sea foods counting method integrating YOLOv7 and BYTE multi-target tracking

    • 摘要: 针对目前养殖过程中海珍品计数方法成本高、效率低、计数精度难以保障等问题,该研究以真实底播养殖环境下的海珍品为研究对象,以水下拍摄的海珍品视频为数据源,提出一种基于视频多目标跟踪的多类别海珍品计数方法。首先,采用性能优异的YOLOv7算法实现海珍品目标检测器,为多目标跟踪提供输入;然后,结合真实养殖环境下同类别海珍品外观相似性高、不清晰等特点,借鉴BYTE算法的多目标跟踪思想,设计多类别轨迹生成策略和基于轨迹ID号的计数策略,提出一种多类别海珍品跟踪与计数方法。并提出一套更适用于基于轨迹ID号计数方法的评估指标。试验结果表明,改进平均计数精度、改进平均绝对误差、改进均方根误差及帧率分别为91.62%、5.75、6.38和32帧/s,各项指标多优于YOLOv5+DeepSORT、YOLOv7+DeepSORT、YOLOv5+BYTE、YOLOv7+BYTE等算法,尤其改进平均计数精度、帧率指标比YOLOv5+DeepSORT高了29.51个百分点和8帧/s,且在改进平均绝对误差、改进均方根误差指标上分别降低19.50和12.08。该研究方法可有效帮助水产养殖企业掌握水下海珍品数量,为现代化渔业的测产研究提供技术参考,为水产养殖的智慧管理提供科学决策依据。

       

      Abstract: Accurate and rapid counting has been one of the most important steps on sea foods in modern aquaculture. Taking the sea foods in a real bottom-sowing aquaculture environment as the research object, this study aims to guarantee the cost-saving, highly efficient and easy counting on the multi-category sea foods using video multi-target tracking. The underwater videos of sea foods were taken as the data source. Firstly, the YOLOv7 algorithm with excellent performance was used to realize the target detector of sea treasure, providing the input for the multi-target tracking. Specifically, the image data was captured under real underwater using the "2017 Underwater Robot Picking Contest, URPC2017", and then used to train the YOLOv7 model. 17,655 images contained the holothurians, echinus, and scallops. The images were randomly divided into the training set of 14055 images in the ratio of 8:1:1, the verification set of 1800 images, and the test set of 1800 images. The training setting of the YOLOv7 detector was set as the image adaptive size of 640*640, the initial learning rate of 0.01, the momentum of 0.9, the weight decay of 0.0005, the batch size of 16, the number of training rounds of 300. One test was performed every 10 rounds of training round. The operating system was selected the Ubuntu18.04 for Haizhenpin detector training, the deep learning framework was PyTorch, the experimental processor was AMD Ryzen Threadripper 1920X 12 cores, and the graphics card was NVIDIA GeForce RTX2080. Secondly, the characteristics of high similarity were combined with the unclear appearance of the same type of sea foods in the real breeding environment. The multi-target tracking of the BYTE algorithm was used for the reference. A multi-category trajectory generation and a counting strategy were then designed for the sea treasure tracking using the trajectory ID number. Finally, an optimal combination of the indicators was also evaluated on the performance of the improved model. The test results show that the average counting precision, mean absolute error (MAE), root mean square error (RMSE), and frame rate were 91.62%, 5.75, 6.38, and 32 frames/s, respectively. All indicators were better than those of the current YOLOv5+DeepSORT, YOLOv7+DeepSORT, YOLOv5+BYTE, and YOLOv7+BYTE. Especially, the average counting accuracy and frame rate index ratio of YOLOv5+DeepSORT were improved by 29.51, and 28 percentage points than before. The MAE and RMSE of the improved model were reduced by 19.50 and 12.08, respectively. The quantity of underwater sea foods was effectively measured in the modern fishery. The finding can provide the technical reference for the production measurement and the scientific decision-making on the intelligent management of aquaculture. In addition, the underwater dataset was used to train the detection model under the same environment, in order to reduce the false detections caused by different conditions. Nevertheless, the underwater environment changes greatly, due to various influencing factors, such as lighting. Statistical counting was then performed to collect the video of marine treasures under the same environment every time during training. Therefore, the current model can be expected to improve on the different underwater environments in the future.

       

    /

    返回文章
    返回