ROFNet:基于RGB和光流多模态融合的羊产前行为识别

    ROFNet: Sheep behavior before lambing recognition with multimodel fusion of RGB and optical flow

    • 摘要: 针对母羊产前行为的视觉特征不明显,并且单一模态识别精度不足的问题,该研究提出了一种基于RGB流和光流多模态融合的母羊产前行为识别网络(RGB and optical flow fusion network,ROFNet),以实现对母羊分娩前的站立、躺卧、刨地等低区分度行为的检测。利用安装在羊圈旁的摄像头采集临产母羊行为视频,将视频帧预处理获取双流数据,选择视频序列的中间帧为代表帧作为RGB流输入以提取整体的空间特征,同时从视频的连续帧中获取光流可视化图像以提取运动的时序特征。双流特征提取层分别在RGB流和光流中提取对应特征。最后通过注意力特征融合模块对结合后的双流特征进行深度融合,获取最终的行为分类结果。试验结果表明,ROFNet的F1值达到88.54%,相比C3D、SlowFast、TimeSformer、Video Swin和TSN方法,分别提高了3.49、19.13、17.34、25.19、11.17个百分点。ROFNet通过分析监控视频数据,实现对分娩前母羊行为的无接触式识别,尤其能够识别母羊的刨地行为,可为规模化羊场下母羊分娩的及时预警提供技术支撑。

       

      Abstract: Animal husbandry has been characterized by the intensification and scale in recent years. Some issues of animal welfare have emerged gradually, such as abnormal animal behavior and disease outbreaks. The high mortality rate of newborn lambs is one of the primary influencing factors on the development of the sheep industry. Prolonged parturition resulting from dystocia in ewes can lead to hypoxia and subsequent asphyxia in lambs, which stands as the main cause of newborn lamb deaths. Timely manual intervention is required to mitigate such risks. Therefore, the intelligent farming technologies can be expected to promptly and accurately identify animal behaviors for high breeding efficiency. Attention to the status of ewes approaching parturition and timely artificial assistance can also increase the survival rate of ewes and neonatal lambs. The prenatal behaviors of ewes in labor include the repetitive actions of standing up and then lying down, along with pawing the ground using their forehooves. In particular, the pawing the ground with the front hooves can differ from the daily behaviors; It is also difficult to recognize using conventional identification networks, because it is performed very rapidly. This study aims to develop a sheep behavior before lambing recognition network model (ROFNet) using the multimodal fusion of RGB and optical flow features, particularly with the help of motion information expressed by optical streams. The behavioral videos of the lambing ewes were captured and manually screened to identify the video segments of standing up, lying down, walking, and pawing the ground. The filtered videos were processed by frame-cutting. The middle frame of each video segment was selected as the representative frame for the RGB stream, thus providing for the overall spatial features. At the same time, an optical flow model was employed to obtain the optical flow data of the video frames. The color channels of the optical flow data were altered and superimposed to provide the temporal features of motion. ROFNet adopted a dual-stream network architecture to extract the spatial and temporal features using the RGB representative frames and optical flow frames, respectively. The two features were then fused using the Attentional Feature Fusion (AFF) module. The experimental results show that the ROFNet outperformed the existing advanced methods in several key metrics, including precision (88.10%), recall (89.34%), F1 score (88.54%), and accuracy (92.07%). Visual analysis by t-distributed stochastic neighbor embedding(t-SNE) shows that the ROFNet performed best in the confusable behaviors. The CAM heatmap further demonstrated that the ROFNet was used to focus more strongly on the front-leg region. A series of experiments demonstrates that the single modality was effectively avoided in the action recognition. Among them, the RGB stream and optical flow were used to extract the features via a dual-stream network architecture. The spatial features of RGB images were combined with the temporal features of optical flow images. Therefore, the ROFNet was achieved in the non-contact behavior recognition of the lambing ewes in labor using surveillance videos. The excellent performance was achieved in the accuracy and computational efficiency, indicating the great potential for practical application. The finding can also provide the technical support for the ewe parturition in intelligent livestock farming.

       

    /

    返回文章
    返回