语义信息辅助的LiDAR-IMU融合定位方法

    Semantic Information-Assisted LiDAR-IMU Fusion Localization Method

    • 摘要: 农业机器人的精准定位是智慧农业中机器人自主作业的关键。但是在面对结构复杂或几何特征不明显的场景,传统的基于SLAM(Simultaneous Localization and Mapping)的定位方法常利用几何特征,对场景的理解能力不足,容易出现匹配不稳定,影响定位精度。为此,本文提出了一种融合语义信息的多传感器紧耦合定位方法。系统以LiDAR点云和IMU(inertial measurement unit)数据为输入,经预处理后提取几何与语义特征,结合语义类别的重要性设定差异化语义权重参与点云的匹配。后续采用历史配准残差进行统计分析实现语义权重的调整从而优化特征匹配的可靠性,提高位姿估计精度。在后端优化中,将前端里程计输出的位姿估计结果与IMU预积分约束共同作为因子图优化的约束因子,实现联合优化提高定位精度。在开源数据集和自采数据集中对本文方法进行了验证,在开源数据集的结果表明,本文方法在大部分指标上比其他经典的方法更优。本文方法在Sequence.05、Sequence 07、Sequence 10数据集序列的绝对轨迹误差分别为2.88 m、0.82 m、1.39 m,在自采的数据集中较其他两种方法的绝对估计误差分别降低了17.65%和59.02%,该研究结果为农业机器人的精准定位提供了思路。

       

      Abstract: Agricultural environments were generally characterized by weak geometric saliency, repetitive vegetation structures, and frequent dynamic disturbances, which posed significant challenges to conventional simultaneous localization and mapping systems that primarily relied on geometric feature matching. Under such conditions, unstable data association and unreliable residual estimation were often produced, leading to degraded pose accuracy and accumulated localization drift. Therefore, this study was conducted with the objective of improving localization accuracy and robustness for agricultural robots by introducing semantic information into a tightly coupled LiDAR–IMU SLAM framework, such that feature association and optimization could be guided by category-dependent reliability rather than geometry alone.In the proposed framework, LiDAR point clouds and inertial measurement unit data were jointly utilized. Raw LiDAR scans were first processed by motion distortion correction and outlier removal to mitigate the effects of sensor motion and noise. Subsequently, a semantic segmentation network (RangeNet++) was employed to assign semantic labels to each point, generating semantically annotated point clouds. To balance computational efficiency and semantic preservation, a semantic-aware downsampling strategy was adopted, in which different semantic categories were sampled with different densities according to their spatial distribution and relevance to pose estimation. During LiDAR odometry estimation, curvature-based edge and planar features were extracted and associated between consecutive frames. Semantic information was explicitly embedded into the optimization process by assigning category-specific weights to residual terms. Moreover, a dynamic semantic weighting mechanism based on historical residual statistics was introduced. Residuals accumulated within a sliding window were statistically analyzed to adaptively update semantic weights, so that categories producing larger residuals were down-weighted, while categories exhibiting stable matching behavior were reinforced. On the back end, LiDAR odometry constraints and IMU pre-integration constraints were jointly formulated within a factor-graph framework. Keyframes were selected according to translational and rotational motion thresholds, and all constraints were optimized through nonlinear least-squares optimization.Extensive experiments were carried out on both public benchmark datasets and self-collected agricultural datasets to evaluate the effectiveness of the proposed method. On the KITTI benchmark, the method, referred to as LID-SLAM, was evaluated on Sequences 05, 07, and 10 and compared with LIS-SLAM, LIO-SAM, and LeGO-LOAM, with loop closure disabled for fair comparison. On Sequence 05, which contained frequent turns, occlusions, and dynamic traffic participants, the influence of unreliable semantic categories was effectively suppressed. As a result, a mean absolute trajectory error of 2.48 m and a root mean square error (RMSE) of 2.88 m were achieved, outperforming LIO-SAM (mean 2.88 m, RMSE 3.41 m) and significantly surpassing LeGO-LOAM (mean 9.36 m, RMSE 10.1 m). On Sequence 07, representing a relatively static and open environment, stable performance was maintained, with a mean error of 0.78 m and an RMSE of 0.82 m, which were comparable to those of LIO-SAM and markedly better than those of LIS-SLAM and LeGO-LOAM. On the more challenging Sequence 10, which involved complex dynamics and higher motion speeds, peak and long-tail errors were substantially reduced. The maximum error was lowered to 3.77 m, and the RMSE was reduced to 1.39 m, compared with 2.22 m for LIO-SAM and 7.92 m for LeGO-LOAM.Ablation experiments were further conducted to assess the contribution of the dynamic semantic weighting mechanism. When the dynamic weighting module was removed, larger local deviations and increased error fluctuations were observed, although the overall trajectory shape was preserved. In contrast, introducing dynamic semantic weights led to consistent improvements, reducing the maximum error by approximately 19.23% and decreasing RMSE by up to 14.9% on representative sequences. On a self-collected agricultural dataset featuring dense vegetation, mixed terrain, and narrow roads, the proposed method consistently produced trajectories that closely aligned with the ground truth. Even after long straight motions followed by sharp turns, no obvious drift or trajectory break was observed. Quantitatively, the RMSE was reduced to 0.084 m, compared with 0.102 m for LIO-SAM and 0.205 m for A-LOAM, corresponding to relative reductions of 17.65% and 59.02%, respectively.In conclusion, a semantic-assisted, LiDAR–IMU tightly coupled SLAM framework was presented in this study. By combining semantic-aware downsampling, category-weighted residual modeling, and residual-feedback-driven dynamic semantic weight adaptation within a factor-graph optimization backend, data association reliability was improved and error accumulation was effectively suppressed in weak-geometry and dynamic environments. Experimental results on benchmark and real-world agricultural datasets demonstrated that the proposed approach significantly enhanced localization accuracy and robustness, providing a practical and effective solution for high-precision autonomous navigation of agricultural robots.

       

    /

    返回文章
    返回