基于大模型规则增强与通道注意力的两阶段小麦知识图谱补全

    Two-stage wheat knowledge graph completion method based on large language models rule augmentation and channel attention

    • 摘要: 农业数据呈现零散和碎片化的特点,导致构建的农业知识图谱常面临数据稀疏和缺失问题。现有的基于神经网络的知识图谱补全模型在长距离依赖关系的建模上仍存在不足。为解决上述问题,该研究以小麦为例,提出两阶段的知识图谱补全模型(rule augmentation and se-enhanced atrous spatial pyramid pooling with residual connections for knowledge graph completion,RASE-ARKGC)。第一阶段,使用大语言模型挖掘潜在规则来进行数据增强(rule augmentation,RA),解决数据稀疏问题。第二阶段,引入通道注意力机制和空洞卷积(se-enhanced atrous spatial pyramid pooling with residual connections for knowledge graph completion,SE-ARKGC),进行特征提取和匹配,实现知识补全。在自构建的小麦知识图谱数据集WheatSeedBiz上进行方法验证,试验结果表明:RASE-ARKGC在MRR和Hits@10指标上分别达0.482和0.555,与ConvE模型相比分别提高了9.8%和10.2%。同时,为验证SE-ARKGC模型的有效性和泛化性,在开源数据集WN18和FB15k上进行了试验,在WN18数据集上MRR和Hits@10分别达到0.947和0.955,在FB15k数据集上MRR和Hits@10分别达0.826和0.891。两阶段的RASE-ARKGC模型不仅有效扩充了数据,还在知识补全任务中取得了最佳效果,为小麦领域的知识图谱补全提供了改进的思路,同时具备良好的通用性,可推广至其他领域。

       

      Abstract: A knowledge graph is often required to collect and integrate the vast amount of data with the rapid development of agricultural informatization in recent years. But the data can be characterized by dispersal and fragmentation. Relevant data is usually stored separately in different institutions and databases, leading to data sparsity and missing information in the agricultural knowledge system. Among them, knowledge graph completion can be used to extract the potential semantic information using a neural network. The domain-specific data features can also be obtained after special domain knowledge graph applications using convolution and pooling operations. However, existing knowledge completion models still struggle to effectively capture the long-range dependencies in the complex relational scenarios. In this study, a two-stage knowledge graph completion model (RASE-ARKGC) was proposed using large language models with rule augmentation and channel attention. The wheat was taken as an example. In the first stage, the large language models (LLMs) were used for the rule augmentation (RA). In particular, both the semantic and structural information of the knowledge graph were utilized to generate the logical rules. Firstly, the paths were sampled from the knowledge graph to represent the structural information. Secondly, the rule generator was adopted to mine the potential rules using semantic and structural information. Thirdly, the logical rule sequence was used to evaluate the quality of the rules and then filter out the outliers. As such, the rule augmentation module effectively expanded the dataset in order to alleviate the data sparsity in the knowledge graph. The channel attention mechanism and dilated convolution (SE-ARKGC) were introduced in the second stage. Specifically, the channel attention mechanism was enhanced to capture the long-range dependencies. While the dilated convolution was used to adjust the dilation rate for the multi-scale receptive fields, in order to capture the multi-level interactions between entities and relationships. In addition, the residual module was also used to effectively mitigate the gradient vanishing in the deep network, in order to further enhance the expression of the model. The complete transfer of feature information was then realized in the multilayer network. A series of experiments was conducted on the self-constructed wheat knowledge graph (WheatSeedBiz) in order to validate the effectiveness of the two-stage knowledge graph completion. The results showed that the RASE-ARKGC model achieved 0.482 in MRR and 0.555 in Hits@10, which were improved by 9.8% and 10.2%, respectively, compared with the ConvE model. In addition, the experiments were conducted on the open-source datasets WN18 and FB15k to evaluate the generalization of the SE-ARKGC model. The experimental results showed that the optimal or sub-optimal performance was achieved over multiple baseline models. Ablation experiments were also carried out to evaluate the rule augmentation and the SE-Block-based knowledge graph completion. Each module was enhanced the performance of the model. Overall, the RASE-ARKGC model effectively expanded the dataset for optimal performance in the knowledge completion task. The finding can provide an improved approach for the knowledge graph completion in the wheat domain. The generalization of the model can be applicable to the rest domains.

       

    /

    返回文章
    返回