Saved in:
Bibliographic Details
Main Authors: Xiao, Yongkang, Zhang, Sinian, Dai, Yi, Zhou, Huixue, Hou, Jue, Ding, Jie, Zhang, Rui
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2506.00708
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908639681314816
author Xiao, Yongkang
Zhang, Sinian
Dai, Yi
Zhou, Huixue
Hou, Jue
Ding, Jie
Zhang, Rui
author_facet Xiao, Yongkang
Zhang, Sinian
Dai, Yi
Zhou, Huixue
Hou, Jue
Ding, Jie
Zhang, Rui
contents Knowledge graph completion (KGC) aims to predict missing triples in knowledge graphs (KGs) by leveraging existing triples and textual information. Recently, generative large language models (LLMs) have been increasingly employed for graph tasks. However, current approaches typically encode graph context in textual form, which fails to fully exploit the potential of LLMs for perceiving and reasoning about graph structures. To address this limitation, we propose DrKGC (Dynamic Subgraph Retrieval-Augmented LLMs for Knowledge Graph Completion). DrKGC employs a flexible lightweight model training strategy to learn structural embeddings and logical rules within the KG. It then leverages a novel bottom-up graph retrieval method to extract a subgraph for each query guided by the learned rules. Finally, a graph convolutional network (GCN) adapter uses the retrieved subgraph to enhance the structural embeddings, which are then integrated into the prompt for effective LLM fine-tuning. Experimental results on two general domain benchmark datasets and two biomedical datasets demonstrate the superior performance of DrKGC. Furthermore, a realistic case study in the biomedical domain highlights its interpretability and practical utility.
format Preprint
id arxiv_https___arxiv_org_abs_2506_00708
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle DrKGC: Dynamic Subgraph Retrieval-Augmented LLMs for Knowledge Graph Completion across General and Biomedical Domains
Xiao, Yongkang
Zhang, Sinian
Dai, Yi
Zhou, Huixue
Hou, Jue
Ding, Jie
Zhang, Rui
Artificial Intelligence
Computation and Language
Machine Learning
Knowledge graph completion (KGC) aims to predict missing triples in knowledge graphs (KGs) by leveraging existing triples and textual information. Recently, generative large language models (LLMs) have been increasingly employed for graph tasks. However, current approaches typically encode graph context in textual form, which fails to fully exploit the potential of LLMs for perceiving and reasoning about graph structures. To address this limitation, we propose DrKGC (Dynamic Subgraph Retrieval-Augmented LLMs for Knowledge Graph Completion). DrKGC employs a flexible lightweight model training strategy to learn structural embeddings and logical rules within the KG. It then leverages a novel bottom-up graph retrieval method to extract a subgraph for each query guided by the learned rules. Finally, a graph convolutional network (GCN) adapter uses the retrieved subgraph to enhance the structural embeddings, which are then integrated into the prompt for effective LLM fine-tuning. Experimental results on two general domain benchmark datasets and two biomedical datasets demonstrate the superior performance of DrKGC. Furthermore, a realistic case study in the biomedical domain highlights its interpretability and practical utility.
title DrKGC: Dynamic Subgraph Retrieval-Augmented LLMs for Knowledge Graph Completion across General and Biomedical Domains
topic Artificial Intelligence
Computation and Language
Machine Learning
url https://arxiv.org/abs/2506.00708