Saved in:
Bibliographic Details
Main Authors: Thanapalasingam, Thiviyan, Vozikis, Antonis, Bloem, Peter, Groth, Paul
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.06707
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911427943464960
author Thanapalasingam, Thiviyan
Vozikis, Antonis
Bloem, Peter
Groth, Paul
author_facet Thanapalasingam, Thiviyan
Vozikis, Antonis
Bloem, Peter
Groth, Paul
contents Knowledge Graph (KG) generation requires models to learn complex semantic dependencies between triples while maintaining domain validity constraints. Unlike link prediction, which scores triples independently, generative models must capture interdependencies across entire subgraphs to produce semantically coherent structures. We present ARK (Auto-Regressive Knowledge Graph Generation), a family of autoregressive models that generate KGs by treating graphs as sequences of (head, relation, tail) triples. ARK learns implicit semantic constraints directly from data, including type consistency, temporal validity, and relational patterns, without explicit rule supervision. On the IntelliGraphs benchmark, our models achieve 89.2% to 100.0% semantic validity across diverse datasets while generating novel graphs not seen during training. We also introduce SAIL, a variational extension of ARK that enables controlled generation through learned latent representations, supporting both unconditional sampling and conditional completion from partial graphs. Our analysis reveals that model capacity (hidden dimensionality >= 64) is more critical than architectural depth for KG generation, with recurrent architectures achieving comparable validity to transformer-based alternatives while offering substantial computational efficiency. These results demonstrate that autoregressive models provide an effective framework for KG generation, with practical applications in knowledge base completion and query answering.
format Preprint
id arxiv_https___arxiv_org_abs_2602_06707
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Autoregressive Models for Knowledge Graph Generation
Thanapalasingam, Thiviyan
Vozikis, Antonis
Bloem, Peter
Groth, Paul
Artificial Intelligence
Knowledge Graph (KG) generation requires models to learn complex semantic dependencies between triples while maintaining domain validity constraints. Unlike link prediction, which scores triples independently, generative models must capture interdependencies across entire subgraphs to produce semantically coherent structures. We present ARK (Auto-Regressive Knowledge Graph Generation), a family of autoregressive models that generate KGs by treating graphs as sequences of (head, relation, tail) triples. ARK learns implicit semantic constraints directly from data, including type consistency, temporal validity, and relational patterns, without explicit rule supervision. On the IntelliGraphs benchmark, our models achieve 89.2% to 100.0% semantic validity across diverse datasets while generating novel graphs not seen during training. We also introduce SAIL, a variational extension of ARK that enables controlled generation through learned latent representations, supporting both unconditional sampling and conditional completion from partial graphs. Our analysis reveals that model capacity (hidden dimensionality >= 64) is more critical than architectural depth for KG generation, with recurrent architectures achieving comparable validity to transformer-based alternatives while offering substantial computational efficiency. These results demonstrate that autoregressive models provide an effective framework for KG generation, with practical applications in knowledge base completion and query answering.
title Autoregressive Models for Knowledge Graph Generation
topic Artificial Intelligence
url https://arxiv.org/abs/2602.06707