Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Chen, Dexiong, Krimmel, Markus, Borgwardt, Karsten
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2502.02216
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917132578586624
author	Chen, Dexiong Krimmel, Markus Borgwardt, Karsten
author_facet	Chen, Dexiong Krimmel, Markus Borgwardt, Karsten
contents	We introduce AutoGraph, a scalable autoregressive model for attributed graph generation using decoder-only transformers. By flattening graphs into random sequences of tokens through a reversible process, AutoGraph enables modeling graphs as sequences without relying on additional node features that are expensive to compute, in contrast to diffusion-based approaches. This results in sampling complexity and sequence lengths that scale optimally linearly with the number of edges, making it scalable and efficient for large, sparse graphs. A key success factor of AutoGraph is that its sequence prefixes represent induced subgraphs, creating a direct link to sub-sentences in language modeling. Empirically, AutoGraph achieves state-of-the-art performance on synthetic and molecular benchmarks, with up to 100x faster generation and 3x faster training than leading diffusion models. It also supports substructure-conditioned generation without fine-tuning and shows promising transferability, bridging language modeling and graph generation to lay the groundwork for graph foundation models. Our code is available at https://github.com/BorgwardtLab/AutoGraph.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_02216
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Flatten Graphs as Sequences: Transformers are Scalable Graph Generators Chen, Dexiong Krimmel, Markus Borgwardt, Karsten Machine Learning We introduce AutoGraph, a scalable autoregressive model for attributed graph generation using decoder-only transformers. By flattening graphs into random sequences of tokens through a reversible process, AutoGraph enables modeling graphs as sequences without relying on additional node features that are expensive to compute, in contrast to diffusion-based approaches. This results in sampling complexity and sequence lengths that scale optimally linearly with the number of edges, making it scalable and efficient for large, sparse graphs. A key success factor of AutoGraph is that its sequence prefixes represent induced subgraphs, creating a direct link to sub-sentences in language modeling. Empirically, AutoGraph achieves state-of-the-art performance on synthetic and molecular benchmarks, with up to 100x faster generation and 3x faster training than leading diffusion models. It also supports substructure-conditioned generation without fine-tuning and shows promising transferability, bridging language modeling and graph generation to lay the groundwork for graph foundation models. Our code is available at https://github.com/BorgwardtLab/AutoGraph.
title	Flatten Graphs as Sequences: Transformers are Scalable Graph Generators
topic	Machine Learning
url	https://arxiv.org/abs/2502.02216

Similar Items