Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zaporojets, Klim, Daza, Daniel, Barba, Edoardo, Assent, Ira, Navigli, Roberto, Groth, Paul
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2507.03617
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918431531466752
author	Zaporojets, Klim Daza, Daniel Barba, Edoardo Assent, Ira Navigli, Roberto Groth, Paul
author_facet	Zaporojets, Klim Daza, Daniel Barba, Edoardo Assent, Ira Navigli, Roberto Groth, Paul
contents	Knowledge Graphs (KGs) are structured knowledge repositories containing entities and relations between them. In this paper, we study the problem of automatically updating KGs over time in response to evolving knowledge in unstructured textual sources. Addressing this problem requires identifying a wide range of update operations based on the state of an existing KG at a given time and the information extracted from text. This contrasts with traditional information extraction pipelines, which extract knowledge from text independently of the current state of a KG. To address this challenge, we propose a method for construction of a dataset consisting of Wikidata KG snapshots over time and Wikipedia passages paired with the corresponding edit operations that they induce in a particular KG snapshot. The resulting dataset comprises 233K Wikipedia passages aligned with a total of 1.45 million KG edits over 7 different yearly snapshots of Wikidata from 2019 to 2025. Our experimental results highlight key challenges in updating KG snapshots based on emerging textual knowledge, particularly in integrating knowledge expressed in text with the existing KG structure. These findings position the dataset as a valuable benchmark for future research. Our dataset and model implementations are publicly available.
format	Preprint
id	arxiv_https___arxiv_org_abs_2507_03617
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	EMERGE: A Benchmark for Updating Knowledge Graphs with Emerging Textual Knowledge Zaporojets, Klim Daza, Daniel Barba, Edoardo Assent, Ira Navigli, Roberto Groth, Paul Computation and Language Knowledge Graphs (KGs) are structured knowledge repositories containing entities and relations between them. In this paper, we study the problem of automatically updating KGs over time in response to evolving knowledge in unstructured textual sources. Addressing this problem requires identifying a wide range of update operations based on the state of an existing KG at a given time and the information extracted from text. This contrasts with traditional information extraction pipelines, which extract knowledge from text independently of the current state of a KG. To address this challenge, we propose a method for construction of a dataset consisting of Wikidata KG snapshots over time and Wikipedia passages paired with the corresponding edit operations that they induce in a particular KG snapshot. The resulting dataset comprises 233K Wikipedia passages aligned with a total of 1.45 million KG edits over 7 different yearly snapshots of Wikidata from 2019 to 2025. Our experimental results highlight key challenges in updating KG snapshots based on emerging textual knowledge, particularly in integrating knowledge expressed in text with the existing KG structure. These findings position the dataset as a valuable benchmark for future research. Our dataset and model implementations are publicly available.
title	EMERGE: A Benchmark for Updating Knowledge Graphs with Emerging Textual Knowledge
topic	Computation and Language
url	https://arxiv.org/abs/2507.03617

Similar Items