Staff View: :: Library Catalog

Bibliographic Details
Main Authors:	Aivalis, Theodoros; Klampanos, Iraklis A.; Troumpoukis, Antonis; Jose, Joemon M.
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2512.02713

_version_	1866914177573978112
author	Aivalis, Theodoros; Klampanos, Iraklis A.; Troumpoukis, Antonis; Jose, Joemon M.
author_facet	Aivalis, Theodoros; Klampanos, Iraklis A.; Troumpoukis, Antonis; Jose, Joemon M.
contents	As generative models become powerful, concerns around transparency, accountability, and copyright violations have intensified. Understanding how specific training data contributes to a model's output is critical. We introduce a framework for interpreting generative outputs through the automatic construction of ontology-aligned knowledge graphs (KGs). While automatic KG construction from natural text has advanced, extracting structured and ontology-consistent representations from visual content remains challenging, due to the richness and multi-object nature of images. Leveraging multimodal large language models (LLMs), our method extracts structured triples from images, aligned with a domain-specific ontology. By comparing the KGs of generated and training images, we can trace potential influences, enabling copyright analysis, dataset transparency, and interpretable AI. We validate our method through experiments on locally trained models via unlearning, and on large-scale models through a style-specific experiment. Our framework supports the development of AI systems that foster human collaboration and creativity and stimulate curiosity.
format	Preprint
id	arxiv_https___arxiv_org_abs_2512_02713
institution	arXiv
publishDate	2025
record_format	arxiv
title	Training Data Attribution for Image Generation using Ontology-Aligned Knowledge Graphs
topic	Artificial Intelligence
url	https://arxiv.org/abs/2512.02713

Similar Items