Table of Contents: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wührl, Amelie, Ruckdeschel, Mattes, Lo, Kyle, Rogers, Anna
Format:	Preprint
Published:	2026
Subjects:	Computers and Society
Online Access:	https://arxiv.org/abs/2602.10995
Tags:	Add Tag No Tags, Be the first to tag this record!

Table of Contents:

In the current Large Language Model (LLM) ecosystem, creators have little agency over how their data is used, and LLM users may find themselves unknowingly plagiarizing existing sources. Attribution of LLM-generated text to LLM input data could help with these challenges, but so far we have more questions than answers: what elements of LLM outputs require attribution, what goals should it serve, how should it be implemented? We contribute a human-centric data attribution framework, which situates the attribution problem within the broader data economy. Specific use cases for attribution, such as creative writing assistance or fact-checking, can be specified via a set of parameters (including stakeholder objectives and implementation criteria). These criteria are up for negotiation by the relevant stakeholder groups: creators, LLM users, and their intermediaries (publishers, platforms, AI companies). The outcome of domain-specific negotiations can be implemented and tested for whether the stakeholder goals are achieved. The proposed approach provides a bridge between methodological NLP work on data attribution, governance work on policy interventions, and economic analysis of creator incentives for a sustainable equilibrium in the data economy.

Similar Items