Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Mezzi, Emanuele, Mertzani, Asimina, Manis, Michael P., Lilova, Siyanna, Vadivoulis, Nicholas, Gatirdakis, Stamatis, Roussou, Styliani, Hmede, Rodayna
Format:	Preprint
Published:	2025
Subjects:	Computers and Society Artificial Intelligence
Online Access:	https://arxiv.org/abs/2504.01032
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915222891003904
author	Mezzi, Emanuele Mertzani, Asimina Manis, Michael P. Lilova, Siyanna Vadivoulis, Nicholas Gatirdakis, Stamatis Roussou, Styliani Hmede, Rodayna
author_facet	Mezzi, Emanuele Mertzani, Asimina Manis, Michael P. Lilova, Siyanna Vadivoulis, Nicholas Gatirdakis, Stamatis Roussou, Styliani Hmede, Rodayna
contents	Since the introduction of ChatGPT in 2022, Large language models (LLMs) and Large Multimodal Models (LMM) have transformed content creation, enabling the generation of human-quality content, spanning every medium, text, images, videos, and audio. The chances offered by generative AI models are endless and are drastically reducing the time required to generate content and usually raising the quality of the generation. However, considering the complexity and the difficult traceability of the generated content, the use of these tools provides challenges in attributing AI-generated content. The difficult attribution resides for a variety of reasons, starting from the lack of a systematic fingerprinting of the generated content and ending with the enormous amount of data on which LLMs and LMM are trained, which makes it difficult to connect generated content to the training data. This scenario is raising concerns about intellectual property and ethical responsibilities. To address these concerns, in this paper, we bridge the technological, ethical, and legislative aspects, by proposing a review of the legislative and technological instruments today available and proposing a legal framework to ensure accountability. In the end, we propose three use cases of how these can be combined to guarantee that attribution is respected. However, even though the techniques available today can guarantee a greater attribution to a greater extent, strong limitations still apply, that can be solved uniquely by the development of new attribution techniques, to be applied to LLMs and LMMs.
format	Preprint
id	arxiv_https___arxiv_org_abs_2504_01032
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Who Owns the Output? Bridging Law and Technology in LLMs Attribution Mezzi, Emanuele Mertzani, Asimina Manis, Michael P. Lilova, Siyanna Vadivoulis, Nicholas Gatirdakis, Stamatis Roussou, Styliani Hmede, Rodayna Computers and Society Artificial Intelligence Since the introduction of ChatGPT in 2022, Large language models (LLMs) and Large Multimodal Models (LMM) have transformed content creation, enabling the generation of human-quality content, spanning every medium, text, images, videos, and audio. The chances offered by generative AI models are endless and are drastically reducing the time required to generate content and usually raising the quality of the generation. However, considering the complexity and the difficult traceability of the generated content, the use of these tools provides challenges in attributing AI-generated content. The difficult attribution resides for a variety of reasons, starting from the lack of a systematic fingerprinting of the generated content and ending with the enormous amount of data on which LLMs and LMM are trained, which makes it difficult to connect generated content to the training data. This scenario is raising concerns about intellectual property and ethical responsibilities. To address these concerns, in this paper, we bridge the technological, ethical, and legislative aspects, by proposing a review of the legislative and technological instruments today available and proposing a legal framework to ensure accountability. In the end, we propose three use cases of how these can be combined to guarantee that attribution is respected. However, even though the techniques available today can guarantee a greater attribution to a greater extent, strong limitations still apply, that can be solved uniquely by the development of new attribution techniques, to be applied to LLMs and LMMs.
title	Who Owns the Output? Bridging Law and Technology in LLMs Attribution
topic	Computers and Society Artificial Intelligence
url	https://arxiv.org/abs/2504.01032

Similar Items