Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Ludwig, Florian, Zesch, Torsten, Zufall, Frederike
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2506.03009
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908391664779264
author	Ludwig, Florian Zesch, Torsten Zufall, Frederike
author_facet	Ludwig, Florian Zesch, Torsten Zufall, Frederike
contents	The assessment of legal problems requires the consideration of a specific legal system and its levels of abstraction, from constitutional law to statutory law to case law. The extent to which Large Language Models (LLMs) internalize such legal systems is unknown. In this paper, we propose and investigate different approaches to condition LLMs at different levels of abstraction in legal systems. This paper examines different approaches to conditioning LLMs at multiple levels of abstraction in legal systems to detect potentially punishable hate speech. We focus on the task of classifying whether a specific social media posts falls under the criminal offense of incitement to hatred as prescribed by the German Criminal Code. The results show that there is still a significant performance gap between models and legal experts in the legal assessment of hate speech, regardless of the level of abstraction with which the models were conditioned. Our analysis revealed, that models conditioned on abstract legal knowledge lacked deep task understanding, often contradicting themselves and hallucinating answers, while models using concrete legal knowledge performed reasonably well in identifying relevant target groups, but struggled with classifying target conducts.
format	Preprint
id	arxiv_https___arxiv_org_abs_2506_03009
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Conditioning Large Language Models on Legal Systems? Detecting Punishable Hate Speech Ludwig, Florian Zesch, Torsten Zufall, Frederike Computation and Language Artificial Intelligence The assessment of legal problems requires the consideration of a specific legal system and its levels of abstraction, from constitutional law to statutory law to case law. The extent to which Large Language Models (LLMs) internalize such legal systems is unknown. In this paper, we propose and investigate different approaches to condition LLMs at different levels of abstraction in legal systems. This paper examines different approaches to conditioning LLMs at multiple levels of abstraction in legal systems to detect potentially punishable hate speech. We focus on the task of classifying whether a specific social media posts falls under the criminal offense of incitement to hatred as prescribed by the German Criminal Code. The results show that there is still a significant performance gap between models and legal experts in the legal assessment of hate speech, regardless of the level of abstraction with which the models were conditioned. Our analysis revealed, that models conditioned on abstract legal knowledge lacked deep task understanding, often contradicting themselves and hallucinating answers, while models using concrete legal knowledge performed reasonably well in identifying relevant target groups, but struggled with classifying target conducts.
title	Conditioning Large Language Models on Legal Systems? Detecting Punishable Hate Speech
topic	Computation and Language Artificial Intelligence
url	https://arxiv.org/abs/2506.03009

Similar Items