Table of Contents: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Day, Huw, Jezierska, Adrianna, Woodgate, Jessica
Format:	Preprint
Published:	2026
Subjects:	Human-Computer Interaction Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.01942
Tags:	Add Tag No Tags, Be the first to tag this record!

Table of Contents:

Large Language Models have intensified the scale and strategic manipulation of political discourse on social media, leading to conflict escalation. The existing literature largely focuses on platform-led moderation as a countermeasure. In this paper, we propose a user-centric view of "jailbreaking" as an emergent, non-violent de-escalation practice. Online users engage with suspected LLM-powered accounts to circumvent large language model safeguards, exposing automated behaviour and disrupting the circulation of misleading narratives.

Similar Items