Bibliographic Details
Main Authors: Bertollo, Giacomo; Bodemir, Naz; Burgess, Jonah
Format: Preprint
Published: 2025
Online Access: https://arxiv.org/abs/2510.16005
Table of Contents:
  • Based on an analysis of 500 CTF participants, this paper shows that participants readily bypassed simple AI guardrails using common techniques, whereas layered, multi-step defenses still posed significant challenges. The findings offer concrete insights for building safer AI systems.