:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Shouqiao, Politi, Marcello, Marro, Samuele, Crapis, Davide
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.20925
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Complete Answer to Erdős Problem 690
by: Wang, Shouqiao, et al.
Published: (2026)

A Framework for Combined Transaction Posting and Pricing for Layer 2 Blockchains
by: Wang, Shouqiao, et al.
Published: (2025)

GOD model: Privacy Preserved AI School for Personal Assistant
by: PIN AI Team, et al.
Published: (2025)

End-to-end PDDL Planning with Hardcoded and Dynamic Agents
by: La Malfa, Emanuele, et al.
Published: (2025)

STARK: Strategic Team of Agents for Refining Kernels
by: Dong, Juncheng, et al.
Published: (2025)

Strategize Globally, Adapt Locally: A Multi-Turn Red Teaming Agent with Dual-Level Learning
by: Chen, Si, et al.
Published: (2025)

DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents
by: Chen, Zhaorun, et al.
Published: (2026)

Mechanism Design Is Not Enough: Prosocial Agents for Cooperative AI
by: Huang, Xuanqiang Angelo, et al.
Published: (2026)

Red Teaming AI Red Teaming
by: Majumdar, Subhabrata, et al.
Published: (2025)

Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming
by: Zhang, Zheng, et al.
Published: (2025)

Large Language Models Miss the Multi-Agent Mark
by: La Malfa, Emanuele, et al.
Published: (2025)

RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent
by: Xu, Huiyu, et al.
Published: (2024)

Red-Team Multi-Agent Reinforcement Learning for Emergency Braking Scenario
by: Chen, Yinsong, et al.
Published: (2025)

A Notion of Complexity for Theory of Mind via Discrete World Models
by: Huang, X. Angelo, et al.
Published: (2024)

MonitoringBench: Semi-Automated Red-Teaming for Agent Monitoring
by: Jotautaitė, Monika, et al.
Published: (2026)

Automated Red Teaming with GOAT: the Generative Offensive Agent Tester
by: Pavlova, Maya, et al.
Published: (2024)

LLM Agents Are the Antidote to Walled Gardens
by: Marro, Samuele, et al.
Published: (2025)

Effective Red-Teaming of Policy-Adherent Agents
by: Nakash, Itay, et al.
Published: (2025)

EVA: Red-Teaming GUI Agents via Evolving Indirect Prompt Injection
by: Lu, Yijie, et al.
Published: (2025)

Proteus: A Self-Evolving Red Team for Agent Skill Ecosystems
by: Zhou, Zhaojiacheng
Published: (2026)

SoK: Blockchain-Based Decentralized AI (DeAI)
by: Lui, Elizabeth, et al.
Published: (2024)

SafeSearch: Automated Red-Teaming of LLM-Based Search Agents
by: Dong, Jianshuo, et al.
Published: (2025)

BlackIce: A Containerized Red Teaming Toolkit for AI Security Testing
by: Kaplan, Caelin, et al.
Published: (2025)

Automatic LLM Red Teaming
by: Belaire, Roman, et al.
Published: (2025)

CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
by: Chiu, Yu Ying, et al.
Published: (2024)

CoT Red-Handed: Stress Testing Chain-of-Thought Monitoring
by: Arnav, Benjamin, et al.
Published: (2025)

Geometric Red-Teaming for Robotic Manipulation
by: Goel, Divyam, et al.
Published: (2025)

TroubleLLM: Align to Red Team Expert
by: Xu, Zhuoer, et al.
Published: (2024)

FLIRT: Feedback Loop In-context Red Teaming
by: Mehrabi, Ninareh, et al.
Published: (2023)

T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search
by: Lee, Hyomin, et al.
Published: (2026)

Red-Teaming Coding Agents from a Tool-Invocation Perspective: An Empirical Security Assessment
by: Xie, Yuchong, et al.
Published: (2025)

AgenticRed: Evolving Agentic Systems for Red-Teaming
by: Yuan, Jiayi, et al.
Published: (2026)

Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction
by: Zhang, Jinchuan, et al.
Published: (2024)

LLM Wardens: Mitigating Adversarial Persuasion with Third-Party Conversational Oversight
by: Wachowiak, Lennart, et al.
Published: (2026)

Red Teaming Large Reasoning Models
by: Chen, Jiawei, et al.
Published: (2025)

Exploring Straightforward Conversational Red-Teaming
by: Kour, George, et al.
Published: (2024)

Whispers of Wealth: Red-Teaming Google's Agent Payments Protocol via Prompt Injection
by: Debi, Tanusree, et al.
Published: (2026)

Red-Teaming Agent Execution Contexts: Open-World Security Evaluation on OpenClaw
by: Yao, Hongwei, et al.
Published: (2026)

EVOCHAMBER: Test-Time Co-evolution of Multi-Agent System at Individual, Team, and Population Scales
by: Zhang, Yaolun, et al.
Published: (2026)

ARMs: Adaptive Red-Teaming Agent against Multimodal Models with Plug-and-Play Attacks
by: Chen, Zhaorun, et al.
Published: (2025)