Saved in:
| Main Authors: | Wang, Shouqiao, Politi, Marcello, Marro, Samuele, Crapis, Davide |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.20925 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Complete Answer to Erdős Problem 690
by: Wang, Shouqiao, et al.
Published: (2026)
by: Wang, Shouqiao, et al.
Published: (2026)
A Framework for Combined Transaction Posting and Pricing for Layer 2 Blockchains
by: Wang, Shouqiao, et al.
Published: (2025)
by: Wang, Shouqiao, et al.
Published: (2025)
GOD model: Privacy Preserved AI School for Personal Assistant
by: PIN AI Team, et al.
Published: (2025)
by: PIN AI Team, et al.
Published: (2025)
End-to-end PDDL Planning with Hardcoded and Dynamic Agents
by: La Malfa, Emanuele, et al.
Published: (2025)
by: La Malfa, Emanuele, et al.
Published: (2025)
STARK: Strategic Team of Agents for Refining Kernels
by: Dong, Juncheng, et al.
Published: (2025)
by: Dong, Juncheng, et al.
Published: (2025)
Strategize Globally, Adapt Locally: A Multi-Turn Red Teaming Agent with Dual-Level Learning
by: Chen, Si, et al.
Published: (2025)
by: Chen, Si, et al.
Published: (2025)
DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents
by: Chen, Zhaorun, et al.
Published: (2026)
by: Chen, Zhaorun, et al.
Published: (2026)
Mechanism Design Is Not Enough: Prosocial Agents for Cooperative AI
by: Huang, Xuanqiang Angelo, et al.
Published: (2026)
by: Huang, Xuanqiang Angelo, et al.
Published: (2026)
Red Teaming AI Red Teaming
by: Majumdar, Subhabrata, et al.
Published: (2025)
by: Majumdar, Subhabrata, et al.
Published: (2025)
Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming
by: Zhang, Zheng, et al.
Published: (2025)
by: Zhang, Zheng, et al.
Published: (2025)
Large Language Models Miss the Multi-Agent Mark
by: La Malfa, Emanuele, et al.
Published: (2025)
by: La Malfa, Emanuele, et al.
Published: (2025)
RedAgent: Red Teaming Large Language Models with Context-aware Autonomous Language Agent
by: Xu, Huiyu, et al.
Published: (2024)
by: Xu, Huiyu, et al.
Published: (2024)
Red-Team Multi-Agent Reinforcement Learning for Emergency Braking Scenario
by: Chen, Yinsong, et al.
Published: (2025)
by: Chen, Yinsong, et al.
Published: (2025)
A Notion of Complexity for Theory of Mind via Discrete World Models
by: Huang, X. Angelo, et al.
Published: (2024)
by: Huang, X. Angelo, et al.
Published: (2024)
MonitoringBench: Semi-Automated Red-Teaming for Agent Monitoring
by: Jotautaitė, Monika, et al.
Published: (2026)
by: Jotautaitė, Monika, et al.
Published: (2026)
Automated Red Teaming with GOAT: the Generative Offensive Agent Tester
by: Pavlova, Maya, et al.
Published: (2024)
by: Pavlova, Maya, et al.
Published: (2024)
LLM Agents Are the Antidote to Walled Gardens
by: Marro, Samuele, et al.
Published: (2025)
by: Marro, Samuele, et al.
Published: (2025)
Effective Red-Teaming of Policy-Adherent Agents
by: Nakash, Itay, et al.
Published: (2025)
by: Nakash, Itay, et al.
Published: (2025)
EVA: Red-Teaming GUI Agents via Evolving Indirect Prompt Injection
by: Lu, Yijie, et al.
Published: (2025)
by: Lu, Yijie, et al.
Published: (2025)
Proteus: A Self-Evolving Red Team for Agent Skill Ecosystems
by: Zhou, Zhaojiacheng
Published: (2026)
by: Zhou, Zhaojiacheng
Published: (2026)
SoK: Blockchain-Based Decentralized AI (DeAI)
by: Lui, Elizabeth, et al.
Published: (2024)
by: Lui, Elizabeth, et al.
Published: (2024)
SafeSearch: Automated Red-Teaming of LLM-Based Search Agents
by: Dong, Jianshuo, et al.
Published: (2025)
by: Dong, Jianshuo, et al.
Published: (2025)
BlackIce: A Containerized Red Teaming Toolkit for AI Security Testing
by: Kaplan, Caelin, et al.
Published: (2025)
by: Kaplan, Caelin, et al.
Published: (2025)
Automatic LLM Red Teaming
by: Belaire, Roman, et al.
Published: (2025)
by: Belaire, Roman, et al.
Published: (2025)
CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge
by: Chiu, Yu Ying, et al.
Published: (2024)
by: Chiu, Yu Ying, et al.
Published: (2024)
CoT Red-Handed: Stress Testing Chain-of-Thought Monitoring
by: Arnav, Benjamin, et al.
Published: (2025)
by: Arnav, Benjamin, et al.
Published: (2025)
Geometric Red-Teaming for Robotic Manipulation
by: Goel, Divyam, et al.
Published: (2025)
by: Goel, Divyam, et al.
Published: (2025)
TroubleLLM: Align to Red Team Expert
by: Xu, Zhuoer, et al.
Published: (2024)
by: Xu, Zhuoer, et al.
Published: (2024)
FLIRT: Feedback Loop In-context Red Teaming
by: Mehrabi, Ninareh, et al.
Published: (2023)
by: Mehrabi, Ninareh, et al.
Published: (2023)
T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search
by: Lee, Hyomin, et al.
Published: (2026)
by: Lee, Hyomin, et al.
Published: (2026)
Red-Teaming Coding Agents from a Tool-Invocation Perspective: An Empirical Security Assessment
by: Xie, Yuchong, et al.
Published: (2025)
by: Xie, Yuchong, et al.
Published: (2025)
AgenticRed: Evolving Agentic Systems for Red-Teaming
by: Yuan, Jiayi, et al.
Published: (2026)
by: Yuan, Jiayi, et al.
Published: (2026)
Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction
by: Zhang, Jinchuan, et al.
Published: (2024)
by: Zhang, Jinchuan, et al.
Published: (2024)
LLM Wardens: Mitigating Adversarial Persuasion with Third-Party Conversational Oversight
by: Wachowiak, Lennart, et al.
Published: (2026)
by: Wachowiak, Lennart, et al.
Published: (2026)
Red Teaming Large Reasoning Models
by: Chen, Jiawei, et al.
Published: (2025)
by: Chen, Jiawei, et al.
Published: (2025)
Exploring Straightforward Conversational Red-Teaming
by: Kour, George, et al.
Published: (2024)
by: Kour, George, et al.
Published: (2024)
Whispers of Wealth: Red-Teaming Google's Agent Payments Protocol via Prompt Injection
by: Debi, Tanusree, et al.
Published: (2026)
by: Debi, Tanusree, et al.
Published: (2026)
Red-Teaming Agent Execution Contexts: Open-World Security Evaluation on OpenClaw
by: Yao, Hongwei, et al.
Published: (2026)
by: Yao, Hongwei, et al.
Published: (2026)
EVOCHAMBER: Test-Time Co-evolution of Multi-Agent System at Individual, Team, and Population Scales
by: Zhang, Yaolun, et al.
Published: (2026)
by: Zhang, Yaolun, et al.
Published: (2026)
ARMs: Adaptive Red-Teaming Agent against Multimodal Models with Plug-and-Play Attacks
by: Chen, Zhaorun, et al.
Published: (2025)
by: Chen, Zhaorun, et al.
Published: (2025)
Similar Items
-
A Complete Answer to Erdős Problem 690
by: Wang, Shouqiao, et al.
Published: (2026) -
A Framework for Combined Transaction Posting and Pricing for Layer 2 Blockchains
by: Wang, Shouqiao, et al.
Published: (2025) -
GOD model: Privacy Preserved AI School for Personal Assistant
by: PIN AI Team, et al.
Published: (2025) -
End-to-end PDDL Planning with Hardcoded and Dynamic Agents
by: La Malfa, Emanuele, et al.
Published: (2025) -
STARK: Strategic Team of Agents for Refining Kernels
by: Dong, Juncheng, et al.
Published: (2025)