| Main Authors: | Molina-Markham, Andres, Robaina, Luis, Steinle, Sean, Trivedi, Akash, Tsui, Derek, Potteiger, Nicholas, Brandt, Lauren, Winder, Ransom, Ridley, Ahmad |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.22531 |
Similar Items
Designing Robust Cyber-Defense Agents with Evolving Behavior Trees
by: Potteiger, Nicholas, et al.
Published: (2024)
Out-of-Distribution Detection for Neurosymbolic Autonomous Cyber Agents
by: Samaddar, Ankita, et al.
Published: (2024)
Proceedings of the 2nd International Workshop on Adaptive Cyber Defense
by: Carvalho, Marco, et al.
Published: (2023)
Interpreting Agent Behaviors in Reinforcement-Learning-Based Cyber-Battle Simulation Platforms
by: Claypoole, Jared, et al.
Published: (2025)
Interpretability-Guided Test-Time Adversarial Defense
by: Kulkarni, Akshay, et al.
Published: (2024)
The Autonomy Tax: Defense Training Breaks LLM Agents
by: Li, Shawn, et al.
Published: (2026)
Explainable Autonomous Cyber Defense using Adversarial Multi-Agent Reinforcement Learning
by: Zhang, Yiyao, et al.
Published: (2026)
Anti-Sensing: Defense against Unauthorized Radar-based Human Vital Sign Sensing with Physically Realizable Wearable Oscillators
by: Oshim, Md Farhan Tasnim, et al.
Published: (2025)
Attacks and Defenses Against LLM Fingerprinting
by: Kurian, Kevin, et al.
Published: (2025)
RL and Fingerprinting to Select Moving Target Defense Mechanisms for Zero-day Attacks in IoT
by: Celdrán, Alberto Huertas, et al.
Published: (2022)
The Path To Autonomous Cyber Defense
by: Oesch, Sean, et al.
Published: (2024)
Dynamic Dual-level Defense Routing for Continual Adversarial Training
by: Wang, Wenxuan, et al.
Published: (2025)
RLShield: Practical Multi-Agent RL for Financial Cyber Defense with Attack-Surface MDPs and Real-Time Response Orchestration
by: Nayak, Srikumar
Published: (2026)
AegisAgent: An Autonomous Defense Agent Against Prompt Injection Attacks in LLM-HARs
by: Wang, Yihan, et al.
Published: (2025)
AgentDyn: Are Your Agent Security Defenses Deployable in Real-World Dynamic Environments?
by: Li, Hao, et al.
Published: (2026)
AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks
by: Zeng, Yifan, et al.
Published: (2024)
Cross-Task Defense: Instruction-Tuning LLMs for Content Safety
by: Fu, Yu, et al.
Published: (2024)
Defending Against Prompt Injection With a Few DefensiveTokens
by: Chen, Sizhe, et al.
Published: (2025)
PocketAgents: A Manifest-Driven Library of Autonomous Defense Agents
by: Barbieri, Sidnei, et al.
Published: (2026)
AgentSentinel: An End-to-End and Real-Time Security Defense Framework for Computer-Use Agents
by: Hu, Haitao, et al.
Published: (2025)
Decoding FL Defenses: Systemization, Pitfalls, and Remedies
by: Khan, Momin Ahmad, et al.
Published: (2025)
Contextualized Privacy Defense for LLM Agents
by: Wen, Yule, et al.
Published: (2026)
Defensible Design for OpenClaw: Securing Autonomous Tool-Invoking Agents
by: Li, Zongwei, et al.
Published: (2026)
Toward a Dynamic Intellectual Property Protection Model in High-Growth SMEs
by: Pitruzzello, Sam, et al.
Published: (2026)
Threat Intelligence Driven IP Protection for Entrepreneurial SMEs
by: Pitruzzello, Sam, et al.
Published: (2026)
Jatmo: Prompt Injection Defense by Task-Specific Finetuning
by: Piet, Julien, et al.
Published: (2023)
Elevating Defenses: Bridging Adversarial Training and Watermarking for Model Resilience
by: Thakkar, Janvi, et al.
Published: (2023)
Differentially Private Optimization for Non-Decomposable Objective Functions
by: Kong, Weiwei, et al.
Published: (2023)
Preserving security in a world with powerful AI Considerations for the future Defense Architecture
by: Generous, Nicholas, et al.
Published: (2025)
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
by: Zhang, Hanrong, et al.
Published: (2024)
Memory Poisoning Attack and Defense on Memory Based LLM-Agents
by: Sunil, Balachandra Devarangadi, et al.
Published: (2026)
Organizational Learning in Industry 4.0: Applying Crossan's 4I Framework with Double Loop Learning
by: Akram, Nimra, et al.
Published: (2025)
Evaluating the Robustness of the "Ensemble Everything Everywhere" Defense
by: Zhang, Jie, et al.
Published: (2024)
Efficient RL-based Cache Vulnerability Exploration by Penalizing Useless Agent Actions
by: Nakanishi, Kanato, et al.
Published: (2025)
Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents
by: Kumar, Priyanshu, et al.
Published: (2024)
RoboJailBench: Benchmarking Adversarial Attacks and Defenses in Embodied Robotic Agents
by: Yeke, Doguhuan, et al.
Published: (2026)
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
by: Debenedetti, Edoardo, et al.
Published: (2024)
Automated Cyber Defense with Generalizable Graph-based Reinforcement Learning Agents
by: King, Isaiah J., et al.
Published: (2025)
Taxonomy, Evaluation and Exploitation of IPI-Centric LLM Agent Defense Frameworks
by: Ji, Zimo, et al.
Published: (2025)
WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections
by: Cao, Tri, et al.
Published: (2026)