:: Library Catalog

Bibliographic Details
Main Authors:	Molina-Markham, Andres; Robaina, Luis; Steinle, Sean; Trivedi, Akash; Tsui, Derek; Potteiger, Nicholas; Brandt, Lauren; Winder, Ransom; Ridley, Ahmad
Format:	Preprint
Published:	2025
Subjects:	Machine Learning; Artificial Intelligence; Cryptography and Security
Online Access:	https://arxiv.org/abs/2505.22531

Similar Items

Designing Robust Cyber-Defense Agents with Evolving Behavior Trees
by: Potteiger, Nicholas, et al.
Published: (2024)

Out-of-Distribution Detection for Neurosymbolic Autonomous Cyber Agents
by: Samaddar, Ankita, et al.
Published: (2024)

Proceedings of the 2nd International Workshop on Adaptive Cyber Defense
by: Carvalho, Marco, et al.
Published: (2023)

Interpreting Agent Behaviors in Reinforcement-Learning-Based Cyber-Battle Simulation Platforms
by: Claypoole, Jared, et al.
Published: (2025)

Interpretability-Guided Test-Time Adversarial Defense
by: Kulkarni, Akshay, et al.
Published: (2024)

The Autonomy Tax: Defense Training Breaks LLM Agents
by: Li, Shawn, et al.
Published: (2026)

Explainable Autonomous Cyber Defense using Adversarial Multi-Agent Reinforcement Learning
by: Zhang, Yiyao, et al.
Published: (2026)

Anti-Sensing: Defense against Unauthorized Radar-based Human Vital Sign Sensing with Physically Realizable Wearable Oscillators
by: Oshim, Md Farhan Tasnim, et al.
Published: (2025)

Attacks and Defenses Against LLM Fingerprinting
by: Kurian, Kevin, et al.
Published: (2025)

RL and Fingerprinting to Select Moving Target Defense Mechanisms for Zero-day Attacks in IoT
by: Celdrán, Alberto Huertas, et al.
Published: (2022)

The Path To Autonomous Cyber Defense
by: Oesch, Sean, et al.
Published: (2024)

Dynamic Dual-level Defense Routing for Continual Adversarial Training
by: Wang, Wenxuan, et al.
Published: (2025)

RLShield: Practical Multi-Agent RL for Financial Cyber Defense with Attack-Surface MDPs and Real-Time Response Orchestration
by: Nayak, Srikumar
Published: (2026)

AegisAgent: An Autonomous Defense Agent Against Prompt Injection Attacks in LLM-HARs
by: Wang, Yihan, et al.
Published: (2025)

AgentDyn: Are Your Agent Security Defenses Deployable in Real-World Dynamic Environments?
by: Li, Hao, et al.
Published: (2026)

AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks
by: Zeng, Yifan, et al.
Published: (2024)

Cross-Task Defense: Instruction-Tuning LLMs for Content Safety
by: Fu, Yu, et al.
Published: (2024)

Defending Against Prompt Injection With a Few DefensiveTokens
by: Chen, Sizhe, et al.
Published: (2025)

PocketAgents: A Manifest-Driven Library of Autonomous Defense Agents
by: Barbieri, Sidnei, et al.
Published: (2026)

AgentSentinel: An End-to-End and Real-Time Security Defense Framework for Computer-Use Agents
by: Hu, Haitao, et al.
Published: (2025)

Decoding FL Defenses: Systemization, Pitfalls, and Remedies
by: Khan, Momin Ahmad, et al.
Published: (2025)

Contextualized Privacy Defense for LLM Agents
by: Wen, Yule, et al.
Published: (2026)

Defensible Design for OpenClaw: Securing Autonomous Tool-Invoking Agents
by: Li, Zongwei, et al.
Published: (2026)

Toward a Dynamic Intellectual Property Protection Model in High-Growth SMEs
by: Pitruzzello, Sam, et al.
Published: (2026)

Threat Intelligence Driven IP Protection for Entrepreneurial SMEs
by: Pitruzzello, Sam, et al.
Published: (2026)

Jatmo: Prompt Injection Defense by Task-Specific Finetuning
by: Piet, Julien, et al.
Published: (2023)

Elevating Defenses: Bridging Adversarial Training and Watermarking for Model Resilience
by: Thakkar, Janvi, et al.
Published: (2023)

Differentially Private Optimization for Non-Decomposable Objective Functions
by: Kong, Weiwei, et al.
Published: (2023)

Preserving security in a world with powerful AI: Considerations for the future Defense Architecture
by: Generous, Nicholas, et al.
Published: (2025)

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
by: Zhang, Hanrong, et al.
Published: (2024)

Memory Poisoning Attack and Defense on Memory Based LLM-Agents
by: Sunil, Balachandra Devarangadi, et al.
Published: (2026)

Organizational Learning in Industry 4.0: Applying Crossan's 4I Framework with Double Loop Learning
by: Akram, Nimra, et al.
Published: (2025)

Evaluating the Robustness of the "Ensemble Everything Everywhere" Defense
by: Zhang, Jie, et al.
Published: (2024)

Efficient RL-based Cache Vulnerability Exploration by Penalizing Useless Agent Actions
by: Nakanishi, Kanato, et al.
Published: (2025)

Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents
by: Kumar, Priyanshu, et al.
Published: (2024)

RoboJailBench: Benchmarking Adversarial Attacks and Defenses in Embodied Robotic Agents
by: Yeke, Doguhuan, et al.
Published: (2026)

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
by: Debenedetti, Edoardo, et al.
Published: (2024)

Automated Cyber Defense with Generalizable Graph-based Reinforcement Learning Agents
by: King, Isaiah J., et al.
Published: (2025)

Taxonomy, Evaluation and Exploitation of IPI-Centric LLM Agent Defense Frameworks
by: Ji, Zimo, et al.
Published: (2025)

WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections
by: Cao, Tri, et al.
Published: (2026)