:: Library Catalog

Image de couverture de livre

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Abdelnabi, Sahar, Hicks, Chris, Rieck, Konrad, Sadeghi, Ahmad-Reza
Format:	Preprint
Publié:	2026
Sujets:	Cryptography and Security Artificial Intelligence
Accès en ligne:	https://arxiv.org/abs/2605.22568
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

Documents similaires

Stateless Yet Not Forgetful: Implicit Memory as a Hidden Channel in LLMs
par: Salem, Ahmed, et autres
Publié: (2026)

Terrarium: Revisiting the Blackboard for Multi-Agent Safety, Privacy, and Security Studies
par: Nakamura, Mason, et autres
Publié: (2025)

No More, No Less: Task Alignment in Terminal Agents
par: Mavali, Sina, et autres
Publié: (2026)

CybORG++: An Enhanced Gym for the Development of Autonomous Cyber Agents
par: Emerson, Harry, et autres
Publié: (2024)

ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations
par: Gomaa, Amr, et autres
Publié: (2025)

HardSecBench: Benchmarking the Security Awareness of LLMs for Hardware Code Generation
par: Chen, Qirui, et autres
Publié: (2026)

SkillTester: Benchmarking Utility and Security of Agent Skills
par: Wang, Leye, et autres
Publié: (2026)

Measuring Safety Alignment Effects in Autonomous Security Agents
par: David, Isaac, et autres
Publié: (2026)

AI Agents May Always Fall for Prompt Injections
par: Abdelnabi, Sahar, et autres
Publié: (2026)

ZORRO: Zero-Knowledge Robustness and Privacy for Split Learning (Full Version)
par: Sheybani, Nojan, et autres
Publié: (2025)

Towards Unifying Quantitative Security Benchmarking for Multi Agent Systems
par: Sharma, Gauri, et autres
Publié: (2025)

Firewalls to Secure Dynamic LLM Agentic Networks
par: Abdelnabi, Sahar, et autres
Publié: (2025)

Why LLMs Fail: A Failure Analysis and Partial Success Measurement for Automated Security Patch Generation
par: Al-Maamari, Amir
Publié: (2026)

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
par: Zhang, Hanrong, et autres
Publié: (2024)

Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks
par: Schmotz, David, et autres
Publié: (2026)

WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks
par: Evtimov, Ivan, et autres
Publié: (2025)

Secure On-Device Video OOD Detection Without Backpropagation
par: Li, Shawn, et autres
Publié: (2025)

Fooling SHAP with Output Shuffling Attacks
par: Yuan, Jun, et autres
Publié: (2024)

Mitigating Deep Reinforcement Learning Backdoors in the Neural Activation Space
par: Vyas, Sanyam, et autres
Publié: (2024)

Less is more? Rewards in RL for Cyber Defence
par: Bates, Elizabeth, et autres
Publié: (2025)

Too Easily Fooled? Prompt Injection Breaks LLMs on Frustratingly Simple Multiple-Choice Questions
par: Guo, Xuyang, et autres
Publié: (2025)

BadScientist: Can a Research Agent Write Convincing but Unsound Papers that Fool LLM Reviewers?
par: Jiang, Fengqing, et autres
Publié: (2025)

SecRepoBench: Benchmarking Code Agents for Secure Code Completion in Real-World Repositories
par: Shen, Chihao, et autres
Publié: (2025)

MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents
par: Zhang, Dongsen, et autres
Publié: (2025)

RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments
par: Fu, Yuchuan, et autres
Publié: (2025)

Security of AI Agents
par: He, Yifeng, et autres
Publié: (2024)

Autonomous Network Defence using Reinforcement Learning
par: Foley, Myles, et autres
Publié: (2024)

Symbolic Guardrails for Domain-Specific Agents: Stronger Safety and Security Guarantees Without Sacrificing Utility
par: Hong, Yining, et autres
Publié: (2026)

Offensive Security for AI Systems: Concepts, Practices, and Applications
par: Harguess, Josh, et autres
Publié: (2025)

Hidden in Memory: Sleeper Memory Poisoning in LLM Agents
par: Pulipaka, Sidharth, et autres
Publié: (2026)

Provably Secure Agent Guardrail
par: Wu, Benlong, et autres
Publié: (2026)

Large Language Models for Security Operations Centers: A Comprehensive Survey
par: Habibzadeh, Ali, et autres
Publié: (2025)

RAG Security and Privacy: Formalizing the Threat Model and Attack Surface
par: Arzanipour, Atousa, et autres
Publié: (2025)

Parallax: Why AI Agents That Think Must Never Act
par: Fokou, Joel
Publié: (2026)

Towards Effective Offensive Security LLM Agents: Hyperparameter Tuning, LLM as a Judge, and a Lightweight CTF Benchmark
par: Shao, Minghao, et autres
Publié: (2025)

Towards Secure Agent Skills: Architecture, Threat Taxonomy, and Security Analysis
par: Li, Zhiyuan, et autres
Publié: (2026)

Agent Security is a Systems Problem
par: Christodorescu, Mihai, et autres
Publié: (2026)

Security of Internet of Agents: Attacks and Countermeasures
par: Wang, Yuntao, et autres
Publié: (2025)

aCAPTCHA: Verifying That an Entity Is a Capable Agent via Asymmetric Hardness
par: Xu, Zuyao, et autres
Publié: (2026)

Dagger Behind Smile: Fool LLMs with a Happy Ending Story
par: Song, Xurui, et autres
Publié: (2025)