:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jiang, Xiaochong, Yang, Shiqi, Li, Ziwei, Liu, Lifei, Yu, Haoran, Liu, Yichen
Format:	Preprint
Published:	2026
Subjects:	Cryptography and Security Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.26542
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SOK: A Taxonomy of Attack Vectors and Defense Strategies for Agentic Supply Chain Runtime
by: Jiang, Xiaochong, et al.
Published: (2026)

When Safe Skills Collide: Measuring Compositional Risk in Agent Skill Ecosystems
by: Wang, Su, et al.
Published: (2026)

CapSeal: Capability-Sealed Secret Mediation for Secure Agent Execution
by: Jin, Shutong, et al.
Published: (2026)

SafeMobile: Chain-level Jailbreak Detection and Automated Evaluation for Multimodal Mobile Agents
by: Liang, Siyuan, et al.
Published: (2025)

SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents
by: Yin, Sheng, et al.
Published: (2024)

The Granularity Mismatch in Agent Security: Argument-Level Provenance Solves Enforcement and Isolates the LLM Reasoning Bottleneck
by: Fan, Linfeng, et al.
Published: (2026)

Beyond Max Tokens: Stealthy Resource Amplification via Tool Calling Chains in LLM Agents
by: Zhou, Kaiyu, et al.
Published: (2026)

SafeHarness: Lifecycle-Integrated Security Architecture for LLM-based Agent Deployment
by: Lin, Xixun, et al.
Published: (2026)

Evaluating Privilege Usage of Agents with Real-World Tools
by: Zhang, Quan, et al.
Published: (2026)

ToolTweak: An Attack on Tool Selection in LLM-based Agents
by: Sneh, Jonathan, et al.
Published: (2025)

AgentTrust: Runtime Safety Evaluation and Interception for AI Agent Tool Use
by: Yang, Chenglin
Published: (2026)

VIGIL: Defending LLM Agents Against Tool Stream Injection via Verify-Before-Commit
by: Lin, Junda, et al.
Published: (2026)

Red-Teaming Coding Agents from a Tool-Invocation Perspective: An Empirical Security Assessment
by: Xie, Yuchong, et al.
Published: (2025)

aCAPTCHA: Verifying That an Entity Is a Capable Agent via Asymmetric Hardness
by: Xu, Zuyao, et al.
Published: (2026)

MalURLBench: A Benchmark Evaluating Agents' Vulnerabilities When Processing Web URLs
by: Kong, Dezhang, et al.
Published: (2026)

Agent Tools Orchestration Leaks More: Dataset, Benchmark, and Mitigation
by: Qiao, Yuxuan, et al.
Published: (2025)

LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring
by: Li, Chloe, et al.
Published: (2025)

SafeHarbor: Hierarchical Memory-Augmented Guardrail for LLM Agent Safety
by: Liu, Zhe, et al.
Published: (2026)

Atomicity for Agents: Exposing, Exploiting, and Mitigating TOCTOU Vulnerabilities in Browser-Use Agents
by: Jiang, Linxi, et al.
Published: (2026)

Hijacking Agent Memory: Stealthy Trojan Attacks Through Conversational Interaction
by: Wang, Hongtao, et al.
Published: (2026)

Babel: Jailbreaking Safety Attention via Obfuscation Distribution Optimized Sampling
by: Wang, Ziwei, et al.
Published: (2026)

ExploitBench: A Capability Ladder Benchmark for LLM Cybersecurity Agents
by: Lee, Seunghyun, et al.
Published: (2026)

Enhancing Linux Privilege Escalation Attack Capabilities of Local LLM Agents
by: Probst, Benjamin, et al.
Published: (2026)

RepliBench: Evaluating the Autonomous Replication Capabilities of Language Model Agents
by: Black, Sid, et al.
Published: (2025)

Towards Safe and Honest AI Agents with Neural Self-Other Overlap
by: Carauleanu, Marc, et al.
Published: (2024)

BadThink: Triggered Overthinking Attacks on Chain-of-Thought Reasoning in Large Language Models
by: Liu, Shuaitong, et al.
Published: (2025)

Jailbreaking Large Language Models through Iterative Tool-Disguised Attacks via Reinforcement Learning
by: Wang, Zhaoqi, et al.
Published: (2026)

PACEbench: A Framework for Evaluating Practical AI Cyber-Exploitation Capabilities
by: Liu, Zicheng, et al.
Published: (2025)

PatchPilot: A Cost-Efficient Software Engineering Agent with Early Attempts on Formal Verification
by: Li, Hongwei, et al.
Published: (2025)

RECUR: Resource Exhaustion Attack via Recursive-Entropy Guided Counterfactual Utilization and Reflection
by: Wang, Ziwei, et al.
Published: (2026)

AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery
by: Wang, Haowei, et al.
Published: (2025)

SFCoT: Safer Chain-of-Thought via Active Safety Evaluation and Calibration
by: Pan, Yu, et al.
Published: (2026)

SafeSearch: Automated Red-Teaming of LLM-Based Search Agents
by: Dong, Jianshuo, et al.
Published: (2025)

Your LLM Agent Can Leak Your Data: Data Exfiltration via Backdoored Tool Use
by: Zhang, Wuyang, et al.
Published: (2026)

ShadowMerge: A Novel Poisoning Attack on Graph-Based Agent Memory via Relation-Channel Conflicts
by: Luo, Yang, et al.
Published: (2026)

TrajAD: Trajectory Anomaly Detection for Trustworthy LLM Agents
by: Liu, Yibing, et al.
Published: (2026)

Supply-Chain Poisoning Attacks Against LLM Coding Agent Skill Ecosystems
by: Qu, Yubin, et al.
Published: (2026)

VulDetectBench: Evaluating the Deep Capability of Vulnerability Detection with Large Language Models
by: Liu, Yu, et al.
Published: (2024)

STAC: When Innocent Tools Form Dangerous Chains to Jailbreak LLM Agents
by: Li, Jing-Jing, et al.
Published: (2025)

Agent Capability Negotiation and Binding Protocol (ACNBP)
by: Huang, Ken, et al.
Published: (2025)