Saved in:
| Main Authors: | Zhang, Yilin, Hua, Yingkai, Wei, Chunyu, Wang, Xin, Chen, Yueguo |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.09497 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Please Don't Kill My Vibe: Empowering Agents with Data Flow Control
by: Summers, Charlie, et al.
Published: (2025)
by: Summers, Charlie, et al.
Published: (2025)
Don't believe everything you read: Understanding and Measuring MCP Behavior under Misleading Tool Descriptions
by: Li, Zhihao, et al.
Published: (2026)
by: Li, Zhihao, et al.
Published: (2026)
Prompts Don't Protect: Architectural Enforcement via MCP Proxy for LLM Tool Access Control
by: Uppala, Rohith
Published: (2026)
by: Uppala, Rohith
Published: (2026)
Secret Collusion among AI Agents: Multi-Agent Deception via Steganography
by: Motwani, Sumeet Ramesh, et al.
Published: (2024)
by: Motwani, Sumeet Ramesh, et al.
Published: (2024)
I Don't Know You, But I Can Catch You: Real-Time Defense against Diverse Adversarial Patches for Object Detectors
by: Lin, Zijin, et al.
Published: (2024)
by: Lin, Zijin, et al.
Published: (2024)
Mind the Web: The Security of Web Use Agents
by: Shapira, Avishag, et al.
Published: (2025)
by: Shapira, Avishag, et al.
Published: (2025)
WebTrap Park: An Automated Platform for Systematic Security Evaluation of Web Agents
by: Wu, Xinyi, et al.
Published: (2026)
by: Wu, Xinyi, et al.
Published: (2026)
AI-Governed Agent Architecture for Web-Trustworthy Tokenization of Alternative Assets
by: Borjigin, Ailiya, et al.
Published: (2025)
by: Borjigin, Ailiya, et al.
Published: (2025)
LLM Cyber Evaluations Don't Capture Real-World Risk
by: Lukošiūtė, Kamilė, et al.
Published: (2025)
by: Lukošiūtė, Kamilė, et al.
Published: (2025)
WIPI: A New Web Threat for LLM-Driven Web Agents
by: Wu, Fangzhou, et al.
Published: (2024)
by: Wu, Fangzhou, et al.
Published: (2024)
Fuzz-Testing Meets LLM-Based Agents: An Automated and Efficient Framework for Jailbreaking Text-To-Image Generation Models
by: Dong, Yingkai, et al.
Published: (2024)
by: Dong, Yingkai, et al.
Published: (2024)
AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery
by: Wang, Haowei, et al.
Published: (2025)
by: Wang, Haowei, et al.
Published: (2025)
Multi-Agent Penetration Testing AI for the Web
by: David, Isaac, et al.
Published: (2025)
by: David, Isaac, et al.
Published: (2025)
WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections
by: Cao, Tri, et al.
Published: (2026)
by: Cao, Tri, et al.
Published: (2026)
AWE: Adaptive Agents for Dynamic Web Penetration Testing
by: Jaswal, Akshat Singh, et al.
Published: (2026)
by: Jaswal, Akshat Singh, et al.
Published: (2026)
DECEPTICON: How Dark Patterns Manipulate Web Agents
by: Cuvin, Phil, et al.
Published: (2025)
by: Cuvin, Phil, et al.
Published: (2025)
SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents
by: Du, Mengyao, et al.
Published: (2026)
by: Du, Mengyao, et al.
Published: (2026)
WebSentinel: Detecting and Localizing Prompt Injection Attacks for Web Agents
by: Wang, Xilong, et al.
Published: (2026)
by: Wang, Xilong, et al.
Published: (2026)
Poison Once, Exploit Forever: Environment-Injected Memory Poisoning Attacks on Web Agents
by: Zou, Wei, et al.
Published: (2026)
by: Zou, Wei, et al.
Published: (2026)
Cross-Modal Content Optimization for Steering Web Agent Preferences
by: Jiang, Tanqiu, et al.
Published: (2025)
by: Jiang, Tanqiu, et al.
Published: (2025)
Real Money, Fake Models: Deceptive Model Claims in Shadow APIs
by: Zhang, Yage, et al.
Published: (2026)
by: Zhang, Yage, et al.
Published: (2026)
WebPII: Benchmarking Visual PII Detection for Computer-Use Agents
by: Zhao, Nathan
Published: (2026)
by: Zhao, Nathan
Published: (2026)
WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks
by: Evtimov, Ivan, et al.
Published: (2025)
by: Evtimov, Ivan, et al.
Published: (2025)
IPI-proxy: An Intercepting Proxy for Red-Teaming Web-Browsing AI Agents Against Indirect Prompt Injection
by: Chia-Pei, et al.
Published: (2026)
by: Chia-Pei, et al.
Published: (2026)
When Bots Take the Bait: Exposing and Mitigating the Emerging Social Engineering Attack in Web Automation Agent
by: Wu, Xinyi, et al.
Published: (2026)
by: Wu, Xinyi, et al.
Published: (2026)
Options, Not Clicks: Lattice Refinement for Consent-Driven MCP Authorization
by: Li, Ying, et al.
Published: (2026)
by: Li, Ying, et al.
Published: (2026)
WebWeaver: Breaking Topology Confidentiality in LLM Multi-Agent Systems with Stealthy Context-Based Inference
by: Xiong, Zixun, et al.
Published: (2026)
by: Xiong, Zixun, et al.
Published: (2026)
WebTrap: Stealthy Mid-Task Hijacking of Browser Agents During Navigation
by: Liu, Zhichao, et al.
Published: (2026)
by: Liu, Zhichao, et al.
Published: (2026)
zkUnlearner: A Zero-Knowledge Framework for Verifiable Unlearning with Multi-Granularity and Forgery-Resistance
by: Wang, Nan, et al.
Published: (2025)
by: Wang, Nan, et al.
Published: (2025)
CheatAgent: Attacking LLM-Empowered Recommender Systems via LLM Agent
by: Ning, Liang-bo, et al.
Published: (2025)
by: Ning, Liang-bo, et al.
Published: (2025)
Jailbreaking Frontier Foundation Models Through Intention Deception
by: Wang, Xinhe, et al.
Published: (2026)
by: Wang, Xinhe, et al.
Published: (2026)
AdaPhish: AI-Powered Adaptive Defense and Education Resource Against Deceptive Emails
by: Meguro, Rei, et al.
Published: (2025)
by: Meguro, Rei, et al.
Published: (2025)
Autonomous Intelligent Agents for Natural-Language-Driven Web Execution with Integrated Security Assurance
by: Pasupuleti, Vinil, et al.
Published: (2026)
by: Pasupuleti, Vinil, et al.
Published: (2026)
MalURLBench: A Benchmark Evaluating Agents' Vulnerabilities When Processing Web URLs
by: Kong, Dezhang, et al.
Published: (2026)
by: Kong, Dezhang, et al.
Published: (2026)
Contextual Chart Generation for Cyber Deception
by: Nguyen, David D., et al.
Published: (2024)
by: Nguyen, David D., et al.
Published: (2024)
PAPILLON: Efficient and Stealthy Fuzz Testing-Powered Jailbreaks for LLMs
by: Gong, Xueluan, et al.
Published: (2024)
by: Gong, Xueluan, et al.
Published: (2024)
ICON: Indirect Prompt Injection Defense for Agents based on Inference-Time Correction
by: Wang, Che, et al.
Published: (2026)
by: Wang, Che, et al.
Published: (2026)
MUZZLE: Adaptive Agentic Red-Teaming of Web Agents Against Indirect Prompt Injection Attacks
by: Syros, Georgios, et al.
Published: (2026)
by: Syros, Georgios, et al.
Published: (2026)
Manipulating LLM Web Agents with Indirect Prompt Injection Attack via HTML Accessibility Tree
by: Johnson, Sam, et al.
Published: (2025)
by: Johnson, Sam, et al.
Published: (2025)
WebSP-Eval: Evaluating Web Agents on Website Security and Privacy Tasks
by: Ramesh, Guruprasad Viswanathan, et al.
Published: (2026)
by: Ramesh, Guruprasad Viswanathan, et al.
Published: (2026)
Similar Items
-
Please Don't Kill My Vibe: Empowering Agents with Data Flow Control
by: Summers, Charlie, et al.
Published: (2025) -
Don't believe everything you read: Understanding and Measuring MCP Behavior under Misleading Tool Descriptions
by: Li, Zhihao, et al.
Published: (2026) -
Prompts Don't Protect: Architectural Enforcement via MCP Proxy for LLM Tool Access Control
by: Uppala, Rohith
Published: (2026) -
Secret Collusion among AI Agents: Multi-Agent Deception via Steganography
by: Motwani, Sumeet Ramesh, et al.
Published: (2024) -
I Don't Know You, But I Can Catch You: Real-Time Defense against Diverse Adversarial Patches for Object Detectors
by: Lin, Zijin, et al.
Published: (2024)