Saved in:
Bibliographic Details
Main Authors: Liu, Zhichao, Pan, Wenbo, Yu, Haining, Gao, Ge, Zhu, Tianqing, Jia, Xiaohua
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.08310
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910202820820992
author Liu, Zhichao
Pan, Wenbo
Yu, Haining
Gao, Ge
Zhu, Tianqing
Jia, Xiaohua
author_facet Liu, Zhichao
Pan, Wenbo
Yu, Haining
Gao, Ge
Zhu, Tianqing
Jia, Xiaohua
contents Browser agents are increasingly deployed in long-horizon tasks, which require executing extended action chains to accomplish user goals. However, this prolonged execution process provides attackers with more opportunities to inject malicious instructions. Existing prompt injection attacks against browser agents expose two key gaps: (1) low effectiveness, as attacks optimized for toy baselines fail to achieve end-to-end goals in real-world scenarios with complex environments and longer steps; (2) weak stealthiness, since most attacks pit the attack goal against the user goal, causing a significant drop in system usability under attack. To address these gaps, we propose WebTrap, a mid-task hijacking injection attack. It employs multi-step instruction fusion steering to seamlessly combine both goals, enabling the agent to resume the original user task after executing the attack goal. Furthermore, we design a context-grounded generation method to align the injected content with the task environment and system instructions, maximizing the hijacking success rate. Extensive experiments on two browser agent tasks, based on extended WASP and InjecAgent environments, demonstrate that our method achieves a high attack success rate while preserving the usability of the original system. We find that WebTrap exploits the agent's navigation vulnerabilities, binding the two goals so tightly that standard defense mechanisms cannot restore the system to normal operation. These findings reveal a critical vulnerability in agent systems during long-horizon tasks that they can be stealthily hijacked.
format Preprint
id arxiv_https___arxiv_org_abs_2605_08310
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle WebTrap: Stealthy Mid-Task Hijacking of Browser Agents During Navigation
Liu, Zhichao
Pan, Wenbo
Yu, Haining
Gao, Ge
Zhu, Tianqing
Jia, Xiaohua
Cryptography and Security
Artificial Intelligence
Browser agents are increasingly deployed in long-horizon tasks, which require executing extended action chains to accomplish user goals. However, this prolonged execution process provides attackers with more opportunities to inject malicious instructions. Existing prompt injection attacks against browser agents expose two key gaps: (1) low effectiveness, as attacks optimized for toy baselines fail to achieve end-to-end goals in real-world scenarios with complex environments and longer steps; (2) weak stealthiness, since most attacks pit the attack goal against the user goal, causing a significant drop in system usability under attack. To address these gaps, we propose WebTrap, a mid-task hijacking injection attack. It employs multi-step instruction fusion steering to seamlessly combine both goals, enabling the agent to resume the original user task after executing the attack goal. Furthermore, we design a context-grounded generation method to align the injected content with the task environment and system instructions, maximizing the hijacking success rate. Extensive experiments on two browser agent tasks, based on extended WASP and InjecAgent environments, demonstrate that our method achieves a high attack success rate while preserving the usability of the original system. We find that WebTrap exploits the agent's navigation vulnerabilities, binding the two goals so tightly that standard defense mechanisms cannot restore the system to normal operation. These findings reveal a critical vulnerability in agent systems during long-horizon tasks that they can be stealthily hijacked.
title WebTrap: Stealthy Mid-Task Hijacking of Browser Agents During Navigation
topic Cryptography and Security
Artificial Intelligence
url https://arxiv.org/abs/2605.08310