Saved in:
| Main Authors: | Kimura, Subaru, Tanaka, Ryota, Miyawaki, Shumpei, Suzuki, Jun, Sakaguchi, Keisuke |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.03554 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Goal-guided Generative Prompt Injection Attack on Large Language Models
by: Zhang, Chong, et al.
Published: (2024)
by: Zhang, Chong, et al.
Published: (2024)
Instruction-Following Evaluation of Large Vision-Language Models
by: Shiono, Daiki, et al.
Published: (2025)
by: Shiono, Daiki, et al.
Published: (2025)
Hijacking Large Language Models via Adversarial In-Context Learning
by: Zhou, Xiangyu, et al.
Published: (2023)
by: Zhou, Xiangyu, et al.
Published: (2023)
Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection
by: Yan, Jun, et al.
Published: (2023)
by: Yan, Jun, et al.
Published: (2023)
Fingerprinting LLMs via Prompt Injection
by: Hu, Yuepeng, et al.
Published: (2025)
by: Hu, Yuepeng, et al.
Published: (2025)
Hijacking Large Audio-Language Models via Context-Agnostic and Imperceptible Auditory Prompt Injection
by: Chen, Meng, et al.
Published: (2026)
by: Chen, Meng, et al.
Published: (2026)
FigStep: Jailbreaking Large Vision-Language Models via Typographic Visual Prompts
by: Gong, Yichen, et al.
Published: (2023)
by: Gong, Yichen, et al.
Published: (2023)
FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks
by: Wang, Jiongxiao, et al.
Published: (2024)
by: Wang, Jiongxiao, et al.
Published: (2024)
PARASITE: Conditional System Prompt Poisoning to Hijack LLMs
by: Pham, Viet, et al.
Published: (2025)
by: Pham, Viet, et al.
Published: (2025)
An Early Categorization of Prompt Injection Attacks on Large Language Models
by: Rossi, Sippo, et al.
Published: (2024)
by: Rossi, Sippo, et al.
Published: (2024)
InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents
by: Zhan, Qiusi, et al.
Published: (2024)
by: Zhan, Qiusi, et al.
Published: (2024)
Securing Large Language Models (LLMs) from Prompt Injection Attacks
by: Suri, Omar Farooq Khan, et al.
Published: (2025)
by: Suri, Omar Farooq Khan, et al.
Published: (2025)
Zero-Shot Embedding Drift Detection: A Lightweight Defense Against Prompt Injections in LLMs
by: Sekar, Anirudh, et al.
Published: (2026)
by: Sekar, Anirudh, et al.
Published: (2026)
The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis
by: Wang, Peiran, et al.
Published: (2026)
by: Wang, Peiran, et al.
Published: (2026)
MIRAGE: Context-Aware Prompt Injection against Mobile GUI Agents via User-Generated Content
by: Guo, Ruoqi, et al.
Published: (2026)
by: Guo, Ruoqi, et al.
Published: (2026)
Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning
by: Ostermann, Simon, et al.
Published: (2024)
by: Ostermann, Simon, et al.
Published: (2024)
ReasAlign: Reasoning Enhanced Safety Alignment against Prompt Injection Attack
by: Li, Hao, et al.
Published: (2026)
by: Li, Hao, et al.
Published: (2026)
Prompt Injection attack against LLM-integrated Applications
by: Liu, Yi, et al.
Published: (2023)
by: Liu, Yi, et al.
Published: (2023)
HijackRAG: Hijacking Attacks against Retrieval-Augmented Large Language Models
by: Zhang, Yucheng, et al.
Published: (2024)
by: Zhang, Yucheng, et al.
Published: (2024)
Is Your Prompt Safe? Investigating Prompt Injection Attacks Against Open-Source LLMs
by: Wang, Jiawen, et al.
Published: (2025)
by: Wang, Jiawen, et al.
Published: (2025)
Prompt Injection as Role Confusion
by: Ye, Charles, et al.
Published: (2026)
by: Ye, Charles, et al.
Published: (2026)
AttnTrace: Contextual Attribution of Prompt Injection and Knowledge Corruption
by: Wang, Yanting, et al.
Published: (2025)
by: Wang, Yanting, et al.
Published: (2025)
ShadowCoT: Cognitive Hijacking for Stealthy Reasoning Backdoors in LLMs
by: Zhao, Gejian, et al.
Published: (2025)
by: Zhao, Gejian, et al.
Published: (2025)
Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections
by: Cao, Yuanpu, et al.
Published: (2023)
by: Cao, Yuanpu, et al.
Published: (2023)
PRSA: Prompt Stealing Attacks against Real-World Prompt Services
by: Yang, Yong, et al.
Published: (2024)
by: Yang, Yong, et al.
Published: (2024)
Triosecuris: Formally Verified Protection Against Speculative Control-Flow Hijacking
by: Baumann, Jonathan, et al.
Published: (2026)
by: Baumann, Jonathan, et al.
Published: (2026)
LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems
by: Zhang, Yuanhe, et al.
Published: (2025)
by: Zhang, Yuanhe, et al.
Published: (2025)
Adversarial Attacks on LLM-as-a-Judge Systems: Insights from Prompt Injections
by: Maloyan, Narek, et al.
Published: (2025)
by: Maloyan, Narek, et al.
Published: (2025)
Beyond Pattern Matching: Seven Cross-Domain Techniques for Prompt Injection Detection
by: Munirathinam, Thamilvendhan
Published: (2026)
by: Munirathinam, Thamilvendhan
Published: (2026)
Fun-tuning: Characterizing the Vulnerability of Proprietary LLMs to Optimization-based Prompt Injection Attacks via the Fine-Tuning Interface
by: Labunets, Andrey, et al.
Published: (2025)
by: Labunets, Andrey, et al.
Published: (2025)
Prompt Stealing Attacks Against Large Language Models
by: Sha, Zeyang, et al.
Published: (2024)
by: Sha, Zeyang, et al.
Published: (2024)
Denial-of-Service Poisoning Attacks against Large Language Models
by: Gao, Kuofeng, et al.
Published: (2024)
by: Gao, Kuofeng, et al.
Published: (2024)
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models
by: Liang, Zi, et al.
Published: (2024)
by: Liang, Zi, et al.
Published: (2024)
Separator Injection Attack: Uncovering Dialogue Biases in Large Language Models Caused by Role Separators
by: Li, Xitao, et al.
Published: (2025)
by: Li, Xitao, et al.
Published: (2025)
In-Context Representation Hijacking
by: Yona, Itay, et al.
Published: (2025)
by: Yona, Itay, et al.
Published: (2025)
AttackEval: A Systematic Empirical Study of Prompt Injection Attack Effectiveness Against Large Language Models
by: Wang, Jackson
Published: (2026)
by: Wang, Jackson
Published: (2026)
Image-based Prompt Injection: Hijacking Multimodal LLMs through Visually Embedded Adversarial Instructions
by: Nagaraja, Neha, et al.
Published: (2026)
by: Nagaraja, Neha, et al.
Published: (2026)
PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization
by: Geng, Runpeng, et al.
Published: (2025)
by: Geng, Runpeng, et al.
Published: (2025)
Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs
by: Pathade, Chetan
Published: (2025)
by: Pathade, Chetan
Published: (2025)
Applying Pre-trained Multilingual BERT in Embeddings for Improved Malicious Prompt Injection Attacks Detection
by: Rahman, Md Abdur, et al.
Published: (2024)
by: Rahman, Md Abdur, et al.
Published: (2024)
Similar Items
-
Goal-guided Generative Prompt Injection Attack on Large Language Models
by: Zhang, Chong, et al.
Published: (2024) -
Instruction-Following Evaluation of Large Vision-Language Models
by: Shiono, Daiki, et al.
Published: (2025) -
Hijacking Large Language Models via Adversarial In-Context Learning
by: Zhou, Xiangyu, et al.
Published: (2023) -
Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection
by: Yan, Jun, et al.
Published: (2023) -
Fingerprinting LLMs via Prompt Injection
by: Hu, Yuepeng, et al.
Published: (2025)