:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kimura, Subaru, Tanaka, Ryota, Miyawaki, Shumpei, Suzuki, Jun, Sakaguchi, Keisuke
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Cryptography and Security Machine Learning
Online Access:	https://arxiv.org/abs/2408.03554
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Goal-guided Generative Prompt Injection Attack on Large Language Models
by: Zhang, Chong, et al.
Published: (2024)

Instruction-Following Evaluation of Large Vision-Language Models
by: Shiono, Daiki, et al.
Published: (2025)

Hijacking Large Language Models via Adversarial In-Context Learning
by: Zhou, Xiangyu, et al.
Published: (2023)

Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection
by: Yan, Jun, et al.
Published: (2023)

Fingerprinting LLMs via Prompt Injection
by: Hu, Yuepeng, et al.
Published: (2025)

Hijacking Large Audio-Language Models via Context-Agnostic and Imperceptible Auditory Prompt Injection
by: Chen, Meng, et al.
Published: (2026)

FigStep: Jailbreaking Large Vision-Language Models via Typographic Visual Prompts
by: Gong, Yichen, et al.
Published: (2023)

FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks
by: Wang, Jiongxiao, et al.
Published: (2024)

PARASITE: Conditional System Prompt Poisoning to Hijack LLMs
by: Pham, Viet, et al.
Published: (2025)

An Early Categorization of Prompt Injection Attacks on Large Language Models
by: Rossi, Sippo, et al.
Published: (2024)

InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents
by: Zhan, Qiusi, et al.
Published: (2024)

Securing Large Language Models (LLMs) from Prompt Injection Attacks
by: Suri, Omar Farooq Khan, et al.
Published: (2025)

Zero-Shot Embedding Drift Detection: A Lightweight Defense Against Prompt Injections in LLMs
by: Sekar, Anirudh, et al.
Published: (2026)

The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis
by: Wang, Peiran, et al.
Published: (2026)

MIRAGE: Context-Aware Prompt Injection against Mobile GUI Agents via User-Generated Content
by: Guo, Ruoqi, et al.
Published: (2026)

Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning
by: Ostermann, Simon, et al.
Published: (2024)

ReasAlign: Reasoning Enhanced Safety Alignment against Prompt Injection Attack
by: Li, Hao, et al.
Published: (2026)

Prompt Injection attack against LLM-integrated Applications
by: Liu, Yi, et al.
Published: (2023)

HijackRAG: Hijacking Attacks against Retrieval-Augmented Large Language Models
by: Zhang, Yucheng, et al.
Published: (2024)

Is Your Prompt Safe? Investigating Prompt Injection Attacks Against Open-Source LLMs
by: Wang, Jiawen, et al.
Published: (2025)

Prompt Injection as Role Confusion
by: Ye, Charles, et al.
Published: (2026)

AttnTrace: Contextual Attribution of Prompt Injection and Knowledge Corruption
by: Wang, Yanting, et al.
Published: (2025)

ShadowCoT: Cognitive Hijacking for Stealthy Reasoning Backdoors in LLMs
by: Zhao, Gejian, et al.
Published: (2025)

Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections
by: Cao, Yuanpu, et al.
Published: (2023)

PRSA: Prompt Stealing Attacks against Real-World Prompt Services
by: Yang, Yong, et al.
Published: (2024)

Triosecuris: Formally Verified Protection Against Speculative Control-Flow Hijacking
by: Baumann, Jonathan, et al.
Published: (2026)

LeechHijack: Covert Computational Resource Exploitation in Intelligent Agent Systems
by: Zhang, Yuanhe, et al.
Published: (2025)

Adversarial Attacks on LLM-as-a-Judge Systems: Insights from Prompt Injections
by: Maloyan, Narek, et al.
Published: (2025)

Beyond Pattern Matching: Seven Cross-Domain Techniques for Prompt Injection Detection
by: Munirathinam, Thamilvendhan
Published: (2026)

Fun-tuning: Characterizing the Vulnerability of Proprietary LLMs to Optimization-based Prompt Injection Attacks via the Fine-Tuning Interface
by: Labunets, Andrey, et al.
Published: (2025)

Prompt Stealing Attacks Against Large Language Models
by: Sha, Zeyang, et al.
Published: (2024)

Denial-of-Service Poisoning Attacks against Large Language Models
by: Gao, Kuofeng, et al.
Published: (2024)

Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models
by: Liang, Zi, et al.
Published: (2024)

Separator Injection Attack: Uncovering Dialogue Biases in Large Language Models Caused by Role Separators
by: Li, Xitao, et al.
Published: (2025)

In-Context Representation Hijacking
by: Yona, Itay, et al.
Published: (2025)

AttackEval: A Systematic Empirical Study of Prompt Injection Attack Effectiveness Against Large Language Models
by: Wang, Jackson
Published: (2026)

Image-based Prompt Injection: Hijacking Multimodal LLMs through Visually Embedded Adversarial Instructions
by: Nagaraja, Neha, et al.
Published: (2026)

PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization
by: Geng, Runpeng, et al.
Published: (2025)

Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs
by: Pathade, Chetan
Published: (2025)

Applying Pre-trained Multilingual BERT in Embeddings for Improved Malicious Prompt Injection Attacks Detection
by: Rahman, Md Abdur, et al.
Published: (2024)