Saved in:
| Main Authors: | Balashov, Andrii, Ponomarova, Olena, Zhai, Xiaohua |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.15613 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt
by: Wang, Zhilong, et al.
Published: (2025)
by: Wang, Zhilong, et al.
Published: (2025)
AMDS: Attack-Aware Multi-Stage Defense System for Network Intrusion Detection with Two-Stage Adaptive Weight Learning
by: Olukola, Oluseyi, et al.
Published: (2026)
by: Olukola, Oluseyi, et al.
Published: (2026)
A Method for Enhancing the Safety of Large Model Generation Based on Multi-dimensional Attack and Defense
by: Zhai, Keke
Published: (2024)
by: Zhai, Keke
Published: (2024)
Optimization-based Prompt Injection Attack to LLM-as-a-Judge
by: Shi, Jiawen, et al.
Published: (2024)
by: Shi, Jiawen, et al.
Published: (2024)
AttackPilot: Autonomous Inference Attacks Against ML Services With LLM-Based Agents
by: Wu, Yixin, et al.
Published: (2025)
by: Wu, Yixin, et al.
Published: (2025)
CompressionAttack: Exploiting Prompt Compression as a New Attack Surface in LLM-Powered Agents
by: Liu, Zesen, et al.
Published: (2025)
by: Liu, Zesen, et al.
Published: (2025)
The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities
by: Litvak, Ron
Published: (2026)
by: Litvak, Ron
Published: (2026)
LUMIA: Linear probing for Unimodal and MultiModal Membership Inference Attacks leveraging internal LLM states
by: Ibanez-Lissen, Luis, et al.
Published: (2024)
by: Ibanez-Lissen, Luis, et al.
Published: (2024)
Prompt-in-Content Attacks: Exploiting Uploaded Inputs to Hijack LLM Behavior
by: Lian, Zhuotao, et al.
Published: (2025)
by: Lian, Zhuotao, et al.
Published: (2025)
Signed-Prompt: A New Approach to Prevent Prompt Injection Attacks Against LLM-Integrated Applications
by: Suo, Xuchen
Published: (2024)
by: Suo, Xuchen
Published: (2024)
Prompt Infection: LLM-to-LLM Prompt Injection within Multi-Agent Systems
by: Lee, Donghyun, et al.
Published: (2024)
by: Lee, Donghyun, et al.
Published: (2024)
PromptLocate: Localizing Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)
by: Jia, Yuqi, et al.
Published: (2025)
PIDP-Attack: Combining Prompt Injection with Database Poisoning Attacks on Retrieval-Augmented Generation Systems
by: Wang, Haozhen, et al.
Published: (2026)
by: Wang, Haozhen, et al.
Published: (2026)
Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks
by: Chen, Sizhe, et al.
Published: (2025)
by: Chen, Sizhe, et al.
Published: (2025)
Democratizing ML for Enterprise Security: A Self-Sustained Attack Detection Framework
by: Momeni, Sadegh, et al.
Published: (2025)
by: Momeni, Sadegh, et al.
Published: (2025)
Has My System Prompt Been Used? Large Language Model Prompt Membership Inference
by: Levin, Roman, et al.
Published: (2025)
by: Levin, Roman, et al.
Published: (2025)
Manipulating LLM Web Agents with Indirect Prompt Injection Attack via HTML Accessibility Tree
by: Johnson, Sam, et al.
Published: (2025)
by: Johnson, Sam, et al.
Published: (2025)
Involuntary Jailbreak: On Self-Prompting Attacks
by: Guo, Yangyang, et al.
Published: (2025)
by: Guo, Yangyang, et al.
Published: (2025)
CheatAgent: Attacking LLM-Empowered Recommender Systems via LLM Agent
by: Ning, Liang-bo, et al.
Published: (2025)
by: Ning, Liang-bo, et al.
Published: (2025)
Burn-After-Use for Preventing Data Leakage through a Secure Multi-Tenant Architecture in Enterprise LLM
by: Zhang, Qiang, et al.
Published: (2026)
by: Zhang, Qiang, et al.
Published: (2026)
Doppelganger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack
by: Kang, Daewon, et al.
Published: (2025)
by: Kang, Daewon, et al.
Published: (2025)
Token-Efficient Prompt Injection Attack: Provoking Cessation in LLM Reasoning via Adaptive Token Compression
by: Cui, Yu, et al.
Published: (2025)
by: Cui, Yu, et al.
Published: (2025)
SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks
by: Zhang, Kaiyuan, et al.
Published: (2025)
by: Zhang, Kaiyuan, et al.
Published: (2025)
Operationalizing CaMeL: Strengthening LLM Defenses for Enterprise Deployment
by: Tallam, Krti, et al.
Published: (2025)
by: Tallam, Krti, et al.
Published: (2025)
System Prompt Poisoning: Persistent Attacks on Large Language Models Beyond User Injection
by: Li, Zongze, et al.
Published: (2025)
by: Li, Zongze, et al.
Published: (2025)
Joint Optimization of Prompt Security and System Performance in Edge-Cloud LLM Systems
by: Huang, Haiyang, et al.
Published: (2025)
by: Huang, Haiyang, et al.
Published: (2025)
WebWeaver: Breaking Topology Confidentiality in LLM Multi-Agent Systems with Stealthy Context-Based Inference
by: Xiong, Zixun, et al.
Published: (2026)
by: Xiong, Zixun, et al.
Published: (2026)
Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack
by: Yue, Murong, et al.
Published: (2025)
by: Yue, Murong, et al.
Published: (2025)
SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems
by: Feng, Yunhao, et al.
Published: (2026)
by: Feng, Yunhao, et al.
Published: (2026)
Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models
by: Ma, Jiachen, et al.
Published: (2024)
by: Ma, Jiachen, et al.
Published: (2024)
Bidirectional Intention Inference Enhances LLMs' Defense Against Multi-Turn Jailbreak Attacks
by: Tong, Haibo, et al.
Published: (2025)
by: Tong, Haibo, et al.
Published: (2025)
SafeGPT: Preventing Data Leakage and Unethical Outputs in Enterprise LLM Use
by: Desai, Pratyush, et al.
Published: (2026)
by: Desai, Pratyush, et al.
Published: (2026)
Overcoming the Retrieval Barrier: Indirect Prompt Injection in the Wild for LLM Systems
by: Chang, Hongyan, et al.
Published: (2026)
by: Chang, Hongyan, et al.
Published: (2026)
IP Leakage Attacks Targeting LLM-Based Multi-Agent Systems
by: Wang, Liwen, et al.
Published: (2025)
by: Wang, Liwen, et al.
Published: (2025)
Web Fraud Attacks Against LLM-Driven Multi-Agent Systems
by: Kong, Dezhang, et al.
Published: (2025)
by: Kong, Dezhang, et al.
Published: (2025)
Prompt and Circumstances: Evaluating the Efficacy of Human Prompt Inference in AI-Generated Art
by: Trinh, Khoi, et al.
Published: (2026)
by: Trinh, Khoi, et al.
Published: (2026)
Securing AI Agents Against Prompt Injection Attacks
by: Ramakrishnan, Badrinath, et al.
Published: (2025)
by: Ramakrishnan, Badrinath, et al.
Published: (2025)
Enhancing Jailbreak Attacks on LLMs via Persona Prompts
by: Zhang, Zheng, et al.
Published: (2025)
by: Zhang, Zheng, et al.
Published: (2025)
Analysis of LLMs Against Prompt Injection and Jailbreak Attacks
by: Jaiswal, Piyush, et al.
Published: (2026)
by: Jaiswal, Piyush, et al.
Published: (2026)
Defenses Against Prompt Attacks Learn Surface Heuristics
by: Li, Shawn, et al.
Published: (2026)
by: Li, Shawn, et al.
Published: (2026)
Similar Items
-
To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt
by: Wang, Zhilong, et al.
Published: (2025) -
AMDS: Attack-Aware Multi-Stage Defense System for Network Intrusion Detection with Two-Stage Adaptive Weight Learning
by: Olukola, Oluseyi, et al.
Published: (2026) -
A Method for Enhancing the Safety of Large Model Generation Based on Multi-dimensional Attack and Defense
by: Zhai, Keke
Published: (2024) -
Optimization-based Prompt Injection Attack to LLM-as-a-Judge
by: Shi, Jiawen, et al.
Published: (2024) -
AttackPilot: Autonomous Inference Attacks Against ML Services With LLM-Based Agents
by: Wu, Yixin, et al.
Published: (2025)