:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Balashov, Andrii, Ponomarova, Olena, Zhai, Xiaohua
Format:	Preprint
Published:	2025
Subjects:	Cryptography and Security Artificial Intelligence
Online Access:	https://arxiv.org/abs/2507.15613
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt
by: Wang, Zhilong, et al.
Published: (2025)

AMDS: Attack-Aware Multi-Stage Defense System for Network Intrusion Detection with Two-Stage Adaptive Weight Learning
by: Olukola, Oluseyi, et al.
Published: (2026)

A Method for Enhancing the Safety of Large Model Generation Based on Multi-dimensional Attack and Defense
by: Zhai, Keke
Published: (2024)

Optimization-based Prompt Injection Attack to LLM-as-a-Judge
by: Shi, Jiawen, et al.
Published: (2024)

AttackPilot: Autonomous Inference Attacks Against ML Services With LLM-Based Agents
by: Wu, Yixin, et al.
Published: (2025)

CompressionAttack: Exploiting Prompt Compression as a New Attack Surface in LLM-Powered Agents
by: Liu, Zesen, et al.
Published: (2025)

The System Prompt Is the Attack Surface: How LLM Agent Configuration Shapes Security and Creates Exploitable Vulnerabilities
by: Litvak, Ron
Published: (2026)

LUMIA: Linear probing for Unimodal and MultiModal Membership Inference Attacks leveraging internal LLM states
by: Ibanez-Lissen, Luis, et al.
Published: (2024)

Prompt-in-Content Attacks: Exploiting Uploaded Inputs to Hijack LLM Behavior
by: Lian, Zhuotao, et al.
Published: (2025)

Signed-Prompt: A New Approach to Prevent Prompt Injection Attacks Against LLM-Integrated Applications
by: Suo, Xuchen
Published: (2024)

Prompt Infection: LLM-to-LLM Prompt Injection within Multi-Agent Systems
by: Lee, Donghyun, et al.
Published: (2024)

PromptLocate: Localizing Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)

PIDP-Attack: Combining Prompt Injection with Database Poisoning Attacks on Retrieval-Augmented Generation Systems
by: Wang, Haozhen, et al.
Published: (2026)

Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks
by: Chen, Sizhe, et al.
Published: (2025)

Democratizing ML for Enterprise Security: A Self-Sustained Attack Detection Framework
by: Momeni, Sadegh, et al.
Published: (2025)

Has My System Prompt Been Used? Large Language Model Prompt Membership Inference
by: Levin, Roman, et al.
Published: (2025)

Manipulating LLM Web Agents with Indirect Prompt Injection Attack via HTML Accessibility Tree
by: Johnson, Sam, et al.
Published: (2025)

Involuntary Jailbreak: On Self-Prompting Attacks
by: Guo, Yangyang, et al.
Published: (2025)

CheatAgent: Attacking LLM-Empowered Recommender Systems via LLM Agent
by: Ning, Liang-bo, et al.
Published: (2025)

Burn-After-Use for Preventing Data Leakage through a Secure Multi-Tenant Architecture in Enterprise LLM
by: Zhang, Qiang, et al.
Published: (2026)

Doppelganger Method: Breaking Role Consistency in LLM Agent via Prompt-based Transferable Adversarial Attack
by: Kang, Daewon, et al.
Published: (2025)

Token-Efficient Prompt Injection Attack: Provoking Cessation in LLM Reasoning via Adaptive Token Compression
by: Cui, Yu, et al.
Published: (2025)

SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks
by: Zhang, Kaiyuan, et al.
Published: (2025)

Operationalizing CaMeL: Strengthening LLM Defenses for Enterprise Deployment
by: Tallam, Krti, et al.
Published: (2025)

System Prompt Poisoning: Persistent Attacks on Large Language Models Beyond User Injection
by: Li, Zongze, et al.
Published: (2025)

Joint Optimization of Prompt Security and System Performance in Edge-Cloud LLM Systems
by: Huang, Haiyang, et al.
Published: (2025)

WebWeaver: Breaking Topology Confidentiality in LLM Multi-Agent Systems with Stealthy Context-Based Inference
by: Xiong, Zixun, et al.
Published: (2026)

Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack
by: Yue, Murong, et al.
Published: (2025)

SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems
by: Feng, Yunhao, et al.
Published: (2026)

Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models
by: Ma, Jiachen, et al.
Published: (2024)

Bidirectional Intention Inference Enhances LLMs' Defense Against Multi-Turn Jailbreak Attacks
by: Tong, Haibo, et al.
Published: (2025)

SafeGPT: Preventing Data Leakage and Unethical Outputs in Enterprise LLM Use
by: Desai, Pratyush, et al.
Published: (2026)

Overcoming the Retrieval Barrier: Indirect Prompt Injection in the Wild for LLM Systems
by: Chang, Hongyan, et al.
Published: (2026)

IP Leakage Attacks Targeting LLM-Based Multi-Agent Systems
by: Wang, Liwen, et al.
Published: (2025)

Web Fraud Attacks Against LLM-Driven Multi-Agent Systems
by: Kong, Dezhang, et al.
Published: (2025)

Prompt and Circumstances: Evaluating the Efficacy of Human Prompt Inference in AI-Generated Art
by: Trinh, Khoi, et al.
Published: (2026)

Securing AI Agents Against Prompt Injection Attacks
by: Ramakrishnan, Badrinath, et al.
Published: (2025)

Enhancing Jailbreak Attacks on LLMs via Persona Prompts
by: Zhang, Zheng, et al.
Published: (2025)

Analysis of LLMs Against Prompt Injection and Jailbreak Attacks
by: Jaiswal, Piyush, et al.
Published: (2026)

Defenses Against Prompt Attacks Learn Surface Heuristics
by: Li, Shawn, et al.
Published: (2026)