:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shao, Shuo, Li, Yiming, Yao, Hongwei, Chen, Yifei, Yang, Yuchen, Qin, Zhan
Format:	Preprint
Published:	2025
Subjects:	Cryptography and Security Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2510.06605
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FIT-Print: Towards False-claim-resistant Model Ownership Verification via Targeted Fingerprint
by: Shao, Shuo, et al.
Published: (2025)

SoK: Large Language Model Copyright Auditing via Fingerprinting
by: Shao, Shuo, et al.
Published: (2025)

AttriGuard: Defeating Indirect Prompt Injection in LLM Agents via Causal Attribution of Tool Invocations
by: He, Yu, et al.
Published: (2026)

Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarking Feature Attribution
by: Shao, Shuo, et al.
Published: (2024)

MIRAGE: Misleading Retrieval-Augmented Generation via Black-box and Query-agnostic Poisoning Attacks
by: Chen, Tailun, et al.
Published: (2025)

ShadowCode: Towards (Automatic) External Prompt Injection Attack against Code LLMs
by: Yang, Yuchen, et al.
Published: (2024)

External Data Extraction Attacks against Retrieval-Augmented Large Language Models
by: He, Yu, et al.
Published: (2025)

Black-Box Guardrail Reverse-engineering Attack
by: Yao, Hongwei, et al.
Published: (2025)

iSeal: Encrypted Fingerprinting for Reliable LLM Ownership Verification
by: Xiong, Zixun, et al.
Published: (2025)

Rotation, Scale, and Translation Resilient Black-box Fingerprinting for Intellectual Property Protection of EaaS Models
by: Zhang, Hongjie, et al.
Published: (2025)

On the Reliability of Radio Frequency Fingerprinting
by: Irfan, Muhammad, et al.
Published: (2024)

ControlNET: A Firewall for RAG-based LLM System
by: Yao, Hongwei, et al.
Published: (2025)

Eguard: Defending LLM Embeddings Against Inversion Attacks via Text Mutual Information Optimization
by: Liu, Tiantian, et al.
Published: (2024)

Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents
by: Zhang, Hanrong, et al.
Published: (2024)

MAJIC: Markovian Adaptive Jailbreaking via Iterative Composition of Diverse Innovative Strategies
by: Qi, Weiwei, et al.
Published: (2025)

CSF: Black-box Fingerprinting via Compositional Semantics for Text-to-Image Models
by: Lee, Junhoo, et al.
Published: (2026)

PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark
by: Wei, Cheng, et al.
Published: (2024)

CBW: Towards Dataset Ownership Verification for Speaker Verification via Clustering-based Backdoor Watermarking
by: Li, Yiming, et al.
Published: (2025)

A Game Between the Defender and the Attacker for Trigger-based Black-box Model Watermarking
by: Huang, Chaoyue, et al.
Published: (2025)

MergePrint: Merge-Resistant Fingerprints for Robust Black-box Ownership Verification of Large Language Models
by: Yamabe, Shojiro, et al.
Published: (2024)

FDINet: Protecting against DNN Model Extraction via Feature Distortion Index
by: Yao, Hongwei, et al.
Published: (2023)

Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language Models
by: Yi, Biao, et al.
Published: (2025)

SmartGuard: Leveraging Large Language Models for Network Attack Detection through Audit Log Analysis and Summarization
by: Zhang, Hao, et al.
Published: (2025)

Black-box Optimization of LLM Outputs by Asking for Directions
by: Zhang, Jie, et al.
Published: (2025)

Audio Pirates: Black-box Audio Watermark Removal via Diffusion Priors
by: Yao, Lingfeng, et al.
Published: (2026)

AgentTypo: Adaptive Typographic Prompt Injection Attacks against Black-box Multimodal Agents
by: Li, Yanjie, et al.
Published: (2025)

UTF:Undertrained Tokens as Fingerprints A Novel Approach to LLM Identification
by: Cai, Jiacheng, et al.
Published: (2024)

LLM Fingerprinting via Semantically Conditioned Watermarks
by: Gloaguen, Thibaud, et al.
Published: (2025)

SEW: Strengthening Robustness of Black-box DNN Watermarking via Specificity Enhancement
by: Qiu, Huming, et al.
Published: (2026)

Performance-lossless Black-box Model Watermarking
by: Zhao, Na, et al.
Published: (2023)

Query Provenance Analysis: Efficient and Robust Defense against Query-based Black-box Attacks
by: Li, Shaofei, et al.
Published: (2024)

Red-Teaming Agent Execution Contexts: Open-World Security Evaluation on OpenClaw
by: Yao, Hongwei, et al.
Published: (2026)

Inhibitory Attacks on Backdoor-based Fingerprinting for Large Language Models
by: Fu, Hang, et al.
Published: (2026)

BlackCATT: Black-box Collusion Aware Traitor Tracing in Federated Learning
by: Rodríguez-Lois, Elena, et al.
Published: (2026)

BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models
by: Wu, Zhengxian, et al.
Published: (2025)

Towards Provably Secure Generative AI: Reliable Consensus Sampling
by: Cui, Yu, et al.
Published: (2025)

UAV Individual Identification via Distilled RF Fingerprints-Based LLM in ISAC Networks
by: Zheng, Haolin, et al.
Published: (2025)

REFINE: Inversion-Free Backdoor Defense via Model Reprogramming
by: Chen, Yukun, et al.
Published: (2025)

Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions
by: Zhang, Yixiang, et al.
Published: (2025)

Shadow in the Cache: Unveiling and Mitigating Privacy Risks of KV-cache in LLM Inference
by: Luo, Zhifan, et al.
Published: (2025)