Saved in:
| Main Authors: | Ren, Zhenzhen, Li, GuoBiao, Li, Sheng, Qian, Zhenxing, Zhang, Xinpeng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.16785 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
R-CoT: A Reasoning-Layer Watermark via Redundant Chain-of-Thought in Large Language Models
by: Zhang, Ziming, et al.
Published: (2026)
by: Zhang, Ziming, et al.
Published: (2026)
ForgetMark: Stealthy Fingerprint Embedding via Targeted Unlearning in Language Models
by: Xu, Zhenhua, et al.
Published: (2026)
by: Xu, Zhenhua, et al.
Published: (2026)
SyncGuard: Robust Audio Watermarking Capable of Countering Desynchronization Attacks
by: Gan, Zhenliang, et al.
Published: (2025)
by: Gan, Zhenliang, et al.
Published: (2025)
Cover-separable Fixed Neural Network Steganography via Deep Generative Models
by: Li, Guobiao, et al.
Published: (2024)
by: Li, Guobiao, et al.
Published: (2024)
EverTracer: Hunting Stolen Large Language Models via Stealthy and Robust Probabilistic Fingerprint
by: Xu, Zhenhua, et al.
Published: (2025)
by: Xu, Zhenhua, et al.
Published: (2025)
BadThink: Triggered Overthinking Attacks on Chain-of-Thought Reasoning in Large Language Models
by: Liu, Shuaitong, et al.
Published: (2025)
by: Liu, Shuaitong, et al.
Published: (2025)
Large Language Model-driven Security Assistant for Internet of Things via Chain-of-Thought
by: Zeng, Mingfei, et al.
Published: (2025)
by: Zeng, Mingfei, et al.
Published: (2025)
A Behavioral Fingerprint for Large Language Models: Provenance Tracking via Refusal Vectors
by: Xu, Zhenyu, et al.
Published: (2026)
by: Xu, Zhenyu, et al.
Published: (2026)
LLMmap: Fingerprinting For Large Language Models
by: Pasquini, Dario, et al.
Published: (2024)
by: Pasquini, Dario, et al.
Published: (2024)
Hiding in Plain Sight: A Steganographic Approach to Stealthy LLM Jailbreaks
by: Geng, Jianing, et al.
Published: (2025)
by: Geng, Jianing, et al.
Published: (2025)
CryptoScope: Utilizing Large Language Models for Automated Cryptographic Logic Vulnerability Detection
by: Li, Zhihao, et al.
Published: (2025)
by: Li, Zhihao, et al.
Published: (2025)
StealthInk: A Multi-bit and Stealthy Watermark for Large Language Models
by: Jiang, Ya, et al.
Published: (2025)
by: Jiang, Ya, et al.
Published: (2025)
Purified and Unified Steganographic Network
by: Li, Guobiao, et al.
Published: (2024)
by: Li, Guobiao, et al.
Published: (2024)
REEF: Representation Encoding Fingerprints for Large Language Models
by: Zhang, Jie, et al.
Published: (2024)
by: Zhang, Jie, et al.
Published: (2024)
ExplainableGuard: Interpretable Adversarial Defense for Large Language Models Using Chain-of-Thought Reasoning
by: Guan, Shaowei, et al.
Published: (2025)
by: Guan, Shaowei, et al.
Published: (2025)
ShadowCoT: Cognitive Hijacking for Stealthy Reasoning Backdoors in LLMs
by: Zhao, Gejian, et al.
Published: (2025)
by: Zhao, Gejian, et al.
Published: (2025)
Echoes within the Reasoning: Stealthy and Effective Watermarking via Chain of Thought
by: Lu, Jiacheng, et al.
Published: (2026)
by: Lu, Jiacheng, et al.
Published: (2026)
Transferable & Stealthy Ensemble Attacks: A Black-Box Jailbreaking Framework for Large Language Models
by: Yang, Yiqi, et al.
Published: (2024)
by: Yang, Yiqi, et al.
Published: (2024)
MEraser: An Effective Fingerprint Erasure Approach for Large Language Models
by: Zhang, Jingxuan, et al.
Published: (2025)
by: Zhang, Jingxuan, et al.
Published: (2025)
Hide and Seek: Fingerprinting Large Language Models with Evolutionary Learning
by: Iourovitski, Dmitri, et al.
Published: (2024)
by: Iourovitski, Dmitri, et al.
Published: (2024)
Stealthy and Persistent Unalignment on Large Language Models via Backdoor Injections
by: Cao, Yuanpu, et al.
Published: (2023)
by: Cao, Yuanpu, et al.
Published: (2023)
PAPILLON: Efficient and Stealthy Fuzz Testing-Powered Jailbreaks for LLMs
by: Gong, Xueluan, et al.
Published: (2024)
by: Gong, Xueluan, et al.
Published: (2024)
Towards Robust Multi-tab Website Fingerprinting
by: Deng, Xinhao, et al.
Published: (2025)
by: Deng, Xinhao, et al.
Published: (2025)
EditMF: Drawing an Invisible Fingerprint for Your Large Language Models
by: Wu, Jiaxuan, et al.
Published: (2025)
by: Wu, Jiaxuan, et al.
Published: (2025)
AdvWave: Stealthy Adversarial Jailbreak Attack against Large Audio-Language Models
by: Kang, Mintong, et al.
Published: (2024)
by: Kang, Mintong, et al.
Published: (2024)
Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
by: Russinovich, Mark, et al.
Published: (2024)
by: Russinovich, Mark, et al.
Published: (2024)
SoK: Large Language Model Copyright Auditing via Fingerprinting
by: Shao, Shuo, et al.
Published: (2025)
by: Shao, Shuo, et al.
Published: (2025)
Chain-of-Scrutiny: Detecting Backdoor Attacks for Large Language Models
by: Li, Xi, et al.
Published: (2024)
by: Li, Xi, et al.
Published: (2024)
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
by: Xiang, Zhen, et al.
Published: (2024)
by: Xiang, Zhen, et al.
Published: (2024)
Towards Backdoor Stealthiness in Model Parameter Space
by: Xu, Xiaoyun, et al.
Published: (2025)
by: Xu, Xiaoyun, et al.
Published: (2025)
Beyond Max Tokens: Stealthy Resource Amplification via Tool Calling Chains in LLM Agents
by: Zhou, Kaiyu, et al.
Published: (2026)
by: Zhou, Kaiyu, et al.
Published: (2026)
Revocable Backdoor for Deep Model Trading
by: Xu, Yiran, et al.
Published: (2024)
by: Xu, Yiran, et al.
Published: (2024)
FNF: Functional Network Fingerprint for Large Language Models
by: Liu, Yiheng, et al.
Published: (2026)
by: Liu, Yiheng, et al.
Published: (2026)
Invariant-based Robust Weights Watermark for Large Language Models
by: Guo, Qingxiao, et al.
Published: (2025)
by: Guo, Qingxiao, et al.
Published: (2025)
DNF: Dual-Layer Nested Fingerprinting for Large Language Model Intellectual Property Protection
by: Xu, Zhenhua, et al.
Published: (2026)
by: Xu, Zhenhua, et al.
Published: (2026)
Towards Effective, Stealthy, and Persistent Backdoor Attacks Targeting Graph Foundation Models
by: Luo, Jiayi, et al.
Published: (2025)
by: Luo, Jiayi, et al.
Published: (2025)
LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring
by: Li, Chloe, et al.
Published: (2025)
by: Li, Chloe, et al.
Published: (2025)
Know Thy Enemy: Securing LLMs Against Prompt Injection via Diverse Data Synthesis and Instruction-Level Chain-of-Thought Learning
by: Chang, Zhiyuan, et al.
Published: (2026)
by: Chang, Zhiyuan, et al.
Published: (2026)
Q-MLLM: Vector Quantization for Robust Multimodal Large Language Model Security
by: Zhao, Wei, et al.
Published: (2025)
by: Zhao, Wei, et al.
Published: (2025)
Instructional Fingerprinting of Large Language Models
by: Xu, Jiashu, et al.
Published: (2024)
by: Xu, Jiashu, et al.
Published: (2024)
Similar Items
-
R-CoT: A Reasoning-Layer Watermark via Redundant Chain-of-Thought in Large Language Models
by: Zhang, Ziming, et al.
Published: (2026) -
ForgetMark: Stealthy Fingerprint Embedding via Targeted Unlearning in Language Models
by: Xu, Zhenhua, et al.
Published: (2026) -
SyncGuard: Robust Audio Watermarking Capable of Countering Desynchronization Attacks
by: Gan, Zhenliang, et al.
Published: (2025) -
Cover-separable Fixed Neural Network Steganography via Deep Generative Models
by: Li, Guobiao, et al.
Published: (2024) -
EverTracer: Hunting Stolen Large Language Models via Stealthy and Robust Probabilistic Fingerprint
by: Xu, Zhenhua, et al.
Published: (2025)