Saved in:
| Main Authors: | Xu, Zhenyu, Sheng, Victor S. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.09434 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LLMmap: Fingerprinting For Large Language Models
by: Pasquini, Dario, et al.
Published: (2024)
by: Pasquini, Dario, et al.
Published: (2024)
CoTSRF: Utilize Chain of Thought as Stealthy and Robust Fingerprint of Large Language Models
by: Ren, Zhenzhen, et al.
Published: (2025)
by: Ren, Zhenzhen, et al.
Published: (2025)
MEraser: An Effective Fingerprint Erasure Approach for Large Language Models
by: Zhang, Jingxuan, et al.
Published: (2025)
by: Zhang, Jingxuan, et al.
Published: (2025)
ForgetMark: Stealthy Fingerprint Embedding via Targeted Unlearning in Language Models
by: Xu, Zhenhua, et al.
Published: (2026)
by: Xu, Zhenhua, et al.
Published: (2026)
Hide and Seek: Fingerprinting Large Language Models with Evolutionary Learning
by: Iourovitski, Dmitri, et al.
Published: (2024)
by: Iourovitski, Dmitri, et al.
Published: (2024)
DNF: Dual-Layer Nested Fingerprinting for Large Language Model Intellectual Property Protection
by: Xu, Zhenhua, et al.
Published: (2026)
by: Xu, Zhenhua, et al.
Published: (2026)
EditMF: Drawing an Invisible Fingerprint for Your Large Language Models
by: Wu, Jiaxuan, et al.
Published: (2025)
by: Wu, Jiaxuan, et al.
Published: (2025)
DrLLM: Prompt-Enhanced Distributed Denial-of-Service Resistance Method with Large Language Models
by: Yin, Zhenyu, et al.
Published: (2024)
by: Yin, Zhenyu, et al.
Published: (2024)
A Content-Based Framework for Cybersecurity Refusal Decisions in Large Language Models
by: Linder, Noa, et al.
Published: (2026)
by: Linder, Noa, et al.
Published: (2026)
Data Provenance Auditing of Fine-Tuned Large Language Models with a Text-Preserving Technique
by: Li, Yanming, et al.
Published: (2025)
by: Li, Yanming, et al.
Published: (2025)
SoK: Large Language Model Copyright Auditing via Fingerprinting
by: Shao, Shuo, et al.
Published: (2025)
by: Shao, Shuo, et al.
Published: (2025)
OMNISEC: LLM-Driven Provenance-based Intrusion Detection via Retrieval-Augmented Behavior Prompting
by: Cheng, Wenrui, et al.
Published: (2025)
by: Cheng, Wenrui, et al.
Published: (2025)
Instructional Fingerprinting of Large Language Models
by: Xu, Jiashu, et al.
Published: (2024)
by: Xu, Jiashu, et al.
Published: (2024)
ZKPROV: A Zero-Knowledge Approach to Dataset Provenance for Large Language Models
by: Namazi, Mina, et al.
Published: (2025)
by: Namazi, Mina, et al.
Published: (2025)
Refusal Before Decoding: Detecting and Exploiting Refusal Signals in Intermediate LLM Activations
by: Collu, Matteo Gioele, et al.
Published: (2026)
by: Collu, Matteo Gioele, et al.
Published: (2026)
FNF: Functional Network Fingerprint for Large Language Models
by: Liu, Yiheng, et al.
Published: (2026)
by: Liu, Yiheng, et al.
Published: (2026)
REEF: Representation Encoding Fingerprints for Large Language Models
by: Zhang, Jie, et al.
Published: (2024)
by: Zhang, Jie, et al.
Published: (2024)
PRSI: Privacy-Preserving Recommendation Model Based on Vector Splitting and Interactive Protocols
by: Cao, Xiaokai, et al.
Published: (2024)
by: Cao, Xiaokai, et al.
Published: (2024)
Q-MLLM: Vector Quantization for Robust Multimodal Large Language Model Security
by: Zhao, Wei, et al.
Published: (2025)
by: Zhao, Wei, et al.
Published: (2025)
Layerwise Convergence Fingerprints for Runtime Misbehavior Detection in Large Language Models
by: Min, Nay Myat, et al.
Published: (2026)
by: Min, Nay Myat, et al.
Published: (2026)
Agent-Sentry: Bounding LLM Agents via Execution Provenance
by: Sequeira, Rohan, et al.
Published: (2026)
by: Sequeira, Rohan, et al.
Published: (2026)
Signal Watermark on Large Language Models
by: Xu, Zhenyu, et al.
Published: (2024)
by: Xu, Zhenyu, et al.
Published: (2024)
Atlas: A Framework for ML Lifecycle Provenance & Transparency
by: Spoczynski, Marcin, et al.
Published: (2025)
by: Spoczynski, Marcin, et al.
Published: (2025)
Towards Robust Multi-tab Website Fingerprinting
by: Deng, Xinhao, et al.
Published: (2025)
by: Deng, Xinhao, et al.
Published: (2025)
A Survey of Attacks on Large Language Models
by: Xu, Wenrui, et al.
Published: (2025)
by: Xu, Wenrui, et al.
Published: (2025)
Untraceable DeepFakes via Traceable Fingerprint Elimination
by: Lai, Jiewei, et al.
Published: (2025)
by: Lai, Jiewei, et al.
Published: (2025)
Refusing Safe Prompts for Multi-modal Large Language Models
by: Shao, Zedian, et al.
Published: (2024)
by: Shao, Zedian, et al.
Published: (2024)
InvisMark: Invisible and Robust Watermarking for AI-generated Image Provenance
by: Xu, Rui, et al.
Published: (2024)
by: Xu, Rui, et al.
Published: (2024)
Towards Fine-Grained Webpage Fingerprinting at Scale
by: Zhao, Xiyuan, et al.
Published: (2024)
by: Zhao, Xiyuan, et al.
Published: (2024)
Defensive Refusal Bias: How Safety Alignment Fails Cyber Defenders
by: Campbell, David, et al.
Published: (2026)
by: Campbell, David, et al.
Published: (2026)
Beyond Explicit Refusals: Soft-Failure Attacks on Retrieval-Augmented Generation
by: Zhang, Wentao, et al.
Published: (2026)
by: Zhang, Wentao, et al.
Published: (2026)
Anticipating Adversary Behavior in DevSecOps Scenarios through Large Language Models
by: Caballero, Mario Marín, et al.
Published: (2026)
by: Caballero, Mario Marín, et al.
Published: (2026)
Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes
by: Hu, Xiaomeng, et al.
Published: (2024)
by: Hu, Xiaomeng, et al.
Published: (2024)
Winning at All Cost: A Small Environment for Eliciting Specification Gaming Behaviors in Large Language Models
by: Malmqvist, Lars
Published: (2025)
by: Malmqvist, Lars
Published: (2025)
AlphaSteer: Learning Refusal Steering with Principled Null-Space Constraint
by: Sheng, Leheng, et al.
Published: (2025)
by: Sheng, Leheng, et al.
Published: (2025)
FPEdit: Robust LLM Fingerprinting through Localized Parameter Editing
by: Wang, Shida, et al.
Published: (2025)
by: Wang, Shida, et al.
Published: (2025)
Refusal Falls off a Cliff: How Safety Alignment Fails in Reasoning?
by: Yin, Qingyu, et al.
Published: (2025)
by: Yin, Qingyu, et al.
Published: (2025)
User Behavior Analysis in Privacy Protection with Large Language Models: A Study on Privacy Preferences with Limited Data
by: Yang, Haowei, et al.
Published: (2025)
by: Yang, Haowei, et al.
Published: (2025)
SoK: Robustness in Large Language Models against Jailbreak Attacks
by: Xu, Feiyue, et al.
Published: (2026)
by: Xu, Feiyue, et al.
Published: (2026)
SAID: Safety-Aware Intent Defense via Prefix Probing for Large Language Models
by: Chen, Yulong, et al.
Published: (2025)
by: Chen, Yulong, et al.
Published: (2025)
Similar Items
-
LLMmap: Fingerprinting For Large Language Models
by: Pasquini, Dario, et al.
Published: (2024) -
CoTSRF: Utilize Chain of Thought as Stealthy and Robust Fingerprint of Large Language Models
by: Ren, Zhenzhen, et al.
Published: (2025) -
MEraser: An Effective Fingerprint Erasure Approach for Large Language Models
by: Zhang, Jingxuan, et al.
Published: (2025) -
ForgetMark: Stealthy Fingerprint Embedding via Targeted Unlearning in Language Models
by: Xu, Zhenhua, et al.
Published: (2026) -
Hide and Seek: Fingerprinting Large Language Models with Evolutionary Learning
by: Iourovitski, Dmitri, et al.
Published: (2024)