Saved in:
| Main Authors: | Lu, Jiacheng, Li, Yiming, Song, Tao, Wang, Weijian, Qu, Wenjie, Guan, Haibing, Zhang, Jiaheng |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.28890 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SWAP: Towards Copyright Auditing of Soft Prompts via Sequential Watermarking
by: Yang, Wenyuan, et al.
Published: (2025)
by: Yang, Wenyuan, et al.
Published: (2025)
Self-Sovereign Agent
by: Qu, Wenjie, et al.
Published: (2026)
by: Qu, Wenjie, et al.
Published: (2026)
DMark: Order-Agnostic Watermarking for Diffusion Large Language Models
by: Wu, Linyu, et al.
Published: (2025)
by: Wu, Linyu, et al.
Published: (2025)
Thought-Transfer: Indirect Targeted Poisoning Attacks on Chain-of-Thought Reasoning Models
by: Chaudhari, Harsh, et al.
Published: (2026)
by: Chaudhari, Harsh, et al.
Published: (2026)
AliMark: Enhancing Robustness of Sentence-Level Watermarking Against Text Paraphrasing
by: Li, Yuexin, et al.
Published: (2026)
by: Li, Yuexin, et al.
Published: (2026)
SettleFL: Trustless and Scalable Reward Settlement Protocol for Federated Learning on Permissionless Blockchains (Extended version)
by: Liang, Shuang, et al.
Published: (2026)
by: Liang, Shuang, et al.
Published: (2026)
Stealthy Yet Effective: Distribution-Preserving Backdoor Attacks on Graph Classification
by: Wang, Xiaobao, et al.
Published: (2025)
by: Wang, Xiaobao, et al.
Published: (2025)
Watermark under Fire: A Robustness Evaluation of LLM Watermarking
by: Liang, Jiacheng, et al.
Published: (2024)
by: Liang, Jiacheng, et al.
Published: (2024)
LoRAGuard: An Effective Black-box Watermarking Approach for LoRAs
by: Lv, Peizhuo, et al.
Published: (2025)
by: Lv, Peizhuo, et al.
Published: (2025)
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
by: Xiang, Zhen, et al.
Published: (2024)
by: Xiang, Zhen, et al.
Published: (2024)
DarkMind: Latent Chain-of-Thought Backdoor in Customized LLMs
by: Guo, Zhen, et al.
Published: (2025)
by: Guo, Zhen, et al.
Published: (2025)
Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Watermarking Feature Attribution
by: Shao, Shuo, et al.
Published: (2024)
by: Shao, Shuo, et al.
Published: (2024)
Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking
by: Yao, Yuan, et al.
Published: (2025)
by: Yao, Yuan, et al.
Published: (2025)
Poisoning with A Pill: Circumventing Detection in Federated Learning
by: Guo, Hanxi, et al.
Published: (2024)
by: Guo, Hanxi, et al.
Published: (2024)
SWA-LDM: Toward Stealthy Watermarks for Latent Diffusion Models
by: Yang, Zhonghao, et al.
Published: (2025)
by: Yang, Zhonghao, et al.
Published: (2025)
Ideal Attribution and Faithful Watermarks for Language Models
by: Song, Min Jae, et al.
Published: (2025)
by: Song, Min Jae, et al.
Published: (2025)
SWaRL: Safeguard Code Watermarking via Reinforcement Learning
by: Javidnia, Neusha, et al.
Published: (2026)
by: Javidnia, Neusha, et al.
Published: (2026)
Stealthy and Adjustable Text-Guided Backdoor Attacks on Multimodal Pretrained Models
by: Zhang, Yiyang, et al.
Published: (2026)
by: Zhang, Yiyang, et al.
Published: (2026)
Stealthy Adversarial Attacks on Stochastic Multi-Armed Bandits
by: Wang, Zhiwei, et al.
Published: (2024)
by: Wang, Zhiwei, et al.
Published: (2024)
Robust Spectral Watermark for Synthetic Tabular Data
by: Zhao, Yizhou, et al.
Published: (2025)
by: Zhao, Yizhou, et al.
Published: (2025)
LLM Fingerprinting via Semantically Conditioned Watermarks
by: Gloaguen, Thibaud, et al.
Published: (2025)
by: Gloaguen, Thibaud, et al.
Published: (2025)
State Backdoor: Towards Stealthy Real-world Poisoning Attack on Vision-Language-Action Model in State Space
by: Guo, Ji, et al.
Published: (2026)
by: Guo, Ji, et al.
Published: (2026)
Distortion-free Watermarks are not Truly Distortion-free under Watermark Key Collisions
by: Wu, Yihan, et al.
Published: (2024)
by: Wu, Yihan, et al.
Published: (2024)
Lurking in the shadows: Unveiling Stealthy Backdoor Attacks against Personalized Federated Learning
by: Lyu, Xiaoting, et al.
Published: (2024)
by: Lyu, Xiaoting, et al.
Published: (2024)
Unforgeable Watermarks for Language Models via Robust Signatures
by: Lin, Huijia, et al.
Published: (2026)
by: Lin, Huijia, et al.
Published: (2026)
Stealthy Imitation: Reward-guided Environment-free Policy Stealing
by: Zhuang, Zhixiong, et al.
Published: (2024)
by: Zhuang, Zhixiong, et al.
Published: (2024)
Provably Robust Multi-bit Watermarking for AI-generated Text
by: Qu, Wenjie, et al.
Published: (2024)
by: Qu, Wenjie, et al.
Published: (2024)
Conscious Data Contribution via Community-Driven Chain-of-Thought Distillation
by: Libon, Lena, et al.
Published: (2025)
by: Libon, Lena, et al.
Published: (2025)
GESR: Graph-Based Edge Semantic Reconstruction for Stealthy Communication Detection with Benign-Only Training
by: Xu, Henghui, et al.
Published: (2026)
by: Xu, Henghui, et al.
Published: (2026)
R-CoT: A Reasoning-Layer Watermark via Redundant Chain-of-Thought in Large Language Models
by: Zhang, Ziming, et al.
Published: (2026)
by: Zhang, Ziming, et al.
Published: (2026)
Graph-Aware Stealthy Poison-Text Backdoors for Text-Attributed Graphs
by: Luo, Qi, et al.
Published: (2026)
by: Luo, Qi, et al.
Published: (2026)
AutoRAN: Automated Hijacking of Safety Reasoning in Large Reasoning Models
by: Liang, Jiacheng, et al.
Published: (2025)
by: Liang, Jiacheng, et al.
Published: (2025)
Output Supervision Can Obfuscate the Chain of Thought
by: Drori, Jacob, et al.
Published: (2025)
by: Drori, Jacob, et al.
Published: (2025)
Robust GNN Watermarking via Implicit Perception of Topological Invariants
by: Li, Jipeng, et al.
Published: (2025)
by: Li, Jipeng, et al.
Published: (2025)
CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion
by: Wu, Xiaoyu, et al.
Published: (2024)
by: Wu, Xiaoyu, et al.
Published: (2024)
SDBA: A Stealthy and Long-Lasting Durable Backdoor Attack in Federated Learning
by: Choe, Minyeong, et al.
Published: (2024)
by: Choe, Minyeong, et al.
Published: (2024)
SilentStriker:Toward Stealthy Bit-Flip Attacks on Large Language Models
by: Xu, Haotian, et al.
Published: (2025)
by: Xu, Haotian, et al.
Published: (2025)
PoLO: Proof-of-Learning and Proof-of-Ownership at Once with Chained Watermarking
by: Deng, Haiyu, et al.
Published: (2025)
by: Deng, Haiyu, et al.
Published: (2025)
Traceable Black-box Watermarks for Federated Learning
by: Xu, Jiahao, et al.
Published: (2025)
by: Xu, Jiahao, et al.
Published: (2025)
DeepTracer: Tracing Stolen Model via Deep Coupled Watermarks
by: Yang, Yunfei, et al.
Published: (2025)
by: Yang, Yunfei, et al.
Published: (2025)
Similar Items
-
SWAP: Towards Copyright Auditing of Soft Prompts via Sequential Watermarking
by: Yang, Wenyuan, et al.
Published: (2025) -
Self-Sovereign Agent
by: Qu, Wenjie, et al.
Published: (2026) -
DMark: Order-Agnostic Watermarking for Diffusion Large Language Models
by: Wu, Linyu, et al.
Published: (2025) -
Thought-Transfer: Indirect Targeted Poisoning Attacks on Chain-of-Thought Reasoning Models
by: Chaudhari, Harsh, et al.
Published: (2026) -
AliMark: Enhancing Robustness of Sentence-Level Watermarking Against Text Paraphrasing
by: Li, Yuexin, et al.
Published: (2026)