Saved in:
Bibliographic Details
Main Authors: Zhang, Guokai, Wang, Lanjun, Su, Yuting, Liu, An-An
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.05607
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918158343864320
author Zhang, Guokai
Wang, Lanjun
Su, Yuting
Liu, An-An
author_facet Zhang, Guokai
Wang, Lanjun
Su, Yuting
Liu, An-An
contents Today, the family of latent diffusion models (LDMs) has gained prominence for its high quality outputs and scalability. This has also raised security concerns on social media, as malicious users can create and disseminate harmful content. Existing approaches typically involve training specific components or entire generative models to embed a watermark in generated images for traceability and responsibility. However, in the fast-evolving era of AI-generated content (AIGC), the rapid iteration and modification of LDMs makes retraining with watermark models costly. To address the problem, we propose MarkPlugger, a generalizable plug-and-play watermark framework without LDM retraining. In particular, to reduce the disturbance of the watermark on the semantics of the generated image, we try to identify a watermark representation that is approaching orthogonal to the semantic in latent space, and apply an additive fusion strategy for the watermark and the semantic. Without modifying any components of the LDMs, we embed diverse watermarks in latent space, adapting to the denoising process. Our experimental findings reveal that our method effectively harmonizes image quality and watermark recovery rate. We also have validated that our method is generalized to multiple official versions and modified variants of LDMs, even without retraining the watermark model. Furthermore, it performs robustly under various attacks of different intensities.
format Preprint
id arxiv_https___arxiv_org_abs_2404_05607
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle MarkPlugger: Generalizable Watermark Framework for Latent Diffusion Models without Retraining
Zhang, Guokai
Wang, Lanjun
Su, Yuting
Liu, An-An
Computer Vision and Pattern Recognition
Today, the family of latent diffusion models (LDMs) has gained prominence for its high quality outputs and scalability. This has also raised security concerns on social media, as malicious users can create and disseminate harmful content. Existing approaches typically involve training specific components or entire generative models to embed a watermark in generated images for traceability and responsibility. However, in the fast-evolving era of AI-generated content (AIGC), the rapid iteration and modification of LDMs makes retraining with watermark models costly. To address the problem, we propose MarkPlugger, a generalizable plug-and-play watermark framework without LDM retraining. In particular, to reduce the disturbance of the watermark on the semantics of the generated image, we try to identify a watermark representation that is approaching orthogonal to the semantic in latent space, and apply an additive fusion strategy for the watermark and the semantic. Without modifying any components of the LDMs, we embed diverse watermarks in latent space, adapting to the denoising process. Our experimental findings reveal that our method effectively harmonizes image quality and watermark recovery rate. We also have validated that our method is generalized to multiple official versions and modified variants of LDMs, even without retraining the watermark model. Furthermore, it performs robustly under various attacks of different intensities.
title MarkPlugger: Generalizable Watermark Framework for Latent Diffusion Models without Retraining
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2404.05607