Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhang, Guokai, Wang, Lanjun, Su, Yuting, Liu, An-An
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2404.05607
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918158343864320
author	Zhang, Guokai Wang, Lanjun Su, Yuting Liu, An-An
author_facet	Zhang, Guokai Wang, Lanjun Su, Yuting Liu, An-An
contents	Today, the family of latent diffusion models (LDMs) has gained prominence for its high quality outputs and scalability. This has also raised security concerns on social media, as malicious users can create and disseminate harmful content. Existing approaches typically involve training specific components or entire generative models to embed a watermark in generated images for traceability and responsibility. However, in the fast-evolving era of AI-generated content (AIGC), the rapid iteration and modification of LDMs makes retraining with watermark models costly. To address the problem, we propose MarkPlugger, a generalizable plug-and-play watermark framework without LDM retraining. In particular, to reduce the disturbance of the watermark on the semantics of the generated image, we try to identify a watermark representation that is approaching orthogonal to the semantic in latent space, and apply an additive fusion strategy for the watermark and the semantic. Without modifying any components of the LDMs, we embed diverse watermarks in latent space, adapting to the denoising process. Our experimental findings reveal that our method effectively harmonizes image quality and watermark recovery rate. We also have validated that our method is generalized to multiple official versions and modified variants of LDMs, even without retraining the watermark model. Furthermore, it performs robustly under various attacks of different intensities.
format	Preprint
id	arxiv_https___arxiv_org_abs_2404_05607
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	MarkPlugger: Generalizable Watermark Framework for Latent Diffusion Models without Retraining Zhang, Guokai Wang, Lanjun Su, Yuting Liu, An-An Computer Vision and Pattern Recognition Today, the family of latent diffusion models (LDMs) has gained prominence for its high quality outputs and scalability. This has also raised security concerns on social media, as malicious users can create and disseminate harmful content. Existing approaches typically involve training specific components or entire generative models to embed a watermark in generated images for traceability and responsibility. However, in the fast-evolving era of AI-generated content (AIGC), the rapid iteration and modification of LDMs makes retraining with watermark models costly. To address the problem, we propose MarkPlugger, a generalizable plug-and-play watermark framework without LDM retraining. In particular, to reduce the disturbance of the watermark on the semantics of the generated image, we try to identify a watermark representation that is approaching orthogonal to the semantic in latent space, and apply an additive fusion strategy for the watermark and the semantic. Without modifying any components of the LDMs, we embed diverse watermarks in latent space, adapting to the denoising process. Our experimental findings reveal that our method effectively harmonizes image quality and watermark recovery rate. We also have validated that our method is generalized to multiple official versions and modified variants of LDMs, even without retraining the watermark model. Furthermore, it performs robustly under various attacks of different intensities.
title	MarkPlugger: Generalizable Watermark Framework for Latent Diffusion Models without Retraining
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2404.05607

Similar Items