Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Fang, ZhongLi, Xie, Yu, Chen, Ping
Formato:	Preprint
Publicado:	2025
Materias:	Multimedia
Acceso en línea:	https://arxiv.org/abs/2504.02640
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866918042594705408
author	Fang, ZhongLi Xie, Yu Chen, Ping
author_facet	Fang, ZhongLi Xie, Yu Chen, Ping
contents	Current image watermarking technologies are predominantly categorized into text watermarking techniques and image steganography; however, few methods can simultaneously handle text and image-based watermark data, which limits their applicability in complex digital environments. This paper introduces an innovative multi-modal watermarking approach, drawing on the concept of vector discretization in encoder-based vector quantization. By constructing adjacency matrices, the proposed method enables the transformation of text watermarks into robust image-based representations, providing a novel multi-modal watermarking paradigm for image generation applications. Additionally, this study presents a newly designed image restoration module to mitigate image degradation caused by transmission losses and various noise interferences, thereby ensuring the reliability and integrity of the watermark. Experimental results validate the robustness of the method under multiple noise attacks, providing a secure, scalable, and efficient solution for digital image copyright protection.
format	Preprint
id	arxiv_https___arxiv_org_abs_2504_02640
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	RoSMM: A Robust and Secure Multi-Modal Watermarking Framework for Diffusion Models Fang, ZhongLi Xie, Yu Chen, Ping Multimedia Current image watermarking technologies are predominantly categorized into text watermarking techniques and image steganography; however, few methods can simultaneously handle text and image-based watermark data, which limits their applicability in complex digital environments. This paper introduces an innovative multi-modal watermarking approach, drawing on the concept of vector discretization in encoder-based vector quantization. By constructing adjacency matrices, the proposed method enables the transformation of text watermarks into robust image-based representations, providing a novel multi-modal watermarking paradigm for image generation applications. Additionally, this study presents a newly designed image restoration module to mitigate image degradation caused by transmission losses and various noise interferences, thereby ensuring the reliability and integrity of the watermark. Experimental results validate the robustness of the method under multiple noise attacks, providing a secure, scalable, and efficient solution for digital image copyright protection.
title	RoSMM: A Robust and Secure Multi-Modal Watermarking Framework for Diffusion Models
topic	Multimedia
url	https://arxiv.org/abs/2504.02640

Ejemplares similares