Saved in:
Bibliographic Details
Main Authors: Ye, Liang, Chen, Shengqin, Dai, Jiazhu
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.20792
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917430727540736
author Ye, Liang
Chen, Shengqin
Dai, Jiazhu
author_facet Ye, Liang
Chen, Shengqin
Dai, Jiazhu
contents The rapid progress of graph generation has raised new security concerns, particularly regarding backdoor vulnerabilities. Though prior work has explored backdoor attacks against diffusion models for image or unconditional graph generation, those against conditional graph generation models, especially text-guided graph generation models, remain largely unexamined. This paper proposes BadGraph, a backdoor attack method against latent diffusion models for text-guided graph generation. BadGraph leverages textual triggers to poison training data, covertly implanting backdoors that induce attacker-specified subgraphs during inference when triggers appear, while preserving normal performance on clean inputs. Extensive experiments on four benchmark datasets (PubChem, ChEBI-20, PCDes, MoMu) demonstrate the effectiveness and stealth of the attack: a poisoning rate of less than 10% can achieve a 50% attack success rate, while 24% suffices for over an 80% success rate, with negligible performance degradation on benign samples. Ablation studies further reveal that the backdoor is implanted during VAE and diffusion training rather than pretraining. These findings reveal the security vulnerabilities in latent diffusion models for text-guided graph generation, highlight the serious risks in applications such as drug discovery, and underscore the need for robust defenses against the backdoor attack in such diffusion models.
format Preprint
id arxiv_https___arxiv_org_abs_2510_20792
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle BadGraph: A Backdoor Attack Against Latent Diffusion Model for Text-Guided Graph Generation
Ye, Liang
Chen, Shengqin
Dai, Jiazhu
Machine Learning
Computation and Language
Biomolecules
The rapid progress of graph generation has raised new security concerns, particularly regarding backdoor vulnerabilities. Though prior work has explored backdoor attacks against diffusion models for image or unconditional graph generation, those against conditional graph generation models, especially text-guided graph generation models, remain largely unexamined. This paper proposes BadGraph, a backdoor attack method against latent diffusion models for text-guided graph generation. BadGraph leverages textual triggers to poison training data, covertly implanting backdoors that induce attacker-specified subgraphs during inference when triggers appear, while preserving normal performance on clean inputs. Extensive experiments on four benchmark datasets (PubChem, ChEBI-20, PCDes, MoMu) demonstrate the effectiveness and stealth of the attack: a poisoning rate of less than 10% can achieve a 50% attack success rate, while 24% suffices for over an 80% success rate, with negligible performance degradation on benign samples. Ablation studies further reveal that the backdoor is implanted during VAE and diffusion training rather than pretraining. These findings reveal the security vulnerabilities in latent diffusion models for text-guided graph generation, highlight the serious risks in applications such as drug discovery, and underscore the need for robust defenses against the backdoor attack in such diffusion models.
title BadGraph: A Backdoor Attack Against Latent Diffusion Model for Text-Guided Graph Generation
topic Machine Learning
Computation and Language
Biomolecules
url https://arxiv.org/abs/2510.20792