Saved in:
Bibliographic Details
Main Authors: Li, Zhuoling, Rahmani, Hossein, Zhang, Jiarui, Xue, Yu, Mirmehdi, Majid, Kuen, Jason, Gu, Jiuxiang, Liu, Jun
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.20470
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917355436638208
author Li, Zhuoling
Rahmani, Hossein
Zhang, Jiarui
Xue, Yu
Mirmehdi, Majid
Kuen, Jason
Gu, Jiuxiang
Liu, Jun
author_facet Li, Zhuoling
Rahmani, Hossein
Zhang, Jiarui
Xue, Yu
Mirmehdi, Majid
Kuen, Jason
Gu, Jiuxiang
Liu, Jun
contents The rapid growth of the text-to-image (T2I) community has fostered a thriving online ecosystem of expert models, which are variants of pretrained diffusion models specialized for diverse generative abilities. Yet, existing model merging methods remain limited in fully leveraging abundant online expert resources and still struggle to meet diverse in-the-wild user needs. We present DiffGraph, a novel agent-driven graph-based model merging framework, which automatically harnesses online experts and flexibly merges them for diverse user needs. Our DiffGraph constructs a scalable graph and organizes ever-expanding online experts within it through node registration and calibration. Then, DiffGraph dynamically activates specific subgraphs based on user needs, enabling flexible combinations of different experts to achieve user-desired generation. Extensive experiments show the efficacy of our method.
format Preprint
id arxiv_https___arxiv_org_abs_2603_20470
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation
Li, Zhuoling
Rahmani, Hossein
Zhang, Jiarui
Xue, Yu
Mirmehdi, Majid
Kuen, Jason
Gu, Jiuxiang
Liu, Jun
Artificial Intelligence
The rapid growth of the text-to-image (T2I) community has fostered a thriving online ecosystem of expert models, which are variants of pretrained diffusion models specialized for diverse generative abilities. Yet, existing model merging methods remain limited in fully leveraging abundant online expert resources and still struggle to meet diverse in-the-wild user needs. We present DiffGraph, a novel agent-driven graph-based model merging framework, which automatically harnesses online experts and flexibly merges them for diverse user needs. Our DiffGraph constructs a scalable graph and organizes ever-expanding online experts within it through node registration and calibration. Then, DiffGraph dynamically activates specific subgraphs based on user needs, enabling flexible combinations of different experts to achieve user-desired generation. Extensive experiments show the efficacy of our method.
title DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation
topic Artificial Intelligence
url https://arxiv.org/abs/2603.20470