Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Li, Zhuoling, Rahmani, Hossein, Zhang, Jiarui, Xue, Yu, Mirmehdi, Majid, Kuen, Jason, Gu, Jiuxiang, Liu, Jun
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2603.20470
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917355436638208
author	Li, Zhuoling Rahmani, Hossein Zhang, Jiarui Xue, Yu Mirmehdi, Majid Kuen, Jason Gu, Jiuxiang Liu, Jun
author_facet	Li, Zhuoling Rahmani, Hossein Zhang, Jiarui Xue, Yu Mirmehdi, Majid Kuen, Jason Gu, Jiuxiang Liu, Jun
contents	The rapid growth of the text-to-image (T2I) community has fostered a thriving online ecosystem of expert models, which are variants of pretrained diffusion models specialized for diverse generative abilities. Yet, existing model merging methods remain limited in fully leveraging abundant online expert resources and still struggle to meet diverse in-the-wild user needs. We present DiffGraph, a novel agent-driven graph-based model merging framework, which automatically harnesses online experts and flexibly merges them for diverse user needs. Our DiffGraph constructs a scalable graph and organizes ever-expanding online experts within it through node registration and calibration. Then, DiffGraph dynamically activates specific subgraphs based on user needs, enabling flexible combinations of different experts to achieve user-desired generation. Extensive experiments show the efficacy of our method.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_20470
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation Li, Zhuoling Rahmani, Hossein Zhang, Jiarui Xue, Yu Mirmehdi, Majid Kuen, Jason Gu, Jiuxiang Liu, Jun Artificial Intelligence The rapid growth of the text-to-image (T2I) community has fostered a thriving online ecosystem of expert models, which are variants of pretrained diffusion models specialized for diverse generative abilities. Yet, existing model merging methods remain limited in fully leveraging abundant online expert resources and still struggle to meet diverse in-the-wild user needs. We present DiffGraph, a novel agent-driven graph-based model merging framework, which automatically harnesses online experts and flexibly merges them for diverse user needs. Our DiffGraph constructs a scalable graph and organizes ever-expanding online experts within it through node registration and calibration. Then, DiffGraph dynamically activates specific subgraphs based on user needs, enabling flexible combinations of different experts to achieve user-desired generation. Extensive experiments show the efficacy of our method.
title	DiffGraph: An Automated Agent-driven Model Merging Framework for In-the-Wild Text-to-Image Generation
topic	Artificial Intelligence
url	https://arxiv.org/abs/2603.20470

Similar Items