:: Library Catalog

Bibliographic Details
Main Authors:	Kim, Taewook; Chen, Wei; Qiu, Qiang
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2410.10058

Similar Items

Conditional Text-to-Image Generation with Reference Guidance
by: Kim, Taewook, et al.
Published: (2024)

IC-Custom: Diverse Image Customization via In-Context Learning
by: Li, Yaowei, et al.
Published: (2025)

Tuning-Free Image Customization with Image and Text Guidance
by: Li, Pengzhi, et al.
Published: (2024)

Customizing Text-to-Image Diffusion with Object Viewpoint Control
by: Kumari, Nupur, et al.
Published: (2024)

PhyCustom: Towards Realistic Physical Customization in Text-to-Image Generation
by: Wu, Fan, et al.
Published: (2025)

DECOR: Decomposition and Projection of Text Embeddings for Text-to-Image Customization
by: Jang, Geonhui, et al.
Published: (2024)

CustomText: Customized Textual Image Generation using Diffusion Models
by: Paliwal, Shubham, et al.
Published: (2024)

Text Embedding is Not All You Need: Attention Control for Text-to-Image Semantic Alignment with Text Self-Attention Maps
by: Kim, Jeeyung, et al.
Published: (2024)

Calligrapher: Freestyle Text Image Customization
by: Ma, Yue, et al.
Published: (2025)

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
by: Huang, Siteng, et al.
Published: (2023)

Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting
by: Zeng, Weili, et al.
Published: (2024)

GroundingBooth: Grounding Text-to-Image Customization
by: Xiong, Zhexiao, et al.
Published: (2024)

Direct Consistency Optimization for Robust Customization of Text-to-Image Diffusion Models
by: Lee, Kyungmin, et al.
Published: (2024)

How to Continually Adapt Text-to-Image Diffusion Models for Flexible Customization?
by: Dong, Jiahua, et al.
Published: (2024)

Style Customization of Text-to-Vector Generation with Image Diffusion Priors
by: Zhang, Peiying, et al.
Published: (2025)

Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion
by: Song, Nan, et al.
Published: (2024)

Design and Identification of Keypoint Patches in Unstructured Environments
by: Park, Taewook, et al.
Published: (2024)

CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization
by: Chen, Nan, et al.
Published: (2024)

Harmonizing Visual and Textual Embeddings for Zero-Shot Text-to-Image Customization
by: Song, Yeji, et al.
Published: (2024)

Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models
by: Ren, Yixuan, et al.
Published: (2024)

RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
by: Huang, Mengqi, et al.
Published: (2024)

Model-Agnostic Human Preference Inversion in Diffusion Models
by: Kim, Jeeyung, et al.
Published: (2024)

SimAC: A Simple Anti-Customization Method for Protecting Face Privacy against Text-to-Image Synthesis of Diffusion Models
by: Wang, Feifei, et al.
Published: (2023)

Separate Motion from Appearance: Customizing Motion via Customizing Text-to-Video Diffusion Models
by: Liu, Huijie, et al.
Published: (2025)

Int3DNet: Scene-Motion Cross Attention Network for 3D Intention Prediction in Mixed Reality
by: Ha, Taewook, et al.
Published: (2026)

Exploring Diverse In-Context Configurations for Image Captioning
by: Yang, Xu, et al.
Published: (2023)

Ingredients: Blending Custom Photos with Video Diffusion Transformers
by: Fei, Zhengcong, et al.
Published: (2025)

Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects
by: Qiu, Weimin, et al.
Published: (2024)

Semantic Anchoring for Robust Personalization in Text-to-Image Diffusion Models
by: Yang, Seoyun, et al.
Published: (2025)

Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA
by: Smith, James Seale, et al.
Published: (2023)

Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models
by: Gong, Chao, et al.
Published: (2024)

AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
by: Agarwal, Aishwarya, et al.
Published: (2024)

PIDiff: Image Customization for Personalized Identities with Diffusion Models
by: Gu, Jinyu, et al.
Published: (2025)

CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization
by: Wu, Feize, et al.
Published: (2024)

ECNet: Effective Controllable Text-to-Image Diffusion Models
by: Li, Sicheng, et al.
Published: (2024)

Learning to Sample Effective and Diverse Prompts for Text-to-Image Generation
by: Yun, Taeyoung, et al.
Published: (2025)

KRETA: A Benchmark for Korean Reading and Reasoning in Text-Rich VQA Attuned to Diverse Visual Contexts
by: Hwang, Taebaek, et al.
Published: (2025)

Local Conditional Controlling for Text-to-Image Diffusion Models
by: Zhao, Yibo, et al.
Published: (2023)

MoTrans: Customized Motion Transfer with Text-driven Video Diffusion Models
by: Li, Xiaomin, et al.
Published: (2024)

VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models
by: Huang, Chi-Pin, et al.
Published: (2025)