Guardado en:
| Autores principales: | Wu, Fan, Chen, Cheng, Fu, Zhoujie, Wei, Jiacheng, Xu, Yi, Ye, Deheng, Lin, Guosheng |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2512.02794 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
IC-World: In-Context Generation for Shared World Modeling
por: Wu, Fan, et al.
Publicado: (2025)
por: Wu, Fan, et al.
Publicado: (2025)
Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion
por: Song, Nan, et al.
Publicado: (2024)
por: Song, Nan, et al.
Publicado: (2024)
Sync4D: Video Guided Controllable Dynamics for Physics-Based 4D Generation
por: Fu, Zhoujie, et al.
Publicado: (2024)
por: Fu, Zhoujie, et al.
Publicado: (2024)
Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior
por: Chen, Cheng, et al.
Publicado: (2024)
por: Chen, Cheng, et al.
Publicado: (2024)
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
por: Fu, Zhoujie, et al.
Publicado: (2025)
por: Fu, Zhoujie, et al.
Publicado: (2025)
MVAnimate: Enhancing Character Animation with Multi-View Optimization
por: Sun, Tianyu, et al.
Publicado: (2026)
por: Sun, Tianyu, et al.
Publicado: (2026)
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
por: Huang, Siteng, et al.
Publicado: (2023)
por: Huang, Siteng, et al.
Publicado: (2023)
CustomText: Customized Textual Image Generation using Diffusion Models
por: Paliwal, Shubham, et al.
Publicado: (2024)
por: Paliwal, Shubham, et al.
Publicado: (2024)
Calligrapher: Freestyle Text Image Customization
por: Ma, Yue, et al.
Publicado: (2025)
por: Ma, Yue, et al.
Publicado: (2025)
Tuning-Free Image Customization with Image and Text Guidance
por: Li, Pengzhi, et al.
Publicado: (2024)
por: Li, Pengzhi, et al.
Publicado: (2024)
CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects
por: Wang, Zhao, et al.
Publicado: (2024)
por: Wang, Zhao, et al.
Publicado: (2024)
GroundingBooth: Grounding Text-to-Image Customization
por: Xiong, Zhexiao, et al.
Publicado: (2024)
por: Xiong, Zhexiao, et al.
Publicado: (2024)
Learning to Customize Text-to-Image Diffusion In Diverse Context
por: Kim, Taewook, et al.
Publicado: (2024)
por: Kim, Taewook, et al.
Publicado: (2024)
Interact-Custom: Customized Human Object Interaction Image Generation
por: Xu, Zhu, et al.
Publicado: (2025)
por: Xu, Zhu, et al.
Publicado: (2025)
PhyEdit: Towards Real-World Object Manipulation via Physically-Grounded Image Editing
por: Xu, Ruihang, et al.
Publicado: (2026)
por: Xu, Ruihang, et al.
Publicado: (2026)
RealCustom: Narrowing Real Text Word for Real-Time Open-Domain Text-to-Image Customization
por: Huang, Mengqi, et al.
Publicado: (2024)
por: Huang, Mengqi, et al.
Publicado: (2024)
CoMo: Compositional Motion Customization for Text-to-Video Generation
por: Xu, Youcan, et al.
Publicado: (2025)
por: Xu, Youcan, et al.
Publicado: (2025)
Customization Assistant for Text-to-image Generation
por: Zhou, Yufan, et al.
Publicado: (2023)
por: Zhou, Yufan, et al.
Publicado: (2023)
Event-Customized Image Generation
por: Wang, Zhen, et al.
Publicado: (2024)
por: Wang, Zhen, et al.
Publicado: (2024)
CustomContrast: A Multilevel Contrastive Perspective For Subject-Driven Text-to-Image Customization
por: Chen, Nan, et al.
Publicado: (2024)
por: Chen, Nan, et al.
Publicado: (2024)
Style Customization of Text-to-Vector Generation with Image Diffusion Priors
por: Zhang, Peiying, et al.
Publicado: (2025)
por: Zhang, Peiying, et al.
Publicado: (2025)
Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation
por: Wang, Yulin, et al.
Publicado: (2024)
por: Wang, Yulin, et al.
Publicado: (2024)
DECOR:Decomposition and Projection of Text Embeddings for Text-to-Image Customization
por: Jang, Geonhui, et al.
Publicado: (2024)
por: Jang, Geonhui, et al.
Publicado: (2024)
IC-Custom: Diverse Image Customization via In-Context Learning
por: Li, Yaowei, et al.
Publicado: (2025)
por: Li, Yaowei, et al.
Publicado: (2025)
FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
por: Ding, Ganggui, et al.
Publicado: (2024)
por: Ding, Ganggui, et al.
Publicado: (2024)
Generating Multi-Image Synthetic Data for Text-to-Image Customization
por: Kumari, Nupur, et al.
Publicado: (2025)
por: Kumari, Nupur, et al.
Publicado: (2025)
StyleAR: Customizing Multimodal Autoregressive Model for Style-Aligned Text-to-Image Generation
por: Wu, Yi, et al.
Publicado: (2025)
por: Wu, Yi, et al.
Publicado: (2025)
Customizing Text-to-Image Diffusion with Object Viewpoint Control
por: Kumari, Nupur, et al.
Publicado: (2024)
por: Kumari, Nupur, et al.
Publicado: (2024)
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
por: Hu, Teng, et al.
Publicado: (2025)
por: Hu, Teng, et al.
Publicado: (2025)
MotionBooth: Motion-Aware Customized Text-to-Video Generation
por: Wu, Jianzong, et al.
Publicado: (2024)
por: Wu, Jianzong, et al.
Publicado: (2024)
DreamO: A Unified Framework for Image Customization
por: Mou, Chong, et al.
Publicado: (2025)
por: Mou, Chong, et al.
Publicado: (2025)
EditID: Training-Free Editable ID Customization for Text-to-Image Generation
por: Li, Guandong, et al.
Publicado: (2025)
por: Li, Guandong, et al.
Publicado: (2025)
In-Context Learning with Unpaired Clips for Instruction-based Video Editing
por: Liao, Xinyao, et al.
Publicado: (2025)
por: Liao, Xinyao, et al.
Publicado: (2025)
Multi-Garment Customized Model Generation
por: Liu, Yichen, et al.
Publicado: (2024)
por: Liu, Yichen, et al.
Publicado: (2024)
HRPVT: High-Resolution Pyramid Vision Transformer for medium and small-scale human pose estimation
por: Xu, Zhoujie
Publicado: (2024)
por: Xu, Zhoujie
Publicado: (2024)
AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
por: Agarwal, Aishwarya, et al.
Publicado: (2024)
por: Agarwal, Aishwarya, et al.
Publicado: (2024)
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
por: Wu, Tao, et al.
Publicado: (2024)
por: Wu, Tao, et al.
Publicado: (2024)
Customized Generation Reimagined: Fidelity and Editability Harmonized
por: Jin, Jian, et al.
Publicado: (2024)
por: Jin, Jian, et al.
Publicado: (2024)
PhyPrompt: RL-based Prompt Refinement for Physically Plausible Text-to-Video Generation
por: Wu, Shang, et al.
Publicado: (2026)
por: Wu, Shang, et al.
Publicado: (2026)
Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models
por: Ren, Yixuan, et al.
Publicado: (2024)
por: Ren, Yixuan, et al.
Publicado: (2024)
Ejemplares similares
-
IC-World: In-Context Generation for Shared World Modeling
por: Wu, Fan, et al.
Publicado: (2025) -
Towards Lifelong Few-Shot Customization of Text-to-Image Diffusion
por: Song, Nan, et al.
Publicado: (2024) -
Sync4D: Video Guided Controllable Dynamics for Physics-Based 4D Generation
por: Fu, Zhoujie, et al.
Publicado: (2024) -
Sculpt3D: Multi-View Consistent Text-to-3D Generation with Sparse 3D Prior
por: Chen, Cheng, et al.
Publicado: (2024) -
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation
por: Fu, Zhoujie, et al.
Publicado: (2025)