| Main Authors: | Li, Shangxun, Uh, Youngjung |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2512.16443 |
Similar Items
ASemConsist: Adaptive Semantic Feature Control for Training-Free Identity-Consistent Generation
by: Kim, Shin Seong, et al.
Published: (2025)
Training-free Content Injection using h-space in Diffusion Models
by: Jeong, Jaeseok, et al.
Published: (2023)
Frequency-Adaptive Sharpness Regularization for Improving 3D Gaussian Splatting Generalization
by: Yun, Youngsik, et al.
Published: (2025)
Attribute Based Interpretable Evaluation Metrics for Generative Models
by: Kim, Dongkyun, et al.
Published: (2023)
Semantic Image Synthesis with Unconditional Generator
by: Chae, Jungwoo, et al.
Published: (2024)
Visual Style Prompting with Swapping Self-Attention
by: Jeong, Jaeseok, et al.
Published: (2024)
Syncphony: Synchronized Audio-to-Video Generation with Diffusion Transformers
by: Song, Jibin, et al.
Published: (2025)
HARIVO: Harnessing Text-to-Image Models for Video Generation
by: Kwon, Mingi, et al.
Published: (2024)
FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation
by: Song, Jibin, et al.
Published: (2025)
TetraSDF: Precise Mesh Extraction with Multi-resolution Tetrahedral Grid
by: Oh, Seonghun, et al.
Published: (2025)
Per-Gaussian Embedding-Based Deformation for Deformable 3D Gaussian Splatting
by: Bae, Jeongmin, et al.
Published: (2024)
StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance
by: Jeong, Jaeseok, et al.
Published: (2025)
MVCustom: Multi-View Customized Diffusion via Geometric Latent Rendering and Completion
by: Shin, Minjung, et al.
Published: (2025)
Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models
by: Go, Sooyeon, et al.
Published: (2024)
Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos
by: Kim, Seoha, et al.
Published: (2023)
Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space
by: Lee, Hyunjee, et al.
Published: (2024)
CoDi: Subject-Consistent and Pose-Diverse Text-to-Image Generation
by: Gao, Zhanxin, et al.
Published: (2025)
DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation
by: Chen, Hong, et al.
Published: (2023)
Disentangling to Re-couple: Resolving the Similarity-Controllability Paradox in Subject-Driven Text-to-Image Generation
by: Li, Shuang, et al.
Published: (2026)
TCFG: Tangential Damping Classifier-free Guidance
by: Kwon, Mingi, et al.
Published: (2025)
Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models
by: Wu, Chen, et al.
Published: (2024)
TIT-Score: Evaluating Long-Prompt Based Text-to-Image Alignment via Text-to-Image-to-Text Consistency
by: Wang, Juntong, et al.
Published: (2025)
Geometrical Properties of Text Token Embeddings for Strong Semantic Binding in Text-to-Image Generation
by: Seo, Hoigi, et al.
Published: (2025)
JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching
by: Kwon, Mingi, et al.
Published: (2025)
Prompt-Softbox-Prompt: A Free-Text Embedding Control for Image Editing
by: Yang, Yitong, et al.
Published: (2024)
Optimizing Prompts for Text-to-Image Generation
by: Hao, Yaru, et al.
Published: (2022)
One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
by: Liu, Tao, et al.
Published: (2025)
Compensating Spatiotemporally Inconsistent Observations for Online Dynamic 3D Gaussian Splatting
by: Yun, Youngsik, et al.
Published: (2025)
AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation
by: He, Junjie, et al.
Published: (2025)
4D Scaffold Gaussian Splatting with Dynamic-Aware Anchor Growing for Efficient and High-Fidelity Dynamic Scene Reconstruction
by: Cho, Woong Oh, et al.
Published: (2024)
Improving Text-to-Image Consistency via Automatic Prompt Optimization
by: Mañas, Oscar, et al.
Published: (2024)
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
by: Huang, Siteng, et al.
Published: (2023)
TIPO: Text to Image with Text Presampling for Prompt Optimization
by: Yeh, Shih-Ying, et al.
Published: (2024)
PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation
by: Jing, Zonglei, et al.
Published: (2025)
TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation
by: Ozaki, Shintaro, et al.
Published: (2025)
Prompt Refinement with Image Pivot for Text-to-Image Generation
by: Zhan, Jingtao, et al.
Published: (2024)
StorySync: Training-Free Subject Consistency in Text-to-Image Generation via Region Harmonization
by: Gaur, Gopalji, et al.
Published: (2025)
End-to-end Training for Text-to-Image Synthesis using Dual-Text Embeddings
by: Ahmed, Yeruru Asrar, et al.
Published: (2025)
Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
by: Nath, Utkarsh, et al.
Published: (2024)
SceneBooth: Diffusion-based Framework for Subject-preserved Text-to-Image Generation
by: Chai, Shang, et al.
Published: (2025)