:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Shangxun, Uh, Youngjung
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.16443
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ASemConsist: Adaptive Semantic Feature Control for Training-Free Identity-Consistent Generation
by: Kim, Shin Seong, et al.
Published: (2025)

Training-free Content Injection using h-space in Diffusion Models
by: Jeong, Jaeseok, et al.
Published: (2023)

Frequency-Adaptive Sharpness Regularization for Improving 3D Gaussian Splatting Generalization
by: Yun, Youngsik, et al.
Published: (2025)

Attribute Based Interpretable Evaluation Metrics for Generative Models
by: Kim, Dongkyun, et al.
Published: (2023)

Semantic Image Synthesis with Unconditional Generator
by: Chae, Jungwoo, et al.
Published: (2024)

Visual Style Prompting with Swapping Self-Attention
by: Jeong, Jaeseok, et al.
Published: (2024)

Syncphony: Synchronized Audio-to-Video Generation with Diffusion Transformers
by: Song, Jibin, et al.
Published: (2025)

HARIVO: Harnessing Text-to-Image Models for Video Generation
by: Kwon, Mingi, et al.
Published: (2024)

FlowBlending: Stage-Aware Multi-Model Sampling for Fast and High-Fidelity Video Generation
by: Song, Jibin, et al.
Published: (2025)

TetraSDF: Precise Mesh Extraction with Multi-resolution Tetrahedral Grid
by: Oh, Seonghun, et al.
Published: (2025)

Per-Gaussian Embedding-Based Deformation for Deformable 3D Gaussian Splatting
by: Bae, Jeongmin, et al.
Published: (2024)

StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance
by: Jeong, Jaeseok, et al.
Published: (2025)

MVCustom: Multi-View Customized Diffusion via Geometric Latent Rendering and Completion
by: Shin, Minjung, et al.
Published: (2025)

Eye-for-an-eye: Appearance Transfer with Semantic Correspondence in Diffusion Models
by: Go, Sooyeon, et al.
Published: (2024)

Sync-NeRF: Generalizing Dynamic NeRFs to Unsynchronized Videos
by: Kim, Seoha, et al.
Published: (2023)

Rethinking Open-Vocabulary Segmentation of Radiance Fields in 3D Space
by: Lee, Hyunjee, et al.
Published: (2024)

CoDi: Subject-Consistent and Pose-Diverse Text-to-Image Generation
by: Gao, Zhanxin, et al.
Published: (2025)

DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation
by: Chen, Hong, et al.
Published: (2023)

Disentangling to Re-couple: Resolving the Similarity-Controllability Paradox in Subject-Driven Text-to-Image Generation
by: Li, Shuang, et al.
Published: (2026)

TCFG: Tangential Damping Classifier-free Guidance
by: Kwon, Mingi, et al.
Published: (2025)

Contrastive Prompts Improve Disentanglement in Text-to-Image Diffusion Models
by: Wu, Chen, et al.
Published: (2024)

TIT-Score: Evaluating Long-Prompt Based Text-to-Image Alignment via Text-to-Image-to-Text Consistency
by: Wang, Juntong, et al.
Published: (2025)

Geometrical Properties of Text Token Embeddings for Strong Semantic Binding in Text-to-Image Generation
by: Seo, Hoigi, et al.
Published: (2025)

JAM-Flow: Joint Audio-Motion Synthesis with Flow Matching
by: Kwon, Mingi, et al.
Published: (2025)

Prompt-Softbox-Prompt: A Free-Text Embedding Control for Image Editing
by: Yang, Yitong, et al.
Published: (2024)

Optimizing Prompts for Text-to-Image Generation
by: Hao, Yaru, et al.
Published: (2022)

One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt
by: Liu, Tao, et al.
Published: (2025)

Compensating Spatiotemporally Inconsistent Observations for Online Dynamic 3D Gaussian Splatting
by: Yun, Youngsik, et al.
Published: (2025)

AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation
by: He, Junjie, et al.
Published: (2025)

4D Scaffold Gaussian Splatting with Dynamic-Aware Anchor Growing for Efficient and High-Fidelity Dynamic Scene Reconstruction
by: Cho, Woong Oh, et al.
Published: (2024)

Improving Text-to-Image Consistency via Automatic Prompt Optimization
by: Mañas, Oscar, et al.
Published: (2024)

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
by: Huang, Siteng, et al.
Published: (2023)

TIPO: Text to Image with Text Presampling for Prompt Optimization
by: Yeh, Shih-Ying, et al.
Published: (2024)

PromptSafe: Gated Prompt Tuning for Safe Text-to-Image Generation
by: Jing, Zonglei, et al.
Published: (2025)

TextTIGER: Text-based Intelligent Generation with Entity Prompt Refinement for Text-to-Image Generation
by: Ozaki, Shintaro, et al.
Published: (2025)

Prompt Refinement with Image Pivot for Text-to-Image Generation
by: Zhan, Jingtao, et al.
Published: (2024)

StorySync: Training-Free Subject Consistency in Text-to-Image Generation via Region Harmonization
by: Gaur, Gopalji, et al.
Published: (2025)

End-to-end Training for Text-to-Image Synthesis using Dual-Text Embeddings
by: Ahmed, Yeruru Asrar, et al.
Published: (2025)

Deep Geometric Moments Promote Shape Consistency in Text-to-3D Generation
by: Nath, Utkarsh, et al.
Published: (2024)

SceneBooth: Diffusion-based Framework for Subject-preserved Text-to-Image Generation
by: Chai, Shang, et al.
Published: (2025)