Saved in:
| Main Authors: | Kim, Hyeongjin, Kim, Sangwon, Ahn, Dasom, Lee, Jong Taek, Ko, Byoung Chul |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.12648 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
EQ-CBM: A Probabilistic Concept Bottleneck with Energy-based Models and Quantized Vectors
by: Kim, Sangwon, et al.
Published: (2024)
by: Kim, Sangwon, et al.
Published: (2024)
CoBELa: Steering Transparent Generation via Concept Bottlenecks on Energy Landscapes
by: Kim, Sangwon, et al.
Published: (2025)
by: Kim, Sangwon, et al.
Published: (2025)
Scene Graph-Guided Proactive Replanning for Failure-Resilient Embodied Agent
by: Yu, Che Rin, et al.
Published: (2025)
by: Yu, Che Rin, et al.
Published: (2025)
DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image
by: Nam, Hyeongjin, et al.
Published: (2025)
by: Nam, Hyeongjin, et al.
Published: (2025)
Generalized Consistency Trajectory Models for Image Manipulation
by: Kim, Beomsu, et al.
Published: (2024)
by: Kim, Beomsu, et al.
Published: (2024)
Mosaic: Compositional Multi-Concept Erasure via Vector Field Blending
by: Ko, Junseok, et al.
Published: (2026)
by: Ko, Junseok, et al.
Published: (2026)
Gradient-Free Noise Optimization for Reward Alignment in Generative Models
by: Kim, Jeongsol, et al.
Published: (2026)
by: Kim, Jeongsol, et al.
Published: (2026)
Boost-and-Skip: A Simple Guidance-Free Diffusion for Minority Generation
by: Um, Soobin, et al.
Published: (2025)
by: Um, Soobin, et al.
Published: (2025)
Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation
by: Kim, Jeongsol, et al.
Published: (2024)
by: Kim, Jeongsol, et al.
Published: (2024)
PoseBridge: Bridging the Skeletonization Gap for Zero-Shot Skeleton-Based Action Recognition
by: Lee, Sanghyeon, et al.
Published: (2026)
by: Lee, Sanghyeon, et al.
Published: (2026)
Training-Free Reward-Guided Image Editing via Trajectory Optimal Control
by: Chang, Jinho, et al.
Published: (2025)
by: Chang, Jinho, et al.
Published: (2025)
Diverse Text-to-Image Generation via Contrastive Noise Optimization
by: Kim, Byungjun, et al.
Published: (2025)
by: Kim, Byungjun, et al.
Published: (2025)
PCPO: Proportionate Credit Policy Optimization for Aligning Image Generation Models
by: Lee, Jeongjae, et al.
Published: (2025)
by: Lee, Jeongjae, et al.
Published: (2025)
Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection
by: Kwon, Gihyun, et al.
Published: (2024)
by: Kwon, Gihyun, et al.
Published: (2024)
Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
by: Jeong, Hyeonho, et al.
Published: (2025)
by: Jeong, Hyeonho, et al.
Published: (2025)
UNICORN: Ultrasound Nakagami Imaging via Score Matching and Adaptation for Assessing Hepatic Steatosis
by: Kim, Kwanyoung, et al.
Published: (2026)
by: Kim, Kwanyoung, et al.
Published: (2026)
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment
by: Kim, Bryan Sangwoo, et al.
Published: (2025)
by: Kim, Bryan Sangwoo, et al.
Published: (2025)
Extreme Blind Image Restoration via Prompt-Conditioned Information Bottleneck
by: Kim, Hongeun, et al.
Published: (2025)
by: Kim, Hongeun, et al.
Published: (2025)
Free$^2$Guide: Training-Free Text-to-Video Alignment using Image LVLM
by: Kim, Jaemin, et al.
Published: (2024)
by: Kim, Jaemin, et al.
Published: (2024)
FlowDPS: Flow-Driven Posterior Sampling for Inverse Problems
by: Kim, Jeongsol, et al.
Published: (2025)
by: Kim, Jeongsol, et al.
Published: (2025)
Optical-Flow Guided Prompt Optimization for Coherent Video Generation
by: Nam, Hyelin, et al.
Published: (2024)
by: Nam, Hyelin, et al.
Published: (2024)
Improving Video Diffusion Transformer Training by Multi-Feature Fusion and Alignment from Self-Supervised Vision Encoders
by: Lee, Dohun, et al.
Published: (2025)
by: Lee, Dohun, et al.
Published: (2025)
Reward Score Matching: Unifying Reward-based Fine-tuning for Flow and Diffusion Models
by: Lee, Jeongjae, et al.
Published: (2026)
by: Lee, Jeongjae, et al.
Published: (2026)
Jailbreaking on Text-to-Video Models via Scene Splitting Strategy
by: Lee, Wonjun, et al.
Published: (2025)
by: Lee, Wonjun, et al.
Published: (2025)
LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation
by: Lee, Suhyeon, et al.
Published: (2023)
by: Lee, Suhyeon, et al.
Published: (2023)
Adaptive Self-training Framework for Fine-grained Scene Graph Generation
by: Kim, Kibum, et al.
Published: (2024)
by: Kim, Kibum, et al.
Published: (2024)
CRePE: Curved Ray Expectation Positional Encoding for Unified-Camera-Controlled Video Generation
by: Jin, Seonghyun, et al.
Published: (2026)
by: Jin, Seonghyun, et al.
Published: (2026)
Unpaired Image-to-Image Translation via Neural Schrödinger Bridge
by: Kim, Beomsu, et al.
Published: (2023)
by: Kim, Beomsu, et al.
Published: (2023)
UNICORN: Ultrasound Nakagami Imaging via Score Matching and Adaptation
by: Kim, Kwanyoung, et al.
Published: (2024)
by: Kim, Kwanyoung, et al.
Published: (2024)
AVOID: The Adverse Visual Conditions Dataset with Obstacles for Driving Scene Understanding
by: Jeong, Jongoh, et al.
Published: (2025)
by: Jeong, Jongoh, et al.
Published: (2025)
Align Your Tangent: Training Better Consistency Models via Manifold-Aligned Tangents
by: Kim, Beomsu, et al.
Published: (2025)
by: Kim, Beomsu, et al.
Published: (2025)
OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation
by: Kim, Kwanyoung, et al.
Published: (2024)
by: Kim, Kwanyoung, et al.
Published: (2024)
MotionCFG: Boosting Motion Dynamics via Stochastic Concept Perturbation
by: Kim, Byungjun, et al.
Published: (2026)
by: Kim, Byungjun, et al.
Published: (2026)
Aligning Text to Image in Diffusion Models is Easier Than You Think
by: Lee, Jaa-Yeon, et al.
Published: (2025)
by: Lee, Jaa-Yeon, et al.
Published: (2025)
VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide
by: Lee, Dohun, et al.
Published: (2024)
by: Lee, Dohun, et al.
Published: (2024)
TeHOR: Text-Guided 3D Human and Object Reconstruction with Textures
by: Nam, Hyeongjin, et al.
Published: (2026)
by: Nam, Hyeongjin, et al.
Published: (2026)
ED-NeRF: Efficient Text-Guided Editing of 3D Scene with Latent Space NeRF
by: Park, Jangho, et al.
Published: (2023)
by: Park, Jangho, et al.
Published: (2023)
PromptLoop: Plug-and-Play Prompt Refinement via Latent Feedback for Diffusion Model Alignment
by: Lee, Suhyeon, et al.
Published: (2025)
by: Lee, Suhyeon, et al.
Published: (2025)
DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation
by: Kim, Jeongsol, et al.
Published: (2024)
by: Kim, Jeongsol, et al.
Published: (2024)
Tiled Prompts: Overcoming Prompt Misguidance in Image and Video Super-Resolution
by: Kim, Bryan Sangwoo, et al.
Published: (2026)
by: Kim, Bryan Sangwoo, et al.
Published: (2026)
Similar Items
-
EQ-CBM: A Probabilistic Concept Bottleneck with Energy-based Models and Quantized Vectors
by: Kim, Sangwon, et al.
Published: (2024) -
CoBELa: Steering Transparent Generation via Concept Bottlenecks on Energy Landscapes
by: Kim, Sangwon, et al.
Published: (2025) -
Scene Graph-Guided Proactive Replanning for Failure-Resilient Embodied Agent
by: Yu, Che Rin, et al.
Published: (2025) -
DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image
by: Nam, Hyeongjin, et al.
Published: (2025) -
Generalized Consistency Trajectory Models for Image Manipulation
by: Kim, Beomsu, et al.
Published: (2024)