Library Catalog

Bibliographic Details
Main Authors:	Kim, Hyeongjin; Kim, Sangwon; Ahn, Dasom; Lee, Jong Taek; Ko, Byoung Chul
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2405.12648

Similar Items

EQ-CBM: A Probabilistic Concept Bottleneck with Energy-based Models and Quantized Vectors
by: Kim, Sangwon, et al.
Published: (2024)

CoBELa: Steering Transparent Generation via Concept Bottlenecks on Energy Landscapes
by: Kim, Sangwon, et al.
Published: (2025)

Scene Graph-Guided Proactive Replanning for Failure-Resilient Embodied Agent
by: Yu, Che Rin, et al.
Published: (2025)

DeClotH: Decomposable 3D Cloth and Human Body Reconstruction from a Single Image
by: Nam, Hyeongjin, et al.
Published: (2025)

Generalized Consistency Trajectory Models for Image Manipulation
by: Kim, Beomsu, et al.
Published: (2024)

Mosaic: Compositional Multi-Concept Erasure via Vector Field Blending
by: Ko, Junseok, et al.
Published: (2026)

Gradient-Free Noise Optimization for Reward Alignment in Generative Models
by: Kim, Jeongsol, et al.
Published: (2026)

Boost-and-Skip: A Simple Guidance-Free Diffusion for Minority Generation
by: Um, Soobin, et al.
Published: (2025)

Latent Schrodinger Bridge: Prompting Latent Diffusion for Fast Unpaired Image-to-Image Translation
by: Kim, Jeongsol, et al.
Published: (2024)

PoseBridge: Bridging the Skeletonization Gap for Zero-Shot Skeleton-Based Action Recognition
by: Lee, Sanghyeon, et al.
Published: (2026)

Training-Free Reward-Guided Image Editing via Trajectory Optimal Control
by: Chang, Jinho, et al.
Published: (2025)

Diverse Text-to-Image Generation via Contrastive Noise Optimization
by: Kim, Byungjun, et al.
Published: (2025)

PCPO: Proportionate Credit Policy Optimization for Aligning Image Generation Models
by: Lee, Jeongjae, et al.
Published: (2025)

Unified Editing of Panorama, 3D Scenes, and Videos Through Disentangled Self-Attention Injection
by: Kwon, Gihyun, et al.
Published: (2024)

Reangle-A-Video: 4D Video Generation as Video-to-Video Translation
by: Jeong, Hyeonho, et al.
Published: (2025)

UNICORN: Ultrasound Nakagami Imaging via Score Matching and Adaptation for Assessing Hepatic Steatosis
by: Kim, Kwanyoung, et al.
Published: (2026)

Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment
by: Kim, Bryan Sangwoo, et al.
Published: (2025)

Extreme Blind Image Restoration via Prompt-Conditioned Information Bottleneck
by: Kim, Hongeun, et al.
Published: (2025)

Free$^2$Guide: Training-Free Text-to-Video Alignment using Image LVLM
by: Kim, Jaemin, et al.
Published: (2024)

FlowDPS: Flow-Driven Posterior Sampling for Inverse Problems
by: Kim, Jeongsol, et al.
Published: (2025)

Optical-Flow Guided Prompt Optimization for Coherent Video Generation
by: Nam, Hyelin, et al.
Published: (2024)

Improving Video Diffusion Transformer Training by Multi-Feature Fusion and Alignment from Self-Supervised Vision Encoders
by: Lee, Dohun, et al.
Published: (2025)

Reward Score Matching: Unifying Reward-based Fine-tuning for Flow and Diffusion Models
by: Lee, Jeongjae, et al.
Published: (2026)

Jailbreaking on Text-to-Video Models via Scene Splitting Strategy
by: Lee, Wonjun, et al.
Published: (2025)

LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation
by: Lee, Suhyeon, et al.
Published: (2023)

Adaptive Self-training Framework for Fine-grained Scene Graph Generation
by: Kim, Kibum, et al.
Published: (2024)

CRePE: Curved Ray Expectation Positional Encoding for Unified-Camera-Controlled Video Generation
by: Jin, Seonghyun, et al.
Published: (2026)

Unpaired Image-to-Image Translation via Neural Schrödinger Bridge
by: Kim, Beomsu, et al.
Published: (2023)

UNICORN: Ultrasound Nakagami Imaging via Score Matching and Adaptation
by: Kim, Kwanyoung, et al.
Published: (2024)

AVOID: The Adverse Visual Conditions Dataset with Obstacles for Driving Scene Understanding
by: Jeong, Jongoh, et al.
Published: (2025)

Align Your Tangent: Training Better Consistency Models via Manifold-Aligned Tangents
by: Kim, Beomsu, et al.
Published: (2025)

OTSeg: Multi-prompt Sinkhorn Attention for Zero-Shot Semantic Segmentation
by: Kim, Kwanyoung, et al.
Published: (2024)

MotionCFG: Boosting Motion Dynamics via Stochastic Concept Perturbation
by: Kim, Byungjun, et al.
Published: (2026)

Aligning Text to Image in Diffusion Models is Easier Than You Think
by: Lee, Jaa-Yeon, et al.
Published: (2025)

VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide
by: Lee, Dohun, et al.
Published: (2024)

TeHOR: Text-Guided 3D Human and Object Reconstruction with Textures
by: Nam, Hyeongjin, et al.
Published: (2026)

ED-NeRF: Efficient Text-Guided Editing of 3D Scene with Latent Space NeRF
by: Park, Jangho, et al.
Published: (2023)

PromptLoop: Plug-and-Play Prompt Refinement via Latent Feedback for Diffusion Model Alignment
by: Lee, Suhyeon, et al.
Published: (2025)

DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image Manipulation
by: Kim, Jeongsol, et al.
Published: (2024)

Tiled Prompts: Overcoming Prompt Misguidance in Image and Video Super-Resolution
by: Kim, Bryan Sangwoo, et al.
Published: (2026)