:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Koo, Juil, Choi, Daehyeon, Youn, Sangwoo, Lee, Phillip Y., Sung, Minhyuk
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.13250
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Posterior Distillation Sampling
by: Koo, Juil, et al.
Published: (2023)

Token Warping Helps MLLMs Look from Nearby Viewpoints
by: Lee, Phillip Y., et al.
Published: (2026)

ReGround: Improving Textual and Spatial Grounding at No Cost
by: Lee, Phillip Y., et al.
Published: (2024)

Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses
by: Yoo, Seungwoo, et al.
Published: (2024)

SyncTweedies: A General Generative Framework Based on Synchronized Diffusions
by: Kim, Jaihoon, et al.
Published: (2024)

ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
by: Min, Yunhong, et al.
Published: (2025)

SALAD: Part-Level Latent Diffusion for 3D Shape Generation and Manipulation
by: Koo, Juil, et al.
Published: (2023)

GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation
by: Lee, Phillip Y., et al.
Published: (2024)

BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity
by: Koo, Juil, et al.
Published: (2026)

VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors
by: Koo, Juil, et al.
Published: (2025)

DiverseVAR: Balancing Diversity and Quality of Next-Scale Visual Autoregressive Models
by: Park, Mingue, et al.
Published: (2025)

Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
by: Phunyaphibarn, Prin, et al.
Published: (2025)

DiffusionRollout: Uncertainty-Aware Rollout Planning in Long-Horizon PDE Solving
by: Yoo, Seungwoo, et al.
Published: (2026)

Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
by: Lee, Phillip Y., et al.
Published: (2025)

OASIS: Online Sample Selection for Continual Visual Instruction Tuning
by: Lee, Minjae, et al.
Published: (2025)

InstantDrag: Improving Interactivity in Drag-based Image Editing
by: Shin, Joonghyuk, et al.
Published: (2024)

PartSTAD: 2D-to-3D Part Segmentation Task Adaptation
by: Kim, Hyunjin, et al.
Published: (2024)

MV2Cyl: Reconstructing 3D Extrusion Cylinders from Multi-View Images
by: Hong, Eunji, et al.
Published: (2024)

Attention Misses Visual Risk: Risk-Adaptive Steering for Multimodal Safety Alignment
by: Park, Jonghyun, et al.
Published: (2025)

Proxy-Free Gaussian Splats Deformation with Splat-Based Surface Estimation
by: Kim, Jaeyeong, et al.
Published: (2025)

Occupancy-Based Dual Contouring
by: Hwang, Jisung, et al.
Published: (2024)

MorphGS: Morphology-Adaptive Articulated 3D Motion Transfer from Videos
by: Kim, Taeyeon, et al.
Published: (2026)

MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models
by: Hong, Chunsan, et al.
Published: (2024)

vid-TLDR: Training Free Token merging for Light-weight Video Transformer
by: Choi, Joonmyung, et al.
Published: (2024)

StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces
by: Yeo, Kyeongmin, et al.
Published: (2025)

InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion
by: Lee, Jihyun, et al.
Published: (2024)

MatLat: Material Latent Space for PBR Texture Generation
by: Yeo, Kyeongmin, et al.
Published: (2025)

Efficient multi-view training for 3D Gaussian Splatting
by: Choi, Minhyuk, et al.
Published: (2025)

Active Prompt Learning with Vision-Language Model Priors
by: Kim, Hoyoung, et al.
Published: (2024)

Budgeted Online Continual Learning by Adaptive Layer Freezing and Frequency-based Sampling
by: Seo, Minhyuk, et al.
Published: (2024)

Formalizing the Sampling Design Space of Diffusion-Based Generative Models via Adaptive Solvers and Wasserstein-Bounded Timesteps
by: Jo, Sangwoo, et al.
Published: (2026)

Beyond Referring Expressions: Scenario Comprehension Visual Grounding
by: He, Ruozhen, et al.
Published: (2026)

Agentic Discovery with Active Hypothesis Exploration for Visual Recognition
by: Koo, Jaywon, et al.
Published: (2026)

Multimodal Dataset Distillation Made Simple by Prototype-Guided Data Synthesis
by: Choi, Junhyeok, et al.
Published: (2026)

IN2OUT: Fine-Tuning Video Inpainting Model for Video Outpainting Using Hierarchical Discriminator
by: Youn, Sangwoo, et al.
Published: (2025)

EgoWorld: Translating Exocentric View to Egocentric View using Rich Exocentric Observations
by: Park, Junho, et al.
Published: (2025)

Active View Selector: Fast and Accurate Active View Selection with Cross Reference Image Quality Assessment
by: Wang, Zirui, et al.
Published: (2025)

Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models
by: Hwang, Jisung, et al.
Published: (2025)

Learning Visual Grounding from Generative Vision and Language Model
by: Wang, Shijie, et al.
Published: (2024)

Infusing Environmental Captions for Long-Form Video Language Grounding
by: Lee, Hyogun, et al.
Published: (2024)