| Main Authors: | Koo, Juil, Choi, Daehyeon, Youn, Sangwoo, Lee, Phillip Y., Sung, Minhyuk |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2512.13250 |
Similar Items
Posterior Distillation Sampling
by: Koo, Juil, et al.
Published: (2023)
Token Warping Helps MLLMs Look from Nearby Viewpoints
by: Lee, Phillip Y., et al.
Published: (2026)
ReGround: Improving Textual and Spatial Grounding at No Cost
by: Lee, Phillip Y., et al.
Published: (2024)
Neural Pose Representation Learning for Generating and Transferring Non-Rigid Object Poses
by: Yoo, Seungwoo, et al.
Published: (2024)
SyncTweedies: A General Generative Framework Based on Synchronized Diffusions
by: Kim, Jaihoon, et al.
Published: (2024)
ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
by: Min, Yunhong, et al.
Published: (2025)
SALAD: Part-Level Latent Diffusion for 3D Shape Generation and Manipulation
by: Koo, Juil, et al.
Published: (2023)
GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation
by: Lee, Phillip Y., et al.
Published: (2024)
BoxSplitGen: A Generative Model for 3D Part Bounding Boxes in Varying Granularity
by: Koo, Juil, et al.
Published: (2026)
VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors
by: Koo, Juil, et al.
Published: (2025)
DiverseVAR: Balancing Diversity and Quality of Next-Scale Visual Autoregressive Models
by: Park, Mingue, et al.
Published: (2025)
Unconditional Priors Matter! Improving Conditional Generation of Fine-Tuned Diffusion Models
by: Phunyaphibarn, Prin, et al.
Published: (2025)
DiffusionRollout: Uncertainty-Aware Rollout Planning in Long-Horizon PDE Solving
by: Yoo, Seungwoo, et al.
Published: (2026)
Perspective-Aware Reasoning in Vision-Language Models via Mental Imagery Simulation
by: Lee, Phillip Y., et al.
Published: (2025)
OASIS: Online Sample Selection for Continual Visual Instruction Tuning
by: Lee, Minjae, et al.
Published: (2025)
InstantDrag: Improving Interactivity in Drag-based Image Editing
by: Shin, Joonghyuk, et al.
Published: (2024)
PartSTAD: 2D-to-3D Part Segmentation Task Adaptation
by: Kim, Hyunjin, et al.
Published: (2024)
MV2Cyl: Reconstructing 3D Extrusion Cylinders from Multi-View Images
by: Hong, Eunji, et al.
Published: (2024)
Attention Misses Visual Risk: Risk-Adaptive Steering for Multimodal Safety Alignment
by: Park, Jonghyun, et al.
Published: (2025)
Proxy-Free Gaussian Splats Deformation with Splat-Based Surface Estimation
by: Kim, Jaeyeong, et al.
Published: (2025)
Occupancy-Based Dual Contouring
by: Hwang, Jisung, et al.
Published: (2024)
MorphGS: Morphology-Adaptive Articulated 3D Motion Transfer from Videos
by: Kim, Taeyeon, et al.
Published: (2026)
MemBench: Memorized Image Trigger Prompt Dataset for Diffusion Models
by: Hong, Chunsan, et al.
Published: (2024)
vid-TLDR: Training Free Token merging for Light-weight Video Transformer
by: Choi, Joonmyung, et al.
Published: (2024)
StochSync: Stochastic Diffusion Synchronization for Image Generation in Arbitrary Spaces
by: Yeo, Kyeongmin, et al.
Published: (2025)
InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion
by: Lee, Jihyun, et al.
Published: (2024)
MatLat: Material Latent Space for PBR Texture Generation
by: Yeo, Kyeongmin, et al.
Published: (2025)
Efficient multi-view training for 3D Gaussian Splatting
by: Choi, Minhyuk, et al.
Published: (2025)
Active Prompt Learning with Vision-Language Model Priors
by: Kim, Hoyoung, et al.
Published: (2024)
Budgeted Online Continual Learning by Adaptive Layer Freezing and Frequency-based Sampling
by: Seo, Minhyuk, et al.
Published: (2024)
Formalizing the Sampling Design Space of Diffusion-Based Generative Models via Adaptive Solvers and Wasserstein-Bounded Timesteps
by: Jo, Sangwoo, et al.
Published: (2026)
Beyond Referring Expressions: Scenario Comprehension Visual Grounding
by: He, Ruozhen, et al.
Published: (2026)
Agentic Discovery with Active Hypothesis Exploration for Visual Recognition
by: Koo, Jaywon, et al.
Published: (2026)
Multimodal Dataset Distillation Made Simple by Prototype-Guided Data Synthesis
by: Choi, Junhyeok, et al.
Published: (2026)
IN2OUT: Fine-Tuning Video Inpainting Model for Video Outpainting Using Hierarchical Discriminator
by: Youn, Sangwoo, et al.
Published: (2025)
EgoWorld: Translating Exocentric View to Egocentric View using Rich Exocentric Observations
by: Park, Junho, et al.
Published: (2025)
Active View Selector: Fast and Accurate Active View Selection with Cross Reference Image Quality Assessment
by: Wang, Zirui, et al.
Published: (2025)
Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models
by: Hwang, Jisung, et al.
Published: (2025)
Learning Visual Grounding from Generative Vision and Language Model
by: Wang, Shijie, et al.
Published: (2024)
Infusing Environmental Captions for Long-Form Video Language Grounding
by: Lee, Hyogun, et al.
Published: (2024)