Saved in:
| Main Authors: | Yun, Guhnoo, Yoo, Juhan, Kim, Kijung, Lee, Jeongho, Seo, Paul Hongsuck, Kim, Dong Hwan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.23947 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Initiation of Interaction Detection Framework using a Nonverbal Cue for Human-Robot Interaction
by: Yun, Guhnoo, et al.
Published: (2026)
by: Yun, Guhnoo, et al.
Published: (2026)
Robust Image Self-Recovery against Tampering using Watermark Generation with Pixel Shuffling
by: Kim, Minyoung, et al.
Published: (2025)
by: Kim, Minyoung, et al.
Published: (2025)
Breaking the Visual Shortcuts in Multimodal Knowledge-Based Visual Question Answering
by: Lee, Dosung, et al.
Published: (2025)
by: Lee, Dosung, et al.
Published: (2025)
Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos
by: Kim, Youngseo, et al.
Published: (2025)
by: Kim, Youngseo, et al.
Published: (2025)
Direct Diffusion Score Preference Optimization via Stepwise Contrastive Policy-Pair Supervision
by: Kim, Dohyun, et al.
Published: (2025)
by: Kim, Dohyun, et al.
Published: (2025)
Learning Correlation Structures for Vision Transformers
by: Kim, Manjin, et al.
Published: (2024)
by: Kim, Manjin, et al.
Published: (2024)
Multi-Granularity Video Object Segmentation
by: Lim, Sangbeom, et al.
Published: (2024)
by: Lim, Sangbeom, et al.
Published: (2024)
Bridging Audio and Vision: Zero-Shot Audiovisual Segmentation by Connecting Pretrained Models
by: Lee, Seung-jae, et al.
Published: (2025)
by: Lee, Seung-jae, et al.
Published: (2025)
Bridging the Domain Gap: A Simple Domain Matching Method for Reference-based Image Super-Resolution in Remote Sensing
by: Min, Jeongho, et al.
Published: (2024)
by: Min, Jeongho, et al.
Published: (2024)
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
by: Shin, Heeseong, et al.
Published: (2024)
by: Shin, Heeseong, et al.
Published: (2024)
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
by: Cho, Seokju, et al.
Published: (2023)
by: Cho, Seokju, et al.
Published: (2023)
TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation
by: Kim, Jeongyun, et al.
Published: (2025)
by: Kim, Jeongyun, et al.
Published: (2025)
Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation
by: Yu, Seonghoon, et al.
Published: (2024)
by: Yu, Seonghoon, et al.
Published: (2024)
Hybrid-Vector Retrieval for Visually Rich Documents: Combining Single-Vector Efficiency and Multi-Vector Accuracy
by: Kim, Juyeon, et al.
Published: (2025)
by: Kim, Juyeon, et al.
Published: (2025)
Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers
by: Kim, Chaehyun, et al.
Published: (2025)
by: Kim, Chaehyun, et al.
Published: (2025)
From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance
by: Min, Jeongho, et al.
Published: (2025)
by: Min, Jeongho, et al.
Published: (2025)
DialNav: Multi-turn Dialog Navigation with a Remote Guide
by: Han, Leekyeung, et al.
Published: (2025)
by: Han, Leekyeung, et al.
Published: (2025)
TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
by: Kim, Jeongho, et al.
Published: (2024)
by: Kim, Jeongho, et al.
Published: (2024)
Clutt3R-Seg: Sparse-view 3D Instance Segmentation for Language-grounded Grasping in Cluttered Scenes
by: Noh, Jeongho, et al.
Published: (2026)
by: Noh, Jeongho, et al.
Published: (2026)
VARCO-VISION: Expanding Frontiers in Korean Vision-Language Models
by: Ju, Jeongho, et al.
Published: (2024)
by: Ju, Jeongho, et al.
Published: (2024)
From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation
by: Kim, Jeongho, et al.
Published: (2025)
by: Kim, Jeongho, et al.
Published: (2025)
Memory-Efficient Personalization of Text-to-Image Diffusion Models via Selective Optimization Strategies
by: Choi, Seokeon, et al.
Published: (2025)
by: Choi, Seokeon, et al.
Published: (2025)
BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution
by: Kim, Eunjin, et al.
Published: (2025)
by: Kim, Eunjin, et al.
Published: (2025)
H2G: Hierarchy-Aware Hyperbolic Grouping for 3D Scenes
by: Ko, ByungHa, et al.
Published: (2026)
by: Ko, ByungHa, et al.
Published: (2026)
What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer
by: Chung, Chaeyeon, et al.
Published: (2024)
by: Chung, Chaeyeon, et al.
Published: (2024)
Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble
by: Cha, Juhan, et al.
Published: (2024)
by: Cha, Juhan, et al.
Published: (2024)
Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation
by: Kim, Min-Jung, et al.
Published: (2025)
by: Kim, Min-Jung, et al.
Published: (2025)
RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting
by: Seo, Ji Hyun, et al.
Published: (2025)
by: Seo, Ji Hyun, et al.
Published: (2025)
TERDNet: Transformer Encoder-Recurrent Decoder Network for Scene Change Detection
by: Yoon, Jiae, et al.
Published: (2026)
by: Yoon, Jiae, et al.
Published: (2026)
Fieldscale: Locality-Aware Field-based Adaptive Rescaling for Thermal Infrared Image
by: Gil, Hyeonjae, et al.
Published: (2024)
by: Gil, Hyeonjae, et al.
Published: (2024)
JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts
by: Son, Taein, et al.
Published: (2024)
by: Son, Taein, et al.
Published: (2024)
UniSpector: Towards Universal Open-set Defect Recognition via Spectral-Contrastive Visual Prompting
by: Kim, Geonuk, et al.
Published: (2026)
by: Kim, Geonuk, et al.
Published: (2026)
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning
by: Kim, Geewook, et al.
Published: (2024)
by: Kim, Geewook, et al.
Published: (2024)
PlugTrack: Multi-Perceptive Motion Analysis for Adaptive Fusion in Multi-Object Tracking
by: Kim, Seungjae, et al.
Published: (2025)
by: Kim, Seungjae, et al.
Published: (2025)
Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens
by: Lew, Jaihyun, et al.
Published: (2024)
by: Lew, Jaihyun, et al.
Published: (2024)
Degradation-Agnostic Statistical Facial Feature Transformation for Blind Face Restoration in Adverse Weather Conditions
by: Son, Chang-Hwan, et al.
Published: (2025)
by: Son, Chang-Hwan, et al.
Published: (2025)
ERASE: Eliminating Redundant Visual Tokens via Adaptive Two-Stage Token Pruning
by: Lee, Yuna, et al.
Published: (2026)
by: Lee, Yuna, et al.
Published: (2026)
PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask
by: Kim, Jeongho, et al.
Published: (2024)
by: Kim, Jeongho, et al.
Published: (2024)
Masked Autoregressive Model for Weather Forecasting
by: Kim, Doyi, et al.
Published: (2024)
by: Kim, Doyi, et al.
Published: (2024)
A Decoding Scheme with Successive Aggregation of Multi-Level Features for Light-Weight Semantic Segmentation
by: Yoo, Jiwon, et al.
Published: (2024)
by: Yoo, Jiwon, et al.
Published: (2024)
Similar Items
-
Initiation of Interaction Detection Framework using a Nonverbal Cue for Human-Robot Interaction
by: Yun, Guhnoo, et al.
Published: (2026) -
Robust Image Self-Recovery against Tampering using Watermark Generation with Pixel Shuffling
by: Kim, Minyoung, et al.
Published: (2025) -
Breaking the Visual Shortcuts in Multimodal Knowledge-Based Visual Question Answering
by: Lee, Dosung, et al.
Published: (2025) -
Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos
by: Kim, Youngseo, et al.
Published: (2025) -
Direct Diffusion Score Preference Optimization via Stepwise Contrastive Policy-Pair Supervision
by: Kim, Dohyun, et al.
Published: (2025)