:: Library Catalog

Bibliographic Details
Main Authors:	Yun, Guhnoo; Yoo, Juhan; Kim, Kijung; Lee, Jeongho; Seo, Paul Hongsuck; Kim, Dong Hwan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2503.23947

Similar Items

Initiation of Interaction Detection Framework using a Nonverbal Cue for Human-Robot Interaction
by: Yun, Guhnoo, et al.
Published: (2026)

Robust Image Self-Recovery against Tampering using Watermark Generation with Pixel Shuffling
by: Kim, Minyoung, et al.
Published: (2025)

Breaking the Visual Shortcuts in Multimodal Knowledge-Based Visual Question Answering
by: Lee, Dosung, et al.
Published: (2025)

Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos
by: Kim, Youngseo, et al.
Published: (2025)

Direct Diffusion Score Preference Optimization via Stepwise Contrastive Policy-Pair Supervision
by: Kim, Dohyun, et al.
Published: (2025)

Learning Correlation Structures for Vision Transformers
by: Kim, Manjin, et al.
Published: (2024)

Multi-Granularity Video Object Segmentation
by: Lim, Sangbeom, et al.
Published: (2024)

Bridging Audio and Vision: Zero-Shot Audiovisual Segmentation by Connecting Pretrained Models
by: Lee, Seung-jae, et al.
Published: (2025)

Bridging the Domain Gap: A Simple Domain Matching Method for Reference-based Image Super-Resolution in Remote Sensing
by: Min, Jeongho, et al.
Published: (2024)

Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels
by: Shin, Heeseong, et al.
Published: (2024)

CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation
by: Cho, Seokju, et al.
Published: (2023)

TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation
by: Kim, Jeongyun, et al.
Published: (2025)

Pseudo-RIS: Distinctive Pseudo-supervision Generation for Referring Image Segmentation
by: Yu, Seonghoon, et al.
Published: (2024)

Hybrid-Vector Retrieval for Visually Rich Documents: Combining Single-Vector Efficiency and Multi-Vector Accuracy
by: Kim, Juyeon, et al.
Published: (2025)

Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers
by: Kim, Chaehyun, et al.
Published: (2025)

From Street to Orbit: Training-Free Cross-View Retrieval via Location Semantics and LLM Guidance
by: Min, Jeongho, et al.
Published: (2025)

DialNav: Multi-turn Dialog Navigation with a Remote Guide
by: Han, Leekyeung, et al.
Published: (2025)

TCAN: Animating Human Images with Temporally Consistent Pose Guidance using Diffusion Models
by: Kim, Jeongho, et al.
Published: (2024)

Clutt3R-Seg: Sparse-view 3D Instance Segmentation for Language-grounded Grasping in Cluttered Scenes
by: Noh, Jeongho, et al.
Published: (2026)

VARCO-VISION: Expanding Frontiers in Korean Vision-Language Models
by: Ju, Jeongho, et al.
Published: (2024)

From Wardrobe to Canvas: Wardrobe Polyptych LoRA for Part-level Controllable Human Image Generation
by: Kim, Jeongho, et al.
Published: (2025)

Memory-Efficient Personalization of Text-to-Image Diffusion Models via Selective Optimization Strategies
by: Choi, Seokeon, et al.
Published: (2025)

BF-STVSR: B-Splines and Fourier-Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution
by: Kim, Eunjin, et al.
Published: (2025)

H2G: Hierarchy-Aware Hyperbolic Grouping for 3D Scenes
by: Ko, ByungHa, et al.
Published: (2026)

What to Preserve and What to Transfer: Faithful, Identity-Preserving Diffusion-based Hairstyle Transfer
by: Chung, Chaeyeon, et al.
Published: (2024)

Robust Multimodal 3D Object Detection via Modality-Agnostic Decoding and Proximity-based Modality Ensemble
by: Cha, Juhan, et al.
Published: (2024)

Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation
by: Kim, Min-Jung, et al.
Published: (2025)

RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting
by: Seo, Ji Hyun, et al.
Published: (2025)

TERDNet: Transformer Encoder-Recurrent Decoder Network for Scene Change Detection
by: Yoon, Jiae, et al.
Published: (2026)

Fieldscale: Locality-Aware Field-based Adaptive Rescaling for Thermal Infrared Image
by: Gil, Hyeonjae, et al.
Published: (2024)

JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts
by: Son, Taein, et al.
Published: (2024)

UniSpector: Towards Universal Open-set Defect Recognition via Spectral-Contrastive Visual Prompting
by: Kim, Geonuk, et al.
Published: (2026)

On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning
by: Kim, Geewook, et al.
Published: (2024)

PlugTrack: Multi-Perceptive Motion Analysis for Adaptive Fusion in Multi-Object Tracking
by: Kim, Seungjae, et al.
Published: (2025)

Superpixel Tokenization for Vision Transformers: Preserving Semantic Integrity in Visual Tokens
by: Lew, Jaihyun, et al.
Published: (2024)

Degradation-Agnostic Statistical Facial Feature Transformation for Blind Face Restoration in Adverse Weather Conditions
by: Son, Chang-Hwan, et al.
Published: (2025)

ERASE: Eliminating Redundant Visual Tokens via Adaptive Two-Stage Token Pruning
by: Lee, Yuna, et al.
Published: (2026)

PromptDresser: Improving the Quality and Controllability of Virtual Try-On via Generative Textual Prompt and Prompt-aware Mask
by: Kim, Jeongho, et al.
Published: (2024)

Masked Autoregressive Model for Weather Forecasting
by: Kim, Doyi, et al.
Published: (2024)

A Decoding Scheme with Successive Aggregation of Multi-Level Features for Light-Weight Semantic Segmentation
by: Yoo, Jiwon, et al.
Published: (2024)