Saved in:
| Main Authors: | Lee, Minjae, Hur, Sungwoo, Hwang, Soojin, Kim, Won Hwa |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.12113 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
GenFlow: Generalizable Recurrent Flow for 6D Pose Refinement of Novel Objects
by: Moon, Sungphill, et al.
Published: (2024)
by: Moon, Sungphill, et al.
Published: (2024)
MoGIC: Boosting Motion Generation via Intention Understanding and Visual Context
by: Shi, Junyu, et al.
Published: (2025)
by: Shi, Junyu, et al.
Published: (2025)
FlowCLAS: Enhancing Normalizing Flow Via Contrastive Learning For Anomaly Segmentation
by: Lee, Chang Won, et al.
Published: (2024)
by: Lee, Chang Won, et al.
Published: (2024)
Context-Based Visual-Language Place Recognition
by: Woo, Soojin, et al.
Published: (2024)
by: Woo, Soojin, et al.
Published: (2024)
Joint-Embedding Predictive Architecture for Self-Supervised Learning of Mask Classification Architecture
by: Kim, Dong-Hee, et al.
Published: (2024)
by: Kim, Dong-Hee, et al.
Published: (2024)
CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection
by: Kim, Jisong, et al.
Published: (2024)
by: Kim, Jisong, et al.
Published: (2024)
DIAL: Dense Image-text ALignment for Weakly Supervised Semantic Segmentation
by: Jang, Soojin, et al.
Published: (2024)
by: Jang, Soojin, et al.
Published: (2024)
Mask2Map: Vectorized HD Map Construction Using Bird's Eye View Segmentation Masks
by: Choi, Sehwan, et al.
Published: (2024)
by: Choi, Sehwan, et al.
Published: (2024)
Improving Calibration in Test-Time Prompt Tuning for Vision-Language Models via Data-Free Flatness-Aware Prompt Pretraining
by: Jang, Hyeonseo, et al.
Published: (2026)
by: Jang, Hyeonseo, et al.
Published: (2026)
VideoMaMa: Mask-Guided Video Matting via Generative Prior
by: Lim, Sangbeom, et al.
Published: (2026)
by: Lim, Sangbeom, et al.
Published: (2026)
Learning Multi-resolution Graph Edge Embedding for Discovering Brain Network Dysfunction in Neurological Disorders
by: Ma, Xin, et al.
Published: (2019)
by: Ma, Xin, et al.
Published: (2019)
Masked Spatial Propagation Network for Sparsity-Adaptive Depth Refinement
by: Jun, Jinyoung, et al.
Published: (2024)
by: Jun, Jinyoung, et al.
Published: (2024)
ECLIPSE: Efficient Continual Learning in Panoptic Segmentation with Visual Prompt Tuning
by: Kim, Beomyoung, et al.
Published: (2024)
by: Kim, Beomyoung, et al.
Published: (2024)
Unlocking the Capabilities of Masked Generative Models for Image Synthesis via Self-Guidance
by: Hur, Jiwan, et al.
Published: (2024)
by: Hur, Jiwan, et al.
Published: (2024)
SAMRefiner: Taming Segment Anything Model for Universal Mask Refinement
by: Lin, Yuqi, et al.
Published: (2025)
by: Lin, Yuqi, et al.
Published: (2025)
GIC: Gaussian-Informed Continuum for Physical Property Identification and Simulation
by: Cai, Junhao, et al.
Published: (2024)
by: Cai, Junhao, et al.
Published: (2024)
Segment Anyword: Mask Prompt Inversion for Open-Set Grounded Segmentation
by: Liu, Zhihua, et al.
Published: (2025)
by: Liu, Zhihua, et al.
Published: (2025)
GIC-DLC: Differentiable Logic Circuits for Hardware-Friendly Grayscale Image Compression
by: Aczel, Till, et al.
Published: (2026)
by: Aczel, Till, et al.
Published: (2026)
ProGIC: Progressive and Lightweight Generative Image Compression with Residual Vector Quantization
by: Cao, Hao, et al.
Published: (2026)
by: Cao, Hao, et al.
Published: (2026)
RCM-Fusion: Radar-Camera Multi-Level Fusion for 3D Object Detection
by: Kim, Jisong, et al.
Published: (2023)
by: Kim, Jisong, et al.
Published: (2023)
MR-Occ: Efficient Camera-LiDAR 3D Semantic Occupancy Prediction Using Hierarchical Multi-Resolution Voxel Representation
by: Seong, Minjae, et al.
Published: (2024)
by: Seong, Minjae, et al.
Published: (2024)
Global Prompt Refinement with Non-Interfering Attention Masking for One-Shot Federated Learning
by: Qi, Zhuang, et al.
Published: (2025)
by: Qi, Zhuang, et al.
Published: (2025)
Evaluating Visual Explanations of Attention Maps for Transformer-based Medical Imaging
by: Chung, Minjae, et al.
Published: (2025)
by: Chung, Minjae, et al.
Published: (2025)
Debiased Fine-Tuning for Vision-language Models by Prompt Regularization
by: Zhu, Beier, et al.
Published: (2023)
by: Zhu, Beier, et al.
Published: (2023)
Prompt-Guided Mask Proposal for Two-Stage Open-Vocabulary Segmentation
by: Li, Yu-Jhe, et al.
Published: (2024)
by: Li, Yu-Jhe, et al.
Published: (2024)
MaDis-Stereo: Enhanced Stereo Matching via Distilled Masked Image Modeling
by: Ahn, Jihye, et al.
Published: (2024)
by: Ahn, Jihye, et al.
Published: (2024)
O-MaMa: Learning Object Mask Matching between Egocentric and Exocentric Views
by: Mur-Labadia, Lorenzo, et al.
Published: (2025)
by: Mur-Labadia, Lorenzo, et al.
Published: (2025)
MARS: Mask Attention Refinement with Sequential Quadtree Nodes for Car Damage Instance Segmentation
by: Panboonyuen, Teerapong, et al.
Published: (2023)
by: Panboonyuen, Teerapong, et al.
Published: (2023)
Generalize Polyp Segmentation via Inpainting across Diverse Backgrounds and Pseudo-Mask Refinement
by: Ma, Jiajian, et al.
Published: (2024)
by: Ma, Jiajian, et al.
Published: (2024)
A Cross-Scale Decoder with Token Refinement for Off-Road Semantic Segmentation
by: An, Seongkyu Choi Jhonghyun
Published: (2026)
by: An, Seongkyu Choi Jhonghyun
Published: (2026)
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation
by: Kim, Chanyoung, et al.
Published: (2024)
by: Kim, Chanyoung, et al.
Published: (2024)
Iterative Prompt Refinement for Safer Text-to-Image Generation
by: Jeon, Jinwoo, et al.
Published: (2025)
by: Jeon, Jinwoo, et al.
Published: (2025)
MaGGIe: Masked Guided Gradual Human Instance Matting
by: Huynh, Chuong, et al.
Published: (2024)
by: Huynh, Chuong, et al.
Published: (2024)
DepthFlow: Exploiting Depth-Flow Structural Correlations for Unsupervised Video Object Segmentation
by: Cho, Suhwan, et al.
Published: (2025)
by: Cho, Suhwan, et al.
Published: (2025)
Deforming Videos to Masks: Flow Matching for Referring Video Segmentation
by: Wang, Zanyi, et al.
Published: (2025)
by: Wang, Zanyi, et al.
Published: (2025)
NeuralVDB: High-resolution Sparse Volume Representation using Hierarchical Neural Networks
by: Kim, Doyub, et al.
Published: (2022)
by: Kim, Doyub, et al.
Published: (2022)
CNG-SFDA:Clean-and-Noisy Region Guided Online-Offline Source-Free Domain Adaptation
by: Cho, Hyeonwoo, et al.
Published: (2024)
by: Cho, Hyeonwoo, et al.
Published: (2024)
Effective Rank Analysis and Regularization for Enhanced 3D Gaussian Splatting
by: Hyung, Junha, et al.
Published: (2024)
by: Hyung, Junha, et al.
Published: (2024)
SAMIC: Segment Anything with In-Context Spatial Prompt Engineering
by: Nagendra, Savinay, et al.
Published: (2024)
by: Nagendra, Savinay, et al.
Published: (2024)
JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts
by: Son, Taein, et al.
Published: (2024)
by: Son, Taein, et al.
Published: (2024)
Similar Items
-
GenFlow: Generalizable Recurrent Flow for 6D Pose Refinement of Novel Objects
by: Moon, Sungphill, et al.
Published: (2024) -
MoGIC: Boosting Motion Generation via Intention Understanding and Visual Context
by: Shi, Junyu, et al.
Published: (2025) -
FlowCLAS: Enhancing Normalizing Flow Via Contrastive Learning For Anomaly Segmentation
by: Lee, Chang Won, et al.
Published: (2024) -
Context-Based Visual-Language Place Recognition
by: Woo, Soojin, et al.
Published: (2024) -
Joint-Embedding Predictive Architecture for Self-Supervised Learning of Mask Classification Architecture
by: Kim, Dong-Hee, et al.
Published: (2024)