Saved in:
| Main Authors: | Ye, Zihao, Cho, Jaehoon, Oh, Changjae |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.00258 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Prototype Unit for Image De-raining using Time-Lapse Data
by: Cho, Jaehoon, et al.
Published: (2024)
by: Cho, Jaehoon, et al.
Published: (2024)
The Detector Teaches Itself: Lightweight Self-Supervised Adaptation for Open-Vocabulary Object Detection
by: Wan, Yazhe, et al.
Published: (2026)
by: Wan, Yazhe, et al.
Published: (2026)
Chain-of-Caption: Training-free improvement of multimodal large language model on referring expression comprehension
by: Pang, Yik Lung, et al.
Published: (2026)
by: Pang, Yik Lung, et al.
Published: (2026)
Improving Generalization of Language-Conditioned Robot Manipulation
by: Cui, Chenglin, et al.
Published: (2025)
by: Cui, Chenglin, et al.
Published: (2025)
FlowOVD: Learning Generative Latent Flows for Zero-shot Open-vocabulary Detection
by: Wei, Yao, et al.
Published: (2026)
by: Wei, Yao, et al.
Published: (2026)
Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation
by: Kim, Jihyun, et al.
Published: (2024)
by: Kim, Jihyun, et al.
Published: (2024)
Sparse multi-view hand-object reconstruction for unseen environments
by: Pang, Yik Lung, et al.
Published: (2024)
by: Pang, Yik Lung, et al.
Published: (2024)
Learning by Erasing: Conditional Entropy based Transferable Out-Of-Distribution Detection
by: Xing, Meng, et al.
Published: (2022)
by: Xing, Meng, et al.
Published: (2022)
Elevating Flow-Guided Video Inpainting with Reference Generation
by: Cho, Suhwan, et al.
Published: (2024)
by: Cho, Suhwan, et al.
Published: (2024)
Adaptive Multi-Modal Control of Digital Human Hand Synthesis Using a Region-Aware Cycle Loss
by: Fu, Qifan, et al.
Published: (2024)
by: Fu, Qifan, et al.
Published: (2024)
HanDrawer: Leveraging Spatial Information to Render Realistic Hands Using a Conditional Diffusion Model in Single Stage
by: Fu, Qifan, et al.
Published: (2025)
by: Fu, Qifan, et al.
Published: (2025)
Open-vocabulary object 6D pose estimation
by: Corsetti, Jaime, et al.
Published: (2023)
by: Corsetti, Jaime, et al.
Published: (2023)
Stereo Hand-Object Reconstruction for Human-to-Robot Handover
by: Pang, Yik Lung, et al.
Published: (2024)
by: Pang, Yik Lung, et al.
Published: (2024)
Learning human-to-robot handovers through 3D scene reconstruction
by: Wu, Yuekun, et al.
Published: (2025)
by: Wu, Yuekun, et al.
Published: (2025)
TransRef: Multi-Scale Reference Embedding Transformer for Reference-Guided Image Inpainting
by: Liu, Taorong, et al.
Published: (2023)
by: Liu, Taorong, et al.
Published: (2023)
Toward Human-Robot Teaming: Learning Handover Behaviors from 3D Scenes
by: Wu, Yuekun, et al.
Published: (2025)
by: Wu, Yuekun, et al.
Published: (2025)
High-resolution open-vocabulary object 6D pose estimation
by: Corsetti, Jaime, et al.
Published: (2024)
by: Corsetti, Jaime, et al.
Published: (2024)
Improved Masked Image Generation with Knowledge-Augmented Token Representations
by: Liang, Guotao, et al.
Published: (2025)
by: Liang, Guotao, et al.
Published: (2025)
Brain-Streams: fMRI-to-Image Reconstruction with Multi-modal Guidance
by: Joo, Jaehoon, et al.
Published: (2024)
by: Joo, Jaehoon, et al.
Published: (2024)
Transforming Static Images Using Generative Models for Video Salient Object Detection
by: Cho, Suhwan, et al.
Published: (2024)
by: Cho, Suhwan, et al.
Published: (2024)
Contrastive Local Manifold Learning for No-Reference Image Quality Assessment
by: Huang, Zihao, et al.
Published: (2024)
by: Huang, Zihao, et al.
Published: (2024)
Cross-aware Early Fusion with Stage-divided Vision and Language Transformer Encoders for Referring Image Segmentation
by: Cho, Yubin, et al.
Published: (2024)
by: Cho, Yubin, et al.
Published: (2024)
Griffin: Generative Reference and Layout Guided Image Composition
by: Mikaeili, Aryan, et al.
Published: (2025)
by: Mikaeili, Aryan, et al.
Published: (2025)
Towards Better De-raining Generalization via Rainy Characteristics Memorization and Replay
by: Wang, Kunyu, et al.
Published: (2025)
by: Wang, Kunyu, et al.
Published: (2025)
ConTEXTure: Consistent Multiview Images to Texture
by: Ahn, Jaehoon, et al.
Published: (2024)
by: Ahn, Jaehoon, et al.
Published: (2024)
AeroReformer: Aerial Referring Transformer for UAV-based Referring Image Segmentation
by: Li, Rui, et al.
Published: (2025)
by: Li, Rui, et al.
Published: (2025)
ContactGen: Contact-Guided Interactive 3D Human Generation for Partners
by: Gu, Dongjun, et al.
Published: (2024)
by: Gu, Dongjun, et al.
Published: (2024)
ImageRAG: Dynamic Image Retrieval for Reference-Guided Image Generation
by: Shalev-Arkushin, Rotem, et al.
Published: (2025)
by: Shalev-Arkushin, Rotem, et al.
Published: (2025)
Preference-Guided Debiasing for No-Reference Enhancement Image Quality Assessment
by: Gao, Shiqi, et al.
Published: (2026)
by: Gao, Shiqi, et al.
Published: (2026)
You Only Pose Once: A Minimalist's Detection Transformer for Monocular RGB Category-level 9D Multi-Object Pose Estimation
by: Lee, Hakjin, et al.
Published: (2025)
by: Lee, Hakjin, et al.
Published: (2025)
Key-point Guided Deformable Image Manipulation Using Diffusion Model
by: Oh, Seok-Hwan, et al.
Published: (2024)
by: Oh, Seok-Hwan, et al.
Published: (2024)
Learning Object-Centric Representations in SAR Images with Multi-Level Feature Fusion
by: Jang, Oh-Tae, et al.
Published: (2025)
by: Jang, Oh-Tae, et al.
Published: (2025)
EviRCOD: Evidence-Guided Probabilistic Decoding for Referring Camouflaged Object Detection
by: Wang, Ye, et al.
Published: (2026)
by: Wang, Ye, et al.
Published: (2026)
Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator
by: Choi, Wonhyeok, et al.
Published: (2024)
by: Choi, Wonhyeok, et al.
Published: (2024)
Durian: Dual Reference Image-Guided Portrait Animation with Attribute Transfer
by: Cha, Hyunsoo, et al.
Published: (2025)
by: Cha, Hyunsoo, et al.
Published: (2025)
Progressive Prompt-Guided Cross-Modal Reasoning for Referring Image Segmentation
by: Li, Jiachen, et al.
Published: (2026)
by: Li, Jiachen, et al.
Published: (2026)
GSFixer: Improving 3D Gaussian Splatting with Reference-Guided Video Diffusion Priors
by: Yin, Xingyilang, et al.
Published: (2025)
by: Yin, Xingyilang, et al.
Published: (2025)
RefMask3D: Language-Guided Transformer for 3D Referring Segmentation
by: He, Shuting, et al.
Published: (2024)
by: He, Shuting, et al.
Published: (2024)
MangaDiT: Reference-Guided Line Art Colorization with Hierarchical Attention in Diffusion Transformers
by: Qiu, Qianru, et al.
Published: (2025)
by: Qiu, Qianru, et al.
Published: (2025)
Reference-Guided Identity Preserving Face Restoration
by: Zhou, Mo, et al.
Published: (2025)
by: Zhou, Mo, et al.
Published: (2025)
Similar Items
-
A Prototype Unit for Image De-raining using Time-Lapse Data
by: Cho, Jaehoon, et al.
Published: (2024) -
The Detector Teaches Itself: Lightweight Self-Supervised Adaptation for Open-Vocabulary Object Detection
by: Wan, Yazhe, et al.
Published: (2026) -
Chain-of-Caption: Training-free improvement of multimodal large language model on referring expression comprehension
by: Pang, Yik Lung, et al.
Published: (2026) -
Improving Generalization of Language-Conditioned Robot Manipulation
by: Cui, Chenglin, et al.
Published: (2025) -
FlowOVD: Learning Generative Latent Flows for Zero-shot Open-vocabulary Detection
by: Wei, Yao, et al.
Published: (2026)