| Main Authors: | Kim, Junho; Kim, Young Min; Zahreddine, Ramzi; Welge, Weston A.; Krishnan, Gurunandan; Ma, Sizhuo; Wang, Jian |
|---|---|
| Format: | Preprint |
| Published: | 2022 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2212.03177 |
Similar Items
Velocity Disambiguation for Video Frame Interpolation
by: Zhong, Zhihang, et al.
Published: (2023)
Delving Deep into Engagement Prediction of Short Videos
by: Li, Dasong, et al.
Published: (2024)
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer
by: Chen, Wei-Ting, et al.
Published: (2024)
Fully Geometric Panoramic Localization
by: Kim, Junho, et al.
Published: (2024)
Calibrating Panoramic Depth Estimation for Practical Localization and Mapping
by: Kim, Junho, et al.
Published: (2023)
Finding 3D Scene Analogies with Multimodal Foundation Models
by: Kim, Junho, et al.
Published: (2025)
PICCOLO: Point Cloud-Centric Omnidirectional Localization
by: Kim, Junho, et al.
Published: (2021)
CPO: Change Robust Panorama to Point Cloud Localization
by: Kim, Junho, et al.
Published: (2022)
RoEL: Robust Event-based 3D Line Reconstruction
by: Bae, Gwangtak, et al.
Published: (2026)
LTGS: Long-Term Gaussian Scene Chronology From Sparse View Updates
by: Kim, Minkwan, et al.
Published: (2025)
T2M-X: Learning Expressive Text-to-Motion Generation from Partially Annotated Data
by: Liu, Mingdian, et al.
Published: (2024)
Event-based Facial Keypoint Alignment via Cross-Modal Fusion Attention and Self-Supervised Multi-Event Representation Learning
by: Kang, Donghwa, et al.
Published: (2025)
Depth-Guided Privacy-Preserving Visual Localization Using 3D Sphere Clouds
by: Moon, Heejoon, et al.
Published: (2026)
Revisiting Geometric Obfuscation with Dual Convergent Lines for Privacy-Preserving Image Queries in Visual Localization
by: Kim, Jeonggon, et al.
Published: (2026)
Learning 3D Scene Analogies with Neural Contextual Scene Maps
by: Kim, Junho, et al.
Published: (2025)
Analogical Trajectory Transfer
by: Kim, Junho, et al.
Published: (2026)
VG3T: Visual Geometry Grounded Gaussian Transformer
by: Kim, Junho, et al.
Published: (2025)
Gaussian Splatting Feature Fields for Privacy-Preserving Visual Localization
by: Pietrantoni, Maxime, et al.
Published: (2025)
Bounded-Compute Multimodal Regression for Product-Rating Prediction
by: Leach, William, et al.
Published: (2026)
OnlineBEV: Recurrent Temporal Fusion in Bird's Eye View Representations for Multi-Camera 3D Perception
by: Koh, Junho, et al.
Published: (2025)
gQIR: Generative Quanta Image Reconstruction
by: Garg, Aryan, et al.
Published: (2026)
Temporal-Mapping Photography for Event Cameras
by: Bao, Yuhan, et al.
Published: (2024)
Visual Style Prompting with Swapping Self-Attention
by: Jeong, Jaeseok, et al.
Published: (2024)
RCTDistill: Cross-Modal Knowledge Distillation Framework for Radar-Camera 3D Object Detection with Temporal Fusion
by: Bang, Geonho, et al.
Published: (2025)
Visual Grounding from Event Cameras
by: Kong, Lingdong, et al.
Published: (2025)
InstantRestore: Single-Step Personalized Face Restoration with Shared-Image Attention
by: Zhang, Howard, et al.
Published: (2024)
StyleKeeper: Prevent Content Leakage using Negative Visual Query Guidance
by: Jeong, Jaeseok, et al.
Published: (2025)
Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving Camera
by: Kirkland, Hannah, et al.
Published: (2023)
CAPA: Contribution-Aware Pruning and FFN Approximation for Efficient Large Vision-Language Models
by: Jha, Samyak, et al.
Published: (2026)
UniSpector: Towards Universal Open-set Defect Recognition via Spectral-Contrastive Visual Prompting
by: Kim, Geonuk, et al.
Published: (2026)
Flow-Based Visual Stream Compression for Event Cameras
by: Stumpp, Daniel C., et al.
Published: (2024)
Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation
by: Kwak, Min-Seop, et al.
Published: (2025)
Is There a Better Source Distribution than Gaussian? Exploring Source Distributions for Image Flow Matching
by: Lee, Junho, et al.
Published: (2025)
Geometry-Aware Image Flow Matching
by: Lee, Junho, et al.
Published: (2026)
Infinite-Homography as Robust Conditioning for Camera-Controlled Video Generation
by: Kim, Min-Jung, et al.
Published: (2025)
CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models
by: Kim, Junho, et al.
Published: (2024)
DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes
by: Park, Sungjune, et al.
Published: (2025)
Real-Time Privacy Preservation for Robot Visual Perception
by: Choi, Minkyu, et al.
Published: (2025)
Raw2Event: Converting Raw Frame Camera into Event Camera
by: Ning, Zijie, et al.
Published: (2025)
VG-CoT: Towards Trustworthy Visual Reasoning via Grounded Chain-of-Thought
by: Lim, Byeonggeuk, et al.
Published: (2026)