| Main Authors: | Mariotti, Octave, Du, Zhipeng, Bhalgat, Yash, Mac Aodha, Oisin, Bilen, Hakan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.08220 |
Similar Items
Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps
by: Mariotti, Octave, et al.
Published: (2023)
Spatially-Adaptive Hash Encodings For Neural Surface Reconstruction
by: Walker, Thomas, et al.
Published: (2024)
DepthCues: Evaluating Monocular Depth Perception in Large Vision Models
by: Danier, Duolikun, et al.
Published: (2024)
View-Consistent Diffusion Representations for 3D-Consistent Video Generation
by: Danier, Duolikun, et al.
Published: (2025)
GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions
by: Esposito, Salvatore, et al.
Published: (2024)
Interpretable Text-Guided Image Clustering via Iterative Search
by: Zhao, Bingchen, et al.
Published: (2025)
SAOR: Single-View Articulated Object Reconstruction
by: Aygün, Mehmet, et al.
Published: (2023)
Less is More: Discovering Concise Network Explanations
by: Kondapaneni, Neehar, et al.
Published: (2024)
Labeled Data Selection for Category Discovery
by: Zhao, Bingchen, et al.
Published: (2024)
MoonSeg3R: Monocular Online Zero-Shot Segment Anything in 3D with Reconstructive Foundation Priors
by: Du, Zhipeng, et al.
Published: (2025)
Representational Similarity via Interpretable Visual Concepts
by: Kondapaneni, Neehar, et al.
Published: (2025)
Generating Binary Species Range Maps
by: Dorm, Filip, et al.
Published: (2024)
HumMorph: Generalized Dynamic Human Neural Fields from Few Views
by: Zadrożny, Jakub, et al.
Published: (2025)
Representational Difference Explanations
by: Kondapaneni, Neehar, et al.
Published: (2025)
Reproducibility Study of CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification
by: Shah, Manan, et al.
Published: (2024)
Enhancing 2D Representation Learning with a 3D Prior
by: Aygün, Mehmet, et al.
Published: (2024)
MotionPhysics: Learnable Motion Distillation for Text-Guided Simulation
by: Wang, Miaowei, et al.
Published: (2026)
CleverBirds: A Multiple-Choice Benchmark for Fine-grained Human Knowledge Tracing
by: Bossemeyer, Leonie, et al.
Published: (2025)
VesselSDF: Distance Field Priors for Vascular Network Reconstruction
by: Esposito, Salvatore, et al.
Published: (2025)
Universal representations: The missing link between faces, text, planktons, and cat breeds
by: Bilen, Hakan, et al.
Published: (2017)
Active View Selector: Fast and Accurate Active View Selection with Cross Reference Image Quality Assessment
by: Wang, Zirui, et al.
Published: (2025)
Sample-efficient Integration of New Modalities into Large Language Models
by: İnce, Osman Batur, et al.
Published: (2025)
Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors
by: Tsagkas, Nikolaos, et al.
Published: (2024)
CrossSDF: 3D Reconstruction of Thin Structures From Cross-Sections
by: Walker, Thomas, et al.
Published: (2024)
PAOLI: Pose-free Articulated Object Learning from Sparse-view Images
by: Deng, Jianning, et al.
Published: (2025)
Visually Interpretable Subtask Reasoning for Visual Question Answering
by: Cheng, Yu, et al.
Published: (2025)
Odd-One-Out: Anomaly Detection by Comparing with Neighbors
by: Bhunia, Ankan, et al.
Published: (2024)
Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis
by: Deng, Jianning, et al.
Published: (2024)
Looking 3D: Anomaly Detection with 2D-3D Alignment
by: Bhunia, Ankan, et al.
Published: (2024)
BiMotion: B-spline Motion for Text-guided Dynamic 3D Character Generation
by: Wang, Miaowei, et al.
Published: (2026)
WildSAT: Learning Satellite Image Representations from Wildlife Observations
by: Daroya, Rangel, et al.
Published: (2024)
The Temporal Trap: Entanglement in Pre-Trained Visual Representations for Visuomotor Policy Learning
by: Tsagkas, Nikolaos, et al.
Published: (2025)
AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings
by: Watson, Jamie, et al.
Published: (2024)
Attentive Feature Aggregation or: How Policies Learn to Stop Worrying about Robustness and Attend to Task-Relevant Visual Cues
by: Tsagkas, Nikolaos, et al.
Published: (2025)
Vision Learners Meet Web Image-Text Pairs
by: Zhao, Bingchen, et al.
Published: (2023)
Coarse or Fine? Recognising Action End States without Labels
by: Moltisanti, Davide, et al.
Published: (2024)
MVSAnywhere: Zero-Shot Multi-View Stereo
by: Izquierdo, Sergio, et al.
Published: (2025)
Reflecting Reality: Enabling Diffusion Models to Produce Faithful Mirror Reflections
by: Dhiman, Ankit, et al.
Published: (2024)
Neural Refinement for Absolute Pose Regression with Feature Synthesis
by: Chen, Shuai, et al.
Published: (2023)
SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection
by: Tao, Yifu, et al.
Published: (2024)