:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mariotti, Octave, Du, Zhipeng, Bhalgat, Yash, Mac Aodha, Oisin, Bilen, Hakan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2506.08220
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Improving Semantic Correspondence with Viewpoint-Guided Spherical Maps
by: Mariotti, Octave, et al.
Published: (2023)

Spatially-Adaptive Hash Encodings For Neural Surface Reconstruction
by: Walker, Thomas, et al.
Published: (2024)

DepthCues: Evaluating Monocular Depth Perception in Large Vision Models
by: Danier, Duolikun, et al.
Published: (2024)

View-Consistent Diffusion Representations for 3D-Consistent Video Generation
by: Danier, Duolikun, et al.
Published: (2025)

GeoGen: Geometry-Aware Generative Modeling via Signed Distance Functions
by: Esposito, Salvatore, et al.
Published: (2024)

Interpretable Text-Guided Image Clustering via Iterative Search
by: Zhao, Bingchen, et al.
Published: (2025)

SAOR: Single-View Articulated Object Reconstruction
by: Aygün, Mehmet, et al.
Published: (2023)

Less is More: Discovering Concise Network Explanations
by: Kondapaneni, Neehar, et al.
Published: (2024)

Labeled Data Selection for Category Discovery
by: Zhao, Bingchen, et al.
Published: (2024)

MoonSeg3R: Monocular Online Zero-Shot Segment Anything in 3D with Reconstructive Foundation Priors
by: Du, Zhipeng, et al.
Published: (2025)

Representational Similarity via Interpretable Visual Concepts
by: Kondapaneni, Neehar, et al.
Published: (2025)

Generating Binary Species Range Maps
by: Dorm, Filip, et al.
Published: (2024)

HumMorph: Generalized Dynamic Human Neural Fields from Few Views
by: Zadrożny, Jakub, et al.
Published: (2025)

Representational Difference Explanations
by: Kondapaneni, Neehar, et al.
Published: (2025)

Reproducibility Study of CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification
by: Shah, Manan, et al.
Published: (2024)

Enhancing 2D Representation Learning with a 3D Prior
by: Aygün, Mehmet, et al.
Published: (2024)

MotionPhysics: Learnable Motion Distillation for Text-Guided Simulation
by: Wang, Miaowei, et al.
Published: (2026)

CleverBirds: A Multiple-Choice Benchmark for Fine-grained Human Knowledge Tracing
by: Bossemeyer, Leonie, et al.
Published: (2025)

VesselSDF: Distance Field Priors for Vascular Network Reconstruction
by: Esposito, Salvatore, et al.
Published: (2025)

Universal representations:The missing link between faces, text, planktons, and cat breeds
by: Bilen, Hakan, et al.
Published: (2017)

Active View Selector: Fast and Accurate Active View Selection with Cross Reference Image Quality Assessment
by: Wang, Zirui, et al.
Published: (2025)

Sample-efficient Integration of New Modalities into Large Language Models
by: İnce, Osman Batur, et al.
Published: (2025)

Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors
by: Tsagkas, Nikolaos, et al.
Published: (2024)

CrossSDF: 3D Reconstruction of Thin Structures From Cross-Sections
by: Walker, Thomas, et al.
Published: (2024)

PAOLI: Pose-free Articulated Object Learning from Sparse-view Images
by: Deng, Jianning, et al.
Published: (2025)

Visually Interpretable Subtask Reasoning for Visual Question Answering
by: Cheng, Yu, et al.
Published: (2025)

Odd-One-Out: Anomaly Detection by Comparing with Neighbors
by: Bhunia, Ankan, et al.
Published: (2024)

Articulate your NeRF: Unsupervised articulated object modeling via conditional view synthesis
by: Deng, Jianning, et al.
Published: (2024)

Looking 3D: Anomaly Detection with 2D-3D Alignment
by: Bhunia, Ankan, et al.
Published: (2024)

BiMotion: B-spline Motion for Text-guided Dynamic 3D Character Generation
by: Wang, Miaowei, et al.
Published: (2026)

WildSAT: Learning Satellite Image Representations from Wildlife Observations
by: Daroya, Rangel, et al.
Published: (2024)

The Temporal Trap: Entanglement in Pre-Trained Visual Representations for Visuomotor Policy Learning
by: Tsagkas, Nikolaos, et al.
Published: (2025)

AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings
by: Watson, Jamie, et al.
Published: (2024)

Attentive Feature Aggregation or: How Policies Learn to Stop Worrying about Robustness and Attend to Task-Relevant Visual Cues
by: Tsagkas, Nikolaos, et al.
Published: (2025)

Vision Learners Meet Web Image-Text Pairs
by: Zhao, Bingchen, et al.
Published: (2023)

Coarse or Fine? Recognising Action End States without Labels
by: Moltisanti, Davide, et al.
Published: (2024)

MVSAnywhere: Zero-Shot Multi-View Stereo
by: Izquierdo, Sergio, et al.
Published: (2025)

Reflecting Reality: Enabling Diffusion Models to Produce Faithful Mirror Reflections
by: Dhiman, Ankit, et al.
Published: (2024)

Neural Refinement for Absolute Pose Regression with Feature Synthesis
by: Chen, Shuai, et al.
Published: (2023)

SiLVR: Scalable Lidar-Visual Reconstruction with Neural Radiance Fields for Robotic Inspection
by: Tao, Yifu, et al.
Published: (2024)