Bibliographic Details
Main Authors:	Cao, Ang, Arnaud, Sergio, Maksymets, Oleksandr, Yang, Jianing, Jain, Ayush, Yenamandra, Sriram, Martin, Ada, Berges, Vincent-Pierre, McVay, Paul, Partsey, Ruslan, Rajeswaran, Aravind, Meier, Franziska, Johnson, Justin, Park, Jeong Joon, Sax, Alexander
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2502.20389

Similar Items

Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D
by: Arnaud, Sergio, et al.
Published: (2025)

What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?
by: Silwal, Sneha, et al.
Published: (2023)

Unifying 2D and 3D Vision-Language Understanding
by: Jain, Ayush, et al.
Published: (2025)

Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
by: Majumdar, Arjun, et al.
Published: (2023)

An Exploratory Study of Undergraduate Students' Perceptions of Visualization and Visualization Ability in Biochemistry
by: Andrew McVay, et al.
Published: (2026)

On Linear Separability under Linear Compression with Applications to Hard Support Vector Machine
by: McVay, Paul, et al.
Published: (2022)

HomeRobot Open Vocabulary Mobile Manipulation Challenge 2023 Participant Report (Team KuzHum)
by: Kuzma, Volodymyr, et al.
Published: (2024)

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
by: Yang, Jianing, et al.
Published: (2025)

From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control
by: Shentu, Yide, et al.
Published: (2024)

MoDem-V2: Visuo-Motor World Models for Real-World Robot Manipulation
by: Lancaster, Patrick, et al.
Published: (2023)

RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians
by: Li, Bingling, et al.
Published: (2024)

Exploring potential reach and representativeness of a self‐weighing weight gain prevention intervention in adults with overweight and obesity
by: Kellie B. Scotti, et al.
Published: (2024)

Are VLMs Really Blind
by: Singh, Ayush, et al.
Published: (2024)

Semi-Supervised One-Shot Imitation Learning
by: Wu, Philipp, et al.
Published: (2024)

Exploring the Acceptability of the STOP Method for Addressing Weight Loss Misinformation on Social Media: An Interview Study
by: Danielle E. Jake‐Schoffman, et al.
Published: (2025)

An Image Is Worth Ten Thousand Words: Verbose-Text Induction Attacks on VLMs
by: Luo, Zhi, et al.
Published: (2025)

On Place, Well-Being, and Illness in the Andes
by: Marieka Sax
Published: (2015)

The Boy Problem: Many Boys Think School Is Stupid and Reading Stinks--Is There a Remedy?
by: Sax, Leonard
Published: (2007)

Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering
by: Chen, Jieying, et al.
Published: (2026)

MoE3D: A Mixture-of-Experts Module for 3D Reconstruction
by: Wang, Zichen, et al.
Published: (2026)

TouchMap-OR: Multi-View 3D Mapping of Hand-Surface Contacts
by: Ktistakis, Sophokles, et al.
Published: (2026)

The Green ring of a family of copointed Hopf algebras
by: Vay, Cristian
Published: (2022)

Linkage principle for small quantum groups
by: Vay, Cristian
Published: (2023)

Polyploidy in Enkianthus (Ericaceae)
by: Sax, Hally Jolivette
Published: (1960)

Bias in the Picture: Benchmarking VLMs with Social-Cue News Images and LLM-as-Judge Assessment
by: Narayanan, Aravind, et al.
Published: (2025)

Agents Play Thousands of 3D Video Games
by: Xu, Zhongwen, et al.
Published: (2025)

Probing Visual Language Priors in VLMs
by: Luo, Tiange, et al.
Published: (2024)

Chitrarth: Bridging Vision and Language for a Billion People
by: Khan, Shaharukh, et al.
Published: (2025)

Importance of Developing Emotional Intelligence in Preventing Addiction Syndrome
by: Viktoriia Mendelo, et al.
Published: (2024)

Stimpack: An Adaptive Rendering Optimization System for Scalable Cloud Gaming
by: Heo, Jin, et al.
Published: (2024)

The Prediction of Training Proficiency in Firefighters: A Study of Predictive Validity in Spain
by: Alfredo Berges
Published: (2018)

Justice and Righteousness in the Old Testament
by: Berges, Ulrich
Published: (2025)

Pattern-Based Phase-Separation of Tracer and Dispersed Phase Particles in Two-Phase Defocusing Particle Tracking Velocimetry
by: Sax, Christian, et al.
Published: (2025)

On the Particle Image Overlap in Single Camera Defocusing Approaches
by: Sax, Christian, et al.
Published: (2025)

Off-Diagonal Continuous Rado Numbers $x_1 + x_2 + \dots + x_k = x_0$
by: Vestal, Don, et al.
Published: (2025)

R4: Retrieval-Augmented Reasoning for Vision-Language Models in 4D Spatio-Temporal Space
by: Sohn, Tin Stribor, et al.
Published: (2025)

Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs
by: Li, Haoyuan, et al.
Published: (2025)

GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation
by: Khanna, Mukul, et al.
Published: (2024)

GaussRender: Learning 3D Occupancy with Gaussian Rendering
by: Chambon, Loïck, et al.
Published: (2025)

RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision
by: Pan, Mingjie, et al.
Published: (2023)