Saved in:
| Main Authors: | Cao, Ang, Arnaud, Sergio, Maksymets, Oleksandr, Yang, Jianing, Jain, Ayush, Yenamandra, Sriram, Martin, Ada, Berges, Vincent-Pierre, McVay, Paul, Partsey, Ruslan, Rajeswaran, Aravind, Meier, Franziska, Johnson, Justin, Park, Jeong Joon, Sax, Alexander |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.20389 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D
by: Arnaud, Sergio, et al.
Published: (2025)
by: Arnaud, Sergio, et al.
Published: (2025)
What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?
by: Silwal, Sneha, et al.
Published: (2023)
by: Silwal, Sneha, et al.
Published: (2023)
Unifying 2D and 3D Vision-Language Understanding
by: Jain, Ayush, et al.
Published: (2025)
by: Jain, Ayush, et al.
Published: (2025)
Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
by: Majumdar, Arjun, et al.
Published: (2023)
by: Majumdar, Arjun, et al.
Published: (2023)
An Exploratory Study of Undergraduate Students' Perceptions of Visualization and Visualization Ability in Biochemistry
by: Andrew McVay, et al.
Published: (2026)
by: Andrew McVay, et al.
Published: (2026)
On Linear Separability under Linear Compression with Applications to Hard Support Vector Machine
by: McVay, Paul, et al.
Published: (2022)
by: McVay, Paul, et al.
Published: (2022)
HomeRobot Open Vocabulary Mobile Manipulation Challenge 2023 Participant Report (Team KuzHum)
by: Kuzma, Volodymyr, et al.
Published: (2024)
by: Kuzma, Volodymyr, et al.
Published: (2024)
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
by: Yang, Jianing, et al.
Published: (2025)
by: Yang, Jianing, et al.
Published: (2025)
From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control
by: Shentu, Yide, et al.
Published: (2024)
by: Shentu, Yide, et al.
Published: (2024)
MoDem-V2: Visuo-Motor World Models for Real-World Robot Manipulation
by: Lancaster, Patrick, et al.
Published: (2023)
by: Lancaster, Patrick, et al.
Published: (2023)
RetinaGS: Scalable Training for Dense Scene Rendering with Billion-Scale 3D Gaussians
by: Li, Bingling, et al.
Published: (2024)
by: Li, Bingling, et al.
Published: (2024)
Exploring potential reach and representativeness of a self‐weighing weight gain prevention intervention in adults with overweight and obesity
by: Kellie B. Scotti, et al.
Published: (2024)
by: Kellie B. Scotti, et al.
Published: (2024)
Are VLMs Really Blind
by: Singh, Ayush, et al.
Published: (2024)
by: Singh, Ayush, et al.
Published: (2024)
Semi-Supervised One-Shot Imitation Learning
by: Wu, Philipp, et al.
Published: (2024)
by: Wu, Philipp, et al.
Published: (2024)
Exploring the Acceptability of the STOP Method for Addressing Weight Loss Misinformation on Social Media: An Interview Study
by: Danielle E. Jake‐Schoffman, et al.
Published: (2025)
by: Danielle E. Jake‐Schoffman, et al.
Published: (2025)
An Image Is Worth Ten Thousand Words: Verbose-Text Induction Attacks on VLMs
by: Luo, Zhi, et al.
Published: (2025)
by: Luo, Zhi, et al.
Published: (2025)
On Place, Well-Being, and Illness in the Andes
by: Marieka Sax
Published: (2015)
by: Marieka Sax
Published: (2015)
The Boy Problem: Many Boys Think School Is Stupid and Reading Stinks--Is There a Remedy?
by: Sax, Leonard
Published: (2007)
by: Sax, Leonard
Published: (2007)
Efficient Camera-Controlled Video Generation of Static Scenes via Sparse Diffusion and 3D Rendering
by: Chen, Jieying, et al.
Published: (2026)
by: Chen, Jieying, et al.
Published: (2026)
MoE3D: A Mixture-of-Experts Module for 3D Reconstruction
by: Wang, Zichen, et al.
Published: (2026)
by: Wang, Zichen, et al.
Published: (2026)
TouchMap-OR: Multi-View 3D Mapping of Hand-Surface Contacts
by: Ktistakis, Sophokles, et al.
Published: (2026)
by: Ktistakis, Sophokles, et al.
Published: (2026)
The Green ring of a family of copointed Hopf algebras
by: Vay, Cristian
Published: (2022)
by: Vay, Cristian
Published: (2022)
Linkage principle for small quantum groups
by: Vay, Cristian
Published: (2023)
by: Vay, Cristian
Published: (2023)
Polyploidy in Enkianthus (Ericaceae)
by: Sax, Hally Jolivette
Published: (1960)
by: Sax, Hally Jolivette
Published: (1960)
Bias in the Picture: Benchmarking VLMs with Social-Cue News Images and LLM-as-Judge Assessment
by: Narayanan, Aravind, et al.
Published: (2025)
by: Narayanan, Aravind, et al.
Published: (2025)
Agents Play Thousands of 3D Video Games
by: Xu, Zhongwen, et al.
Published: (2025)
by: Xu, Zhongwen, et al.
Published: (2025)
Probing Visual Language Priors in VLMs
by: Luo, Tiange, et al.
Published: (2024)
by: Luo, Tiange, et al.
Published: (2024)
Chitrarth: Bridging Vision and Language for a Billion People
by: Khan, Shaharukh, et al.
Published: (2025)
by: Khan, Shaharukh, et al.
Published: (2025)
Importance of Developing Emotional Intelligence in Preventing Addiction Syndrome
by: Viktoriia Mendelo, et al.
Published: (2024)
by: Viktoriia Mendelo, et al.
Published: (2024)
Stimpack: An Adaptive Rendering Optimization System for Scalable Cloud Gaming
by: Heo, Jin, et al.
Published: (2024)
by: Heo, Jin, et al.
Published: (2024)
The Prediction of Training Proficiency in Firefighters: A Study of Predictive Validity in Spain
by: Alfredo Berges
Published: (2018)
by: Alfredo Berges
Published: (2018)
Justice and Righteousness in the Old Testament
by: Berges, Ulrich
Published: (2025)
by: Berges, Ulrich
Published: (2025)
Pattern-Based Phase-Separation of Tracer and Dispersed Phase Particles in Two-Phase Defocusing Particle Tracking Velocimetry
by: Sax, Christian, et al.
Published: (2025)
by: Sax, Christian, et al.
Published: (2025)
On the Particle Image Overlap in Single Camera Defocusing Approaches
by: Sax, Christian, et al.
Published: (2025)
by: Sax, Christian, et al.
Published: (2025)
Off-Diagonal Continuous Rado Numbers $x_1 + x_2 + \dots + x_k = x_0$
by: Vestal, Don, et al.
Published: (2025)
by: Vestal, Don, et al.
Published: (2025)
R4: Retrieval-Augmented Reasoning for Vision-Language Models in 4D Spatio-Temporal Space
by: Sohn, Tin Stribor, et al.
Published: (2025)
by: Sohn, Tin Stribor, et al.
Published: (2025)
Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs
by: Li, Haoyuan, et al.
Published: (2025)
by: Li, Haoyuan, et al.
Published: (2025)
GOAT-Bench: A Benchmark for Multi-Modal Lifelong Navigation
by: Khanna, Mukul, et al.
Published: (2024)
by: Khanna, Mukul, et al.
Published: (2024)
GaussRender: Learning 3D Occupancy with Gaussian Rendering
by: Chambon, Loïck, et al.
Published: (2025)
by: Chambon, Loïck, et al.
Published: (2025)
RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision
by: Pan, Mingjie, et al.
Published: (2023)
by: Pan, Mingjie, et al.
Published: (2023)
Similar Items
-
Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D
by: Arnaud, Sergio, et al.
Published: (2025) -
What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?
by: Silwal, Sneha, et al.
Published: (2023) -
Unifying 2D and 3D Vision-Language Understanding
by: Jain, Ayush, et al.
Published: (2025) -
Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence?
by: Majumdar, Arjun, et al.
Published: (2023) -
An Exploratory Study of Undergraduate Students' Perceptions of Visualization and Visualization Ability in Biochemistry
by: Andrew McVay, et al.
Published: (2026)