Saved in:
Bibliographic Details
Main Authors: Xiang, Tianyi, Cao, Jiahang, Guo, Sikai, Zhao, Guoyang, Luo, Andrew F., Ma, Jun
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.12633
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913135444623360
author Xiang, Tianyi
Cao, Jiahang
Guo, Sikai
Zhao, Guoyang
Luo, Andrew F.
Ma, Jun
author_facet Xiang, Tianyi
Cao, Jiahang
Guo, Sikai
Zhao, Guoyang
Luo, Andrew F.
Ma, Jun
contents Reconstructing physically valid 3D scenes from single-view observations is a prerequisite for bridging the gap between visual perception and robotic control. However, in scenarios requiring precise contact reasoning, such as robotic manipulation in highly cluttered environments, geometric fidelity alone is insufficient. Standard perception pipelines often neglect physical constraints, resulting in invalid states, e.g., floating objects or severe inter-penetration, rendering downstream simulation unreliable. To address these limitations, we propose a novel physics-constrained Real-to-Sim pipeline that reconstructs physically consistent 3D scenes from single-view RGB-D data. Central to our approach is a differentiable optimization pipeline that explicitly models spatial dependencies via a contact graph, jointly refining object poses and physical properties through differentiable rigid-body simulation. Extensive evaluations in both simulation and real-world settings demonstrate that our reconstructed scenes achieve high physical fidelity and faithfully replicate real-world contact dynamics, enabling stable and reliable contact-rich manipulation.
format Preprint
id arxiv_https___arxiv_org_abs_2602_12633
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Real-to-Sim for Highly Cluttered Environments via Physics-Consistent Inter-Object Reasoning
Xiang, Tianyi
Cao, Jiahang
Guo, Sikai
Zhao, Guoyang
Luo, Andrew F.
Ma, Jun
Robotics
Reconstructing physically valid 3D scenes from single-view observations is a prerequisite for bridging the gap between visual perception and robotic control. However, in scenarios requiring precise contact reasoning, such as robotic manipulation in highly cluttered environments, geometric fidelity alone is insufficient. Standard perception pipelines often neglect physical constraints, resulting in invalid states, e.g., floating objects or severe inter-penetration, rendering downstream simulation unreliable. To address these limitations, we propose a novel physics-constrained Real-to-Sim pipeline that reconstructs physically consistent 3D scenes from single-view RGB-D data. Central to our approach is a differentiable optimization pipeline that explicitly models spatial dependencies via a contact graph, jointly refining object poses and physical properties through differentiable rigid-body simulation. Extensive evaluations in both simulation and real-world settings demonstrate that our reconstructed scenes achieve high physical fidelity and faithfully replicate real-world contact dynamics, enabling stable and reliable contact-rich manipulation.
title Real-to-Sim for Highly Cluttered Environments via Physics-Consistent Inter-Object Reasoning
topic Robotics
url https://arxiv.org/abs/2602.12633