Saved in:
| Main Authors: | Zeng, Hongliang, Zhang, Ping, Wu, Chengjiong, Wang, Jiahua, Ye, Tingyu, Li, Fang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.01191 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Masked Generative Extractor for Synergistic Representation and 3D Generation of Point Clouds
by: Zeng, Hongliang, et al.
Published: (2024)
by: Zeng, Hongliang, et al.
Published: (2024)
Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge
by: Kang, Li, et al.
Published: (2026)
by: Kang, Li, et al.
Published: (2026)
FlowBot3D: Learning 3D Articulation Flow to Manipulate Articulated Objects
by: Eisner, Ben, et al.
Published: (2022)
by: Eisner, Ben, et al.
Published: (2022)
SpatialAnt: Autonomous Zero-Shot Robot Navigation via Active Scene Reconstruction and Visual Anticipation
by: Zhang, Jiwen, et al.
Published: (2026)
by: Zhang, Jiwen, et al.
Published: (2026)
RPMArt: Towards Robust Perception and Manipulation for Articulated Objects
by: Wang, Junbo, et al.
Published: (2024)
by: Wang, Junbo, et al.
Published: (2024)
GAMORA: A Gesture Articulated Meta Operative Robotic Arm for Hazardous Material Handling in Containment-Level Environments
by: Wasay, Farha Abdul, et al.
Published: (2025)
by: Wasay, Farha Abdul, et al.
Published: (2025)
SynHLMA:Synthesizing Hand Language Manipulation for Articulated Object with Discrete Human Object Interaction Representation
by: zhi, Wang, et al.
Published: (2025)
by: zhi, Wang, et al.
Published: (2025)
V$^2$-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy
by: Bai, Long, et al.
Published: (2024)
by: Bai, Long, et al.
Published: (2024)
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis
by: Mu, Yao, et al.
Published: (2024)
by: Mu, Yao, et al.
Published: (2024)
OSSAR: Towards Open-Set Surgical Activity Recognition in Robot-assisted Surgery
by: Bai, Long, et al.
Published: (2024)
by: Bai, Long, et al.
Published: (2024)
ArtFormer: Controllable Generation of Diverse 3D Articulated Objects
by: Su, Jiayi, et al.
Published: (2024)
by: Su, Jiayi, et al.
Published: (2024)
Learning Environment-Aware Affordance for 3D Articulated Object Manipulation under Occlusions
by: Wu, Ruihai, et al.
Published: (2023)
by: Wu, Ruihai, et al.
Published: (2023)
Opening Articulated Structures in the Real World
by: Gupta, Arjun, et al.
Published: (2024)
by: Gupta, Arjun, et al.
Published: (2024)
Robix: A Unified Model for Robot Interaction, Reasoning and Planning
by: Fang, Huang, et al.
Published: (2025)
by: Fang, Huang, et al.
Published: (2025)
Part-Guided 3D RL for Sim2Real Articulated Object Manipulation
by: Xie, Pengwei, et al.
Published: (2024)
by: Xie, Pengwei, et al.
Published: (2024)
Articulated 3D Scene Graphs for Open-World Mobile Manipulation
by: Büchner, Martin, et al.
Published: (2026)
by: Büchner, Martin, et al.
Published: (2026)
Humanoid Occupancy: Enabling A Generalized Multimodal Occupancy Perception System on Humanoid Robots
by: Cui, Wei, et al.
Published: (2025)
by: Cui, Wei, et al.
Published: (2025)
Distracted Robot: How Visual Clutter Undermine Robotic Manipulation
by: Rasouli, Amir, et al.
Published: (2025)
by: Rasouli, Amir, et al.
Published: (2025)
VideoArtGS: Building Digital Twins of Articulated Objects from Monocular Video
by: Liu, Yu, et al.
Published: (2025)
by: Liu, Yu, et al.
Published: (2025)
UAD: Unsupervised Affordance Distillation for Generalization in Robotic Manipulation
by: Tang, Yihe, et al.
Published: (2025)
by: Tang, Yihe, et al.
Published: (2025)
CAPT: Category-level Articulation Estimation from a Single Point Cloud Using Transformer
by: Fu, Lian, et al.
Published: (2024)
by: Fu, Lian, et al.
Published: (2024)
InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy
by: Chen, Xinyi, et al.
Published: (2025)
by: Chen, Xinyi, et al.
Published: (2025)
Neural Implicit Representation for Building Digital Twins of Unknown Articulated Objects
by: Weng, Yijia, et al.
Published: (2024)
by: Weng, Yijia, et al.
Published: (2024)
RGB-only Active 3D Scene Graph Generation for Indoor Mobile Robots
by: Modi, Giorgia, et al.
Published: (2026)
by: Modi, Giorgia, et al.
Published: (2026)
Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning
by: Liu, Yijun, et al.
Published: (2025)
by: Liu, Yijun, et al.
Published: (2025)
ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation
by: Zhang, Zekai, et al.
Published: (2025)
by: Zhang, Zekai, et al.
Published: (2025)
DiLA: Disentangled Latent Action World Models
by: Zhang, Tianqiu, et al.
Published: (2026)
by: Zhang, Tianqiu, et al.
Published: (2026)
RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation
by: Wang, Boyang, et al.
Published: (2026)
by: Wang, Boyang, et al.
Published: (2026)
Recognizing Actions from Robotic View for Natural Human-Robot Interaction
by: Wang, Ziyi, et al.
Published: (2025)
by: Wang, Ziyi, et al.
Published: (2025)
ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
by: Huang, Wenlong, et al.
Published: (2024)
by: Huang, Wenlong, et al.
Published: (2024)
HiRT: Enhancing Robotic Control with Hierarchical Robot Transformers
by: Zhang, Jianke, et al.
Published: (2024)
by: Zhang, Jianke, et al.
Published: (2024)
VitaTouch: Property-Aware Vision-Tactile-Language Model for Robotic Quality Inspection in Manufacturing
by: Zong, Junyi, et al.
Published: (2026)
by: Zong, Junyi, et al.
Published: (2026)
VLM See, Robot Do: Human Demo Video to Robot Action Plan via Vision Language Model
by: Wang, Beichen, et al.
Published: (2024)
by: Wang, Beichen, et al.
Published: (2024)
On-Device Diffusion Transformer Policy for Efficient Robot Manipulation
by: Wu, Yiming, et al.
Published: (2025)
by: Wu, Yiming, et al.
Published: (2025)
FlowHOI: Flow-based Semantics-Grounded Generation of Hand-Object Interactions for Dexterous Robot Manipulation
by: Zeng, Huajian, et al.
Published: (2026)
by: Zeng, Huajian, et al.
Published: (2026)
Robots Pre-train Robots: Manipulation-Centric Robotic Representation from Large-Scale Robot Datasets
by: Jiang, Guangqi, et al.
Published: (2024)
by: Jiang, Guangqi, et al.
Published: (2024)
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery
by: Wang, Guankun, et al.
Published: (2024)
by: Wang, Guankun, et al.
Published: (2024)
MCRL4OR: Multimodal Contrastive Representation Learning for Off-Road Environmental Perception
by: Yang, Yi, et al.
Published: (2025)
by: Yang, Yi, et al.
Published: (2025)
Embodied Tree of Thoughts: Deliberate Manipulation Planning with Embodied World Model
by: Xu, Wenjiang, et al.
Published: (2025)
by: Xu, Wenjiang, et al.
Published: (2025)
J-ORA: A Framework and Multimodal Dataset for Japanese Object Identification, Reference, Action Prediction in Robot Perception
by: Atuhurra, Jesse, et al.
Published: (2025)
by: Atuhurra, Jesse, et al.
Published: (2025)
Similar Items
-
Masked Generative Extractor for Synergistic Representation and 3D Generation of Point Clouds
by: Zeng, Hongliang, et al.
Published: (2024) -
Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge
by: Kang, Li, et al.
Published: (2026) -
FlowBot3D: Learning 3D Articulation Flow to Manipulate Articulated Objects
by: Eisner, Ben, et al.
Published: (2022) -
SpatialAnt: Autonomous Zero-Shot Robot Navigation via Active Scene Reconstruction and Visual Anticipation
by: Zhang, Jiwen, et al.
Published: (2026) -
RPMArt: Towards Robust Perception and Manipulation for Articulated Objects
by: Wang, Junbo, et al.
Published: (2024)