Guardado en:
| Autores principales: | Li, Chengshu, Liang, Jacky, Zeng, Andy, Chen, Xinyun, Hausman, Karol, Sadigh, Dorsa, Levine, Sergey, Fei-Fei, Li, Xia, Fei, Ichter, Brian |
|---|---|
| Formato: | Preprint |
| Publicado: |
2023
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2312.04474 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
por: Chen, Boyuan, et al.
Publicado: (2024)
por: Chen, Boyuan, et al.
Publicado: (2024)
Physically Grounded Vision-Language Models for Robotic Manipulation
por: Gao, Jensen, et al.
Publicado: (2023)
por: Gao, Jensen, et al.
Publicado: (2023)
PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
por: Nasiriany, Soroush, et al.
Publicado: (2024)
por: Nasiriany, Soroush, et al.
Publicado: (2024)
GenCHiP: Generating Robot Policy Code for High-Precision and Contact-Rich Manipulation Tasks
por: Burns, Kaylee, et al.
Publicado: (2024)
por: Burns, Kaylee, et al.
Publicado: (2024)
Generative Expressive Robot Behaviors using Large Language Models
por: Mahadevan, Karthik, et al.
Publicado: (2024)
por: Mahadevan, Karthik, et al.
Publicado: (2024)
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections
por: Zha, Lihan, et al.
Publicado: (2023)
por: Zha, Lihan, et al.
Publicado: (2023)
AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents
por: Ahn, Michael, et al.
Publicado: (2024)
por: Ahn, Michael, et al.
Publicado: (2024)
Bridging Perception and Action: Spatially-Grounded Mid-Level Representations for Robot Generalization
por: Yang, Jonathan, et al.
Publicado: (2025)
por: Yang, Jonathan, et al.
Publicado: (2025)
Chain-of-Modality: Learning Manipulation Programs from Multimodal Human Videos with Vision-Language-Models
por: Wang, Chen, et al.
Publicado: (2025)
por: Wang, Chen, et al.
Publicado: (2025)
Vocal Sandbox: Continual Learning and Adaptation for Situated Human-Robot Collaboration
por: Grannen, Jennifer, et al.
Publicado: (2024)
por: Grannen, Jennifer, et al.
Publicado: (2024)
ProVox: Personalization and Proactive Planning for Situated Human-Robot Collaboration
por: Grannen, Jennifer, et al.
Publicado: (2025)
por: Grannen, Jennifer, et al.
Publicado: (2025)
Grounding Robot Generalization in Training Data via Retrieval-Augmented VLMs
por: Gao, Jensen, et al.
Publicado: (2026)
por: Gao, Jensen, et al.
Publicado: (2026)
How to Train Your Robots? The Impact of Demonstration Modality on Imitation Learning
por: Li, Haozhuo, et al.
Publicado: (2025)
por: Li, Haozhuo, et al.
Publicado: (2025)
Efficiently Generating Expressive Quadruped Behaviors via Language-Guided Preference Learning
por: Clark, Jaden, et al.
Publicado: (2025)
por: Clark, Jaden, et al.
Publicado: (2025)
Action-Free Reasoning for Policy Generalization
por: Clark, Jaden, et al.
Publicado: (2025)
por: Clark, Jaden, et al.
Publicado: (2025)
HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies
por: Xie, Amber, et al.
Publicado: (2026)
por: Xie, Amber, et al.
Publicado: (2026)
CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments
por: Sathyamoorthy, Adarsh Jagan, et al.
Publicado: (2024)
por: Sathyamoorthy, Adarsh Jagan, et al.
Publicado: (2024)
Precise Robot Command Understanding Using Grammar-Constrained Large Language Models
por: Huo, Xinyun, et al.
Publicado: (2026)
por: Huo, Xinyun, et al.
Publicado: (2026)
RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
por: Sundaresan, Priya, et al.
Publicado: (2024)
por: Sundaresan, Priya, et al.
Publicado: (2024)
Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning Code LLMs
por: Hu, Zichao, et al.
Publicado: (2024)
por: Hu, Zichao, et al.
Publicado: (2024)
Batch Active Learning of Reward Functions from Human Preferences
por: Bıyık, Erdem, et al.
Publicado: (2024)
por: Bıyık, Erdem, et al.
Publicado: (2024)
Unified Video Action Model
por: Li, Shuang, et al.
Publicado: (2025)
por: Li, Shuang, et al.
Publicado: (2025)
SteerVLA: Steering Vision-Language-Action Models in Long-Tail Driving Scenarios
por: Gao, Tian, et al.
Publicado: (2026)
por: Gao, Tian, et al.
Publicado: (2026)
Joint Action Language Modelling for Transparent Policy Execution
por: Wulff, Theodor, et al.
Publicado: (2025)
por: Wulff, Theodor, et al.
Publicado: (2025)
Data Analogies Enable Efficient Cross-Embodiment Transfer
por: Yang, Jonathan, et al.
Publicado: (2026)
por: Yang, Jonathan, et al.
Publicado: (2026)
Toward Grounded Commonsense Reasoning
por: Kwon, Minae, et al.
Publicado: (2023)
por: Kwon, Minae, et al.
Publicado: (2023)
FAST: Efficient Action Tokenization for Vision-Language-Action Models
por: Pertsch, Karl, et al.
Publicado: (2025)
por: Pertsch, Karl, et al.
Publicado: (2025)
Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation
por: Yang, Jonathan, et al.
Publicado: (2024)
por: Yang, Jonathan, et al.
Publicado: (2024)
Will People Enjoy a Robot Trainer? A Case Study with Snoopie the Pacerbot
por: Du, Maximilian, et al.
Publicado: (2026)
por: Du, Maximilian, et al.
Publicado: (2026)
Invariance Co-training for Robot Visual Generalization
por: Yang, Jonathan, et al.
Publicado: (2025)
por: Yang, Jonathan, et al.
Publicado: (2025)
Language Guided Skill Discovery
por: Rho, Seungeun, et al.
Publicado: (2024)
por: Rho, Seungeun, et al.
Publicado: (2024)
Towards No-Code Programming of Cobots: Experiments with Code Synthesis by Large Code Models for Conversational Programming
por: Kranti, Chalamalasetti, et al.
Publicado: (2024)
por: Kranti, Chalamalasetti, et al.
Publicado: (2024)
Vision Language Models are In-Context Value Learners
por: Ma, Yecheng Jason, et al.
Publicado: (2024)
por: Ma, Yecheng Jason, et al.
Publicado: (2024)
Latent Diffusion Planning for Imitation Learning
por: Xie, Amber, et al.
Publicado: (2025)
por: Xie, Amber, et al.
Publicado: (2025)
Data Retrieval with Importance Weights for Few-Shot Imitation Learning
por: Xie, Amber, et al.
Publicado: (2025)
por: Xie, Amber, et al.
Publicado: (2025)
What Matters for Batch Online Reinforcement Learning in Robotics?
por: Dong, Perry, et al.
Publicado: (2025)
por: Dong, Perry, et al.
Publicado: (2025)
AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving
por: Qian, Kangan, et al.
Publicado: (2025)
por: Qian, Kangan, et al.
Publicado: (2025)
UAD: Unsupervised Affordance Distillation for Generalization in Robotic Manipulation
por: Tang, Yihe, et al.
Publicado: (2025)
por: Tang, Yihe, et al.
Publicado: (2025)
EXPO-FT: Sample-Efficient Reinforcement Learning Finetuning for Vision-Language-Action Models
por: Dong, Perry, et al.
Publicado: (2026)
por: Dong, Perry, et al.
Publicado: (2026)
IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation
por: Qu, Kaixian, et al.
Publicado: (2024)
por: Qu, Kaixian, et al.
Publicado: (2024)
Ejemplares similares
-
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
por: Chen, Boyuan, et al.
Publicado: (2024) -
Physically Grounded Vision-Language Models for Robotic Manipulation
por: Gao, Jensen, et al.
Publicado: (2023) -
PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
por: Nasiriany, Soroush, et al.
Publicado: (2024) -
GenCHiP: Generating Robot Policy Code for High-Precision and Contact-Rich Manipulation Tasks
por: Burns, Kaylee, et al.
Publicado: (2024) -
Generative Expressive Robot Behaviors using Large Language Models
por: Mahadevan, Karthik, et al.
Publicado: (2024)