:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Li, Chengshu, Liang, Jacky, Zeng, Andy, Chen, Xinyun, Hausman, Karol, Sadigh, Dorsa, Levine, Sergey, Fei-Fei, Li, Xia, Fei, Ichter, Brian
Formato:	Preprint
Publicado:	2023
Materias:	Computation and Language Artificial Intelligence Machine Learning Robotics
Acceso en línea:	https://arxiv.org/abs/2312.04474
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities
por: Chen, Boyuan, et al.
Publicado: (2024)

Physically Grounded Vision-Language Models for Robotic Manipulation
por: Gao, Jensen, et al.
Publicado: (2023)

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
por: Nasiriany, Soroush, et al.
Publicado: (2024)

GenCHiP: Generating Robot Policy Code for High-Precision and Contact-Rich Manipulation Tasks
por: Burns, Kaylee, et al.
Publicado: (2024)

Generative Expressive Robot Behaviors using Large Language Models
por: Mahadevan, Karthik, et al.
Publicado: (2024)

Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections
por: Zha, Lihan, et al.
Publicado: (2023)

AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents
por: Ahn, Michael, et al.
Publicado: (2024)

Bridging Perception and Action: Spatially-Grounded Mid-Level Representations for Robot Generalization
por: Yang, Jonathan, et al.
Publicado: (2025)

Chain-of-Modality: Learning Manipulation Programs from Multimodal Human Videos with Vision-Language-Models
por: Wang, Chen, et al.
Publicado: (2025)

Vocal Sandbox: Continual Learning and Adaptation for Situated Human-Robot Collaboration
por: Grannen, Jennifer, et al.
Publicado: (2024)

ProVox: Personalization and Proactive Planning for Situated Human-Robot Collaboration
por: Grannen, Jennifer, et al.
Publicado: (2025)

Grounding Robot Generalization in Training Data via Retrieval-Augmented VLMs
por: Gao, Jensen, et al.
Publicado: (2026)

How to Train Your Robots? The Impact of Demonstration Modality on Imitation Learning
por: Li, Haozhuo, et al.
Publicado: (2025)

Efficiently Generating Expressive Quadruped Behaviors via Language-Guided Preference Learning
por: Clark, Jaden, et al.
Publicado: (2025)

Action-Free Reasoning for Policy Generalization
por: Clark, Jaden, et al.
Publicado: (2025)

HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies
por: Xie, Amber, et al.
Publicado: (2026)

CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments
por: Sathyamoorthy, Adarsh Jagan, et al.
Publicado: (2024)

Precise Robot Command Understanding Using Grammar-Constrained Large Language Models
por: Huo, Xinyun, et al.
Publicado: (2026)

RT-Sketch: Goal-Conditioned Imitation Learning from Hand-Drawn Sketches
por: Sundaresan, Priya, et al.
Publicado: (2024)

Robo-Instruct: Simulator-Augmented Instruction Alignment For Finetuning Code LLMs
por: Hu, Zichao, et al.
Publicado: (2024)

Batch Active Learning of Reward Functions from Human Preferences
por: Bıyık, Erdem, et al.
Publicado: (2024)

Unified Video Action Model
por: Li, Shuang, et al.
Publicado: (2025)

SteerVLA: Steering Vision-Language-Action Models in Long-Tail Driving Scenarios
por: Gao, Tian, et al.
Publicado: (2026)

Joint Action Language Modelling for Transparent Policy Execution
por: Wulff, Theodor, et al.
Publicado: (2025)

Data Analogies Enable Efficient Cross-Embodiment Transfer
por: Yang, Jonathan, et al.
Publicado: (2026)

Toward Grounded Commonsense Reasoning
por: Kwon, Minae, et al.
Publicado: (2023)

FAST: Efficient Action Tokenization for Vision-Language-Action Models
por: Pertsch, Karl, et al.
Publicado: (2025)

Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation
por: Yang, Jonathan, et al.
Publicado: (2024)

Will People Enjoy a Robot Trainer? A Case Study with Snoopie the Pacerbot
por: Du, Maximilian, et al.
Publicado: (2026)

Invariance Co-training for Robot Visual Generalization
por: Yang, Jonathan, et al.
Publicado: (2025)

Language Guided Skill Discovery
por: Rho, Seungeun, et al.
Publicado: (2024)

Towards No-Code Programming of Cobots: Experiments with Code Synthesis by Large Code Models for Conversational Programming
por: Kranti, Chalamalasetti, et al.
Publicado: (2024)

Vision Language Models are In-Context Value Learners
por: Ma, Yecheng Jason, et al.
Publicado: (2024)

Latent Diffusion Planning for Imitation Learning
por: Xie, Amber, et al.
Publicado: (2025)

Data Retrieval with Importance Weights for Few-Shot Imitation Learning
por: Xie, Amber, et al.
Publicado: (2025)

What Matters for Batch Online Reinforcement Learning in Robotics?
por: Dong, Perry, et al.
Publicado: (2025)

AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving
por: Qian, Kangan, et al.
Publicado: (2025)

UAD: Unsupervised Affordance Distillation for Generalization in Robotic Manipulation
por: Tang, Yihe, et al.
Publicado: (2025)

EXPO-FT: Sample-Efficient Reinforcement Learning Finetuning for Vision-Language-Action Models
por: Dong, Perry, et al.
Publicado: (2026)

IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation
por: Qu, Kaixian, et al.
Publicado: (2024)