Library Catalog

Bibliographic Details
Main Authors:	Park, Seohong; Kreiman, Tobias; Levine, Sergey
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence; Robotics
Online Access:	https://arxiv.org/abs/2402.15567

Similar Items

METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
by: Park, Seohong, et al.
Published: (2023)

Decoupled Q-Chunking
by: Li, Qiyang, et al.
Published: (2025)

HIQL: Offline Goal-Conditioned RL with Latent States as Actions
by: Park, Seohong, et al.
Published: (2023)

Dual Goal Representations
by: Park, Seohong, et al.
Published: (2025)

Flow Q-Learning
by: Park, Seohong, et al.
Published: (2025)

Scalable Offline Model-Based RL with Action Chunks
by: Park, Kwanyoung, et al.
Published: (2025)

Real-Time Execution of Action Chunking Flow Policies
by: Black, Kevin, et al.
Published: (2025)

Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings
by: Frans, Kevin, et al.
Published: (2024)

Is Value Learning Really the Main Bottleneck in Offline RL?
by: Park, Seohong, et al.
Published: (2024)

OGBench: Benchmarking Offline Goal-Conditioned RL
by: Park, Seohong, et al.
Published: (2024)

Transitive RL: Value Learning via Divide and Conquer
by: Park, Seohong, et al.
Published: (2025)

Intention-Conditioned Flow Occupancy Models
by: Zheng, Chongyi, et al.
Published: (2025)

RLDG: Robotic Generalist Policy Distillation via Reinforcement Learning
by: Xu, Charles, et al.
Published: (2024)

Posterior Behavioral Cloning: Pretraining BC Policies for Efficient RL Finetuning
by: Wagenmaker, Andrew, et al.
Published: (2025)

RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
by: Stachowicz, Kyle, et al.
Published: (2024)

Q-learning with Adjoint Matching
by: Li, Qiyang, et al.
Published: (2026)

Reinforcement Learning with Action Chunking
by: Li, Qiyang, et al.
Published: (2025)

GHIL-Glue: Hierarchical Control with Filtered Subgoal Images
by: Hatch, Kyle B., et al.
Published: (2024)

Steering Your Diffusion Policy with Latent Space Reinforcement Learning
by: Wagenmaker, Andrew, et al.
Published: (2025)

Horizon Reduction Makes RL Scalable
by: Park, Seohong, et al.
Published: (2025)

RAPTOR: A Foundation Policy for Quadrotor Control
by: Eschmann, Jonas, et al.
Published: (2025)

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation
by: Feng, Yunhai, et al.
Published: (2025)

KALIE: Fine-Tuning Vision-Language Models for Open-World Manipulation without Robot Data
by: Tang, Grace, et al.
Published: (2024)

Drifting Field Policy: A One-Step Generative Policy via Wasserstein Gradient Flow
by: Koo, Juil, et al.
Published: (2026)

Toward Accurate Long-Horizon Robotic Manipulation: Language-to-Action with Foundation Models via Scene Graphs
by: Dinesh, Sushil Samuel, et al.
Published: (2025)

Towards Interpretable Foundation Models of Robot Behavior: A Task Specific Policy Generation Approach
by: Sheidlower, Isaac, et al.
Published: (2024)

Yell At Your Robot: Improving On-the-Fly from Language Corrections
by: Shi, Lucy Xiaoyang, et al.
Published: (2024)

PEEK: Guiding and Minimal Image Representations for Zero-Shot Generalization of Robot Manipulation Policies
by: Zhang, Jesse, et al.
Published: (2025)

Language Guided Skill Discovery
by: Rho, Seungeun, et al.
Published: (2024)

Diffusion Guidance Is a Controllable Policy Improvement Operator
by: Frans, Kevin, et al.
Published: (2025)

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss
by: Zheng, Ruijie, et al.
Published: (2024)

Ensembling Prioritized Hybrid Policies for Multi-agent Pathfinding
by: Tang, Huijie, et al.
Published: (2024)

An Interactive Agent Foundation Model
by: Durante, Zane, et al.
Published: (2024)

Explainable Representation of Finite-Memory Policies for POMDPs using Decision Trees
by: Azeem, Muqsit, et al.
Published: (2024)

The Ingredients for Robotic Diffusion Transformers
by: Dasari, Sudeep, et al.
Published: (2024)

Human Implicit Preference-Based Policy Fine-tuning for Multi-Agent Reinforcement Learning in USV Swarm
by: Kim, Hyeonjun, et al.
Published: (2025)

Online Foundation Model Selection in Robotics
by: Li, Po-han, et al.
Published: (2024)

Fast Adaptation with Behavioral Foundation Models
by: Sikchi, Harshit, et al.
Published: (2025)

Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model
by: Yuan, Xiu, et al.
Published: (2024)

Policy-Guided Diffusion
by: Jackson, Matthew Thomas, et al.
Published: (2024)