:: Library Catalog

Bibliographic Details
Main Authors:	Azad, Abdus Salam; Gur, Izzeddin; Emhoff, Jasper; Alexis, Nathaniel; Faust, Aleksandra; Abbeel, Pieter; Stoica, Ion
Format:	Preprint
Published:	2022
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2210.10243

Similar Items

Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
by: Furuta, Hiroki, et al.
Published: (2023)

A Real-World WebAgent with Planning, Long Context Understanding, and Program Synthesis
by: Gur, Izzeddin, et al.
Published: (2023)

Multimodal Web Navigation with Instruction-Finetuned Foundation Models
by: Furuta, Hiroki, et al.
Published: (2023)

Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings
by: Frans, Kevin, et al.
Published: (2024)

Geometric-Averaged Preference Optimization for Soft Preference Labels
by: Furuta, Hiroki, et al.
Published: (2024)

Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models
by: Chow, Yinlam, et al.
Published: (2024)

Coarse-to-fine Q-Network with Action Sequence for Data-Efficient Reinforcement Learning
by: Seo, Younggyo, et al.
Published: (2024)

DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing
by: Lee, Vint, et al.
Published: (2023)

Offline Imitation Learning Through Graph Search and Retrieval
by: Yin, Zhao-Heng, et al.
Published: (2024)

Learning a Diffusion Model Policy from Rewards via Q-Score Matching
by: Psenka, Michael, et al.
Published: (2023)

Cooperative Inverse Reinforcement Learning
by: Hadfield-Menell, Dylan, et al.
Published: (2016)

Visual Representation Learning with Stochastic Frame Prediction
by: Jang, Huiwon, et al.
Published: (2024)

Accelerating Reinforcement Learning with Value-Conditional State Entropy Exploration
by: Kim, Dongyoung, et al.
Published: (2023)

Interactive Task Planning with Language Models
by: Li, Boyi, et al.
Published: (2023)

K-Search: LLM Kernel Generation via Co-Evolving Intrinsic World Model
by: Cao, Shiyi, et al.
Published: (2026)

Object-centric 3D Motion Field for Robot Learning from Human Videos
by: Yin, Zhao-Heng, et al.
Published: (2025)

When Does Non-Uniform Replay Matter in Reinforcement Learning?
by: Korniak, Michal, et al.
Published: (2026)

Learning Interactive Real-World Simulators
by: Yang, Sherry, et al.
Published: (2023)

Semi-Supervised One-Shot Imitation Learning
by: Wu, Philipp, et al.
Published: (2024)

Body Transformer: Leveraging Robot Embodiment for Policy Learning
by: Sferrazza, Carmelo, et al.
Published: (2024)

Qrita: High-performance Top-k and Top-p using Pivot-based Truncation and Selection
by: Park, Jongseok, et al.
Published: (2026)

Spectral Alignment in Forward-Backward Representations via Temporal Abstraction
by: Azad, Seyed Mahdi B., et al.
Published: (2026)

Cliqueformer: Model-Based Optimization with Structured Transformers
by: Kuba, Jakub Grudzien, et al.
Published: (2024)

Learning Sim-to-Real Humanoid Locomotion in 15 Minutes
by: Seo, Younggyo, et al.
Published: (2025)

EgoZero: Robot Learning from Smart Glasses
by: Liu, Vincent, et al.
Published: (2025)

Video2Policy: Scaling up Manipulation Tasks in Simulation through Internet Videos
by: Ye, Weirui, et al.
Published: (2025)

MOKA: Open-World Robotic Manipulation through Mark-Based Visual Prompting
by: Liu, Fangchen, et al.
Published: (2024)

From LLMs to Actions: Latent Codes as Bridges in Hierarchical Robot Control
by: Shentu, Yide, et al.
Published: (2024)

Small LLMs Do Not Learn a Generalizable Theory of Mind via Reinforcement Learning
by: Sarangi, Sneheel, et al.
Published: (2025)

depyf: Open the Opaque Box of PyTorch Compiler for Machine Learning Researchers
by: You, Kaichao, et al.
Published: (2024)

GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents
by: Shetty, Manish, et al.
Published: (2025)

Reinforcement Learning for Versatile, Dynamic, and Robust Bipedal Locomotion Control
by: Li, Zhongyu, et al.
Published: (2024)

Symbolic Regression for Beyond the Standard Model Physics
by: AbdusSalam, Shehu, et al.
Published: (2024)

Lightning Grasp: High Performance Procedural Grasp Synthesis with Contact Fields
by: Yin, Zhao-Heng, et al.
Published: (2025)

Learning to Model the World with Language
by: Lin, Jessy, et al.
Published: (2023)

Multi-Objective Learning for Diffusion Models: A Statistical Theory under Semi-Supervised Learning
by: Cheng, Ziheng, et al.
Published: (2026)

FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control
by: Seo, Younggyo, et al.
Published: (2025)

Feel the Force: Contact-Driven Learning from Humans
by: Adeniji, Ademi, et al.
Published: (2025)

A Statistical Framework for Ranking LLM-Based Chatbots
by: Ameli, Siavash, et al.
Published: (2024)

Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding
by: Jones, Joshua, et al.
Published: (2025)