:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Bowden, James C., Levine, Sergey, Listgarten, Jennifer
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2511.03032
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Unlocking Guidance for Discrete State-Space Diffusion and Flow Models
by: Nisonoff, Hunter, et al.
Published: (2024)

Autofocused oracles for model-based design
by: Fannjiang, Clara, et al.
Published: (2020)

Is novelty predictable?
by: Fannjiang, Clara, et al.
Published: (2023)

Conformal Prediction Under Feedback Covariate Shift for Biomolecular Design
by: Fannjiang, Clara, et al.
Published: (2022)

Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation
by: Myers, Vivek, et al.
Published: (2024)

Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
by: Wilcoxson, Max, et al.
Published: (2024)

ViVa: Video-Trained Value Functions for Guiding Online RL from Diverse Data
by: Dashora, Nitish, et al.
Published: (2025)

ProteinGuide: On-the-fly property guidance for protein sequence generative models
by: Xiong, Junhao, et al.
Published: (2025)

Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design
by: Wang, Chenyu, et al.
Published: (2024)

Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings
by: Frans, Kevin, et al.
Published: (2024)

Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
by: Kuba, Jakub Grudzien, et al.
Published: (2024)

Digi-Q: Learning Q-Value Functions for Training Device-Control Agents
by: Bai, Hao, et al.
Published: (2025)

Q-learning with Adjoint Matching
by: Li, Qiyang, et al.
Published: (2026)

RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
by: Stachowicz, Kyle, et al.
Published: (2024)

A Stable Whitening Optimizer for Efficient Neural Network Training
by: Frans, Kevin, et al.
Published: (2025)

What Really Matters in Matrix-Whitening Optimizers?
by: Frans, Kevin, et al.
Published: (2025)

Unsupervised-to-Online Reinforcement Learning
by: Kim, Junsu, et al.
Published: (2024)

Visual Pre-Training on Unlabeled Images using Reinforcement Learning
by: Ghosh, Dibya, et al.
Published: (2025)

Dual Goal Representations
by: Park, Seohong, et al.
Published: (2025)

Flow Q-Learning
by: Park, Seohong, et al.
Published: (2025)

Navigating Ideation Space: Decomposed Conceptual Representations for Positioning Scientific Ideas
by: Shen, Yuexi, et al.
Published: (2026)

Differentially Private Optimization for Non-Decomposable Objective Functions
by: Kong, Weiwei, et al.
Published: (2023)

Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
by: Lopez, Antonio, et al.
Published: (2024)

A Versatile Influence Function for Data Attribution with Non-Decomposable Loss
by: Deng, Junwei, et al.
Published: (2024)

Decomposed Direct Preference Optimization for Structure-Based Drug Design
by: Cheng, Xiwei, et al.
Published: (2024)

Reinforcement Learning with Action Chunking
by: Li, Qiyang, et al.
Published: (2025)

Decoupled Q-Chunking
by: Li, Qiyang, et al.
Published: (2025)

Behavioral Exploration: Learning to Explore via In-Context Adaptation
by: Wagenmaker, Andrew, et al.
Published: (2025)

Diffusion Guidance Is a Controllable Policy Improvement Operator
by: Frans, Kevin, et al.
Published: (2025)

METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
by: Park, Seohong, et al.
Published: (2023)

Foundation Policies with Hilbert Representations
by: Park, Seohong, et al.
Published: (2024)

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
by: Eysenbach, Benjamin, et al.
Published: (2024)

Strategically Conservative Q-Learning
by: Shimizu, Yutaka, et al.
Published: (2024)

Deep Neural Networks Tend To Extrapolate Predictably
by: Kang, Katie, et al.
Published: (2023)

Diversity By Design: Leveraging Distribution Matching for Offline Model-Based Optimization
by: Yao, Michael S., et al.
Published: (2025)

Cliqueformer: Model-Based Optimization with Structured Transformers
by: Kuba, Jakub Grudzien, et al.
Published: (2024)

Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
by: Hong, Joey, et al.
Published: (2024)

OmniVLA: An Omni-Modal Vision-Language-Action Model for Robot Navigation
by: Hirose, Noriaki, et al.
Published: (2025)

Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
by: Nakamoto, Mitsuhiko, et al.
Published: (2024)

AsyncVLA: An Asynchronous VLA for Fast and Robust Navigation on the Edge
by: Hirose, Noriaki, et al.
Published: (2026)