| Main Authors: | Bowden, James C., Levine, Sergey, Listgarten, Jennifer |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.03032 |
Similar Items
Unlocking Guidance for Discrete State-Space Diffusion and Flow Models
by: Nisonoff, Hunter, et al.
Published: (2024)
Autofocused oracles for model-based design
by: Fannjiang, Clara, et al.
Published: (2020)
Is novelty predictable?
by: Fannjiang, Clara, et al.
Published: (2023)
Conformal Prediction Under Feedback Covariate Shift for Biomolecular Design
by: Fannjiang, Clara, et al.
Published: (2022)
Policy Adaptation via Language Optimization: Decomposing Tasks for Few-Shot Imitation
by: Myers, Vivek, et al.
Published: (2024)
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration
by: Wilcoxson, Max, et al.
Published: (2024)
ViVa: Video-Trained Value Functions for Guiding Online RL from Diverse Data
by: Dashora, Nitish, et al.
Published: (2025)
ProteinGuide: On-the-fly property guidance for protein sequence generative models
by: Xiong, Junhao, et al.
Published: (2025)
Fine-Tuning Discrete Diffusion Models via Reward Optimization with Applications to DNA and Protein Design
by: Wang, Chenyu, et al.
Published: (2024)
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings
by: Frans, Kevin, et al.
Published: (2024)
Functional Graphical Models: Structure Enables Offline Data-Driven Optimization
by: Kuba, Jakub Grudzien, et al.
Published: (2024)
Digi-Q: Learning Q-Value Functions for Training Device-Control Agents
by: Bai, Hao, et al.
Published: (2025)
Q-learning with Adjoint Matching
by: Li, Qiyang, et al.
Published: (2026)
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
by: Stachowicz, Kyle, et al.
Published: (2024)
A Stable Whitening Optimizer for Efficient Neural Network Training
by: Frans, Kevin, et al.
Published: (2025)
What Really Matters in Matrix-Whitening Optimizers?
by: Frans, Kevin, et al.
Published: (2025)
Unsupervised-to-Online Reinforcement Learning
by: Kim, Junsu, et al.
Published: (2024)
Visual Pre-Training on Unlabeled Images using Reinforcement Learning
by: Ghosh, Dibya, et al.
Published: (2025)
Dual Goal Representations
by: Park, Seohong, et al.
Published: (2025)
Flow Q-Learning
by: Park, Seohong, et al.
Published: (2025)
Navigating Ideation Space: Decomposed Conceptual Representations for Positioning Scientific Ideas
by: Shen, Yuexi, et al.
Published: (2026)
Differentially Private Optimization for Non-Decomposable Objective Functions
by: Kong, Weiwei, et al.
Published: (2023)
Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
by: Lopez, Antonio, et al.
Published: (2024)
A Versatile Influence Function for Data Attribution with Non-Decomposable Loss
by: Deng, Junwei, et al.
Published: (2024)
Decomposed Direct Preference Optimization for Structure-Based Drug Design
by: Cheng, Xiwei, et al.
Published: (2024)
Reinforcement Learning with Action Chunking
by: Li, Qiyang, et al.
Published: (2025)
Decoupled Q-Chunking
by: Li, Qiyang, et al.
Published: (2025)
Behavioral Exploration: Learning to Explore via In-Context Adaptation
by: Wagenmaker, Andrew, et al.
Published: (2025)
Diffusion Guidance Is a Controllable Policy Improvement Operator
by: Frans, Kevin, et al.
Published: (2025)
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
by: Park, Seohong, et al.
Published: (2023)
Foundation Policies with Hilbert Representations
by: Park, Seohong, et al.
Published: (2024)
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
by: Eysenbach, Benjamin, et al.
Published: (2024)
Strategically Conservative Q-Learning
by: Shimizu, Yutaka, et al.
Published: (2024)
Deep Neural Networks Tend To Extrapolate Predictably
by: Kang, Katie, et al.
Published: (2023)
Diversity By Design: Leveraging Distribution Matching for Offline Model-Based Optimization
by: Yao, Michael S., et al.
Published: (2025)
Cliqueformer: Model-Based Optimization with Structured Transformers
by: Kuba, Jakub Grudzien, et al.
Published: (2024)
Q-SFT: Q-Learning for Language Models via Supervised Fine-Tuning
by: Hong, Joey, et al.
Published: (2024)
OmniVLA: An Omni-Modal Vision-Language-Action Model for Robot Navigation
by: Hirose, Noriaki, et al.
Published: (2025)
Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
by: Nakamoto, Mitsuhiko, et al.
Published: (2024)
AsyncVLA: An Asynchronous VLA for Fast and Robust Navigation on the Edge
by: Hirose, Noriaki, et al.
Published: (2026)