Saved in:
| Main Authors: | Levine, Alexander, Stone, Peter, Zhang, Amy |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.03016 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets
by: Levine, Alexander, et al.
Published: (2025)
by: Levine, Alexander, et al.
Published: (2025)
Multistep Inverse Is Not All You Need
by: Levine, Alexander, et al.
Published: (2024)
by: Levine, Alexander, et al.
Published: (2024)
Federated Learning With Energy Harvesting Devices: An MDP Framework
by: Zhang, Kai, et al.
Published: (2024)
by: Zhang, Kai, et al.
Published: (2024)
MDP Planning as Policy Inference
by: Tolpin, David
Published: (2026)
by: Tolpin, David
Published: (2026)
t-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision Making
by: Yue, William, et al.
Published: (2024)
by: Yue, William, et al.
Published: (2024)
Proto Successor Measure: Representing the Behavior Space of an RL Agent
by: Agarwal, Siddhant, et al.
Published: (2024)
by: Agarwal, Siddhant, et al.
Published: (2024)
Will My Robot Achieve My Goals? Predicting the Probability that an MDP Policy Reaches a User-Specified Behavior Target
by: Guyer, Alexander, et al.
Published: (2022)
by: Guyer, Alexander, et al.
Published: (2022)
Track-MDP: Reinforcement Learning for Target Tracking with Controlled Sensing
by: Subramaniam, Adarsh M., et al.
Published: (2024)
by: Subramaniam, Adarsh M., et al.
Published: (2024)
The Trajectory Alignment Coefficient in Two Acts: From Reward Tuning to Reward Learning
by: Muslimani, Calarina, et al.
Published: (2026)
by: Muslimani, Calarina, et al.
Published: (2026)
Geometric Re-Analysis of Classical MDP Solving Algorithms
by: Mustafin, Arsenii, et al.
Published: (2025)
by: Mustafin, Arsenii, et al.
Published: (2025)
Learning in Markov Decision Processes with Exogenous Dynamics
by: Maran, Davide, et al.
Published: (2026)
by: Maran, Davide, et al.
Published: (2026)
Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information
by: Zhang, Ziyi, et al.
Published: (2024)
by: Zhang, Ziyi, et al.
Published: (2024)
MDP Geometry, Normalization and Reward Balancing Solvers
by: Mustafin, Arsenii, et al.
Published: (2024)
by: Mustafin, Arsenii, et al.
Published: (2024)
Reinforcement Learning with Exogenous States and Rewards
by: Trimponias, George, et al.
Published: (2023)
by: Trimponias, George, et al.
Published: (2023)
Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
by: Grooten, Bram, et al.
Published: (2025)
by: Grooten, Bram, et al.
Published: (2025)
Online MDP with Transition Prototypes: A Robust Adaptive Approach
by: Sun, Shuo, et al.
Published: (2024)
by: Sun, Shuo, et al.
Published: (2024)
None To Optima in Few Shots: Bayesian Optimization with MDP Priors
by: Li, Diantong, et al.
Published: (2025)
by: Li, Diantong, et al.
Published: (2025)
A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems
by: Chen, Xin, et al.
Published: (2025)
by: Chen, Xin, et al.
Published: (2025)
Causal Bayesian Optimization via Exogenous Distribution Learning
by: Ren, Shaogang, et al.
Published: (2024)
by: Ren, Shaogang, et al.
Published: (2024)
Using Forwards-Backwards Models to Approximate MDP Homomorphisms
by: Mavor-Parker, Augustine N., et al.
Published: (2022)
by: Mavor-Parker, Augustine N., et al.
Published: (2022)
Single-Trajectory Distributionally Robust Reinforcement Learning
by: Liang, Zhipeng, et al.
Published: (2023)
by: Liang, Zhipeng, et al.
Published: (2023)
Exogenous Isomorphism for Counterfactual Identifiability
by: Chen, Yikang, et al.
Published: (2025)
by: Chen, Yikang, et al.
Published: (2025)
Exogenous Matching: Learning Good Proposals for Tractable Counterfactual Estimation
by: Chen, Yikang, et al.
Published: (2024)
by: Chen, Yikang, et al.
Published: (2024)
RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
by: Stachowicz, Kyle, et al.
Published: (2024)
by: Stachowicz, Kyle, et al.
Published: (2024)
Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise
by: Zhang, Ziyi, et al.
Published: (2024)
by: Zhang, Ziyi, et al.
Published: (2024)
Exploiting Exogenous Structure for Sample-Efficient Reinforcement Learning
by: Wan, Jia, et al.
Published: (2024)
by: Wan, Jia, et al.
Published: (2024)
ICU-Sepsis: A Benchmark MDP Built from Real Medical Data
by: Choudhary, Kartik, et al.
Published: (2024)
by: Choudhary, Kartik, et al.
Published: (2024)
Visual Pre-Training on Unlabeled Images using Reinforcement Learning
by: Ghosh, Dibya, et al.
Published: (2025)
by: Ghosh, Dibya, et al.
Published: (2025)
Learning Memory Mechanisms for Decision Making through Demonstrations
by: Yue, William, et al.
Published: (2024)
by: Yue, William, et al.
Published: (2024)
AsyncVLA: An Asynchronous VLA for Fast and Robust Navigation on the Edge
by: Hirose, Noriaki, et al.
Published: (2026)
by: Hirose, Noriaki, et al.
Published: (2026)
d3LLM: Ultra-Fast Diffusion LLM using Pseudo-Trajectory Distillation
by: Qian, Yu-Yang, et al.
Published: (2026)
by: Qian, Yu-Yang, et al.
Published: (2026)
A Finite-Sample Analysis of an Actor-Critic Algorithm for Mean-Variance Optimization in a Discounted MDP
by: Sangadi, Tejaram, et al.
Published: (2024)
by: Sangadi, Tejaram, et al.
Published: (2024)
Deep reinforcement learning for weakly coupled MDP's with continuous actions
by: Robledo, Francisco, et al.
Published: (2024)
by: Robledo, Francisco, et al.
Published: (2024)
MDP: Multidimensional Vision Model Pruning with Latency Constraint
by: Sun, Xinglong, et al.
Published: (2025)
by: Sun, Xinglong, et al.
Published: (2025)
MDP modeling for multi-stage stochastic programs
by: Morton, David P., et al.
Published: (2025)
by: Morton, David P., et al.
Published: (2025)
MI-to-Mid Distilled Compression (M2M-DC): An Hybrid-Information-Guided-Block Pruning with Progressive Inner Slicing Approach to Model Compression
by: Levine, Lionel, et al.
Published: (2025)
by: Levine, Lionel, et al.
Published: (2025)
Addressing Correlated Latent Exogenous Variables in Debiased Recommender Systems
by: Zhang, Shuqiang, et al.
Published: (2025)
by: Zhang, Shuqiang, et al.
Published: (2025)
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
by: Mukherjee, Subhojyoti, et al.
Published: (2024)
by: Mukherjee, Subhojyoti, et al.
Published: (2024)
Factored Latent Action World Models
by: Wang, Zizhao, et al.
Published: (2026)
by: Wang, Zizhao, et al.
Published: (2026)
Exogenous Randomness Empowering Random Forests
by: Mei, Tianxing, et al.
Published: (2024)
by: Mei, Tianxing, et al.
Published: (2024)
Similar Items
-
Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets
by: Levine, Alexander, et al.
Published: (2025) -
Multistep Inverse Is Not All You Need
by: Levine, Alexander, et al.
Published: (2024) -
Federated Learning With Energy Harvesting Devices: An MDP Framework
by: Zhang, Kai, et al.
Published: (2024) -
MDP Planning as Policy Inference
by: Tolpin, David
Published: (2026) -
t-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision Making
by: Yue, William, et al.
Published: (2024)