:: Library Catalog

Bibliographic Details
Main Authors:	Levine, Alexander; Stone, Peter; Zhang, Amy
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2410.03016

Similar Items

Offline Action-Free Learning of Ex-BMDPs by Comparing Diverse Datasets
by: Levine, Alexander, et al.
Published: (2025)

Multistep Inverse Is Not All You Need
by: Levine, Alexander, et al.
Published: (2024)

Federated Learning With Energy Harvesting Devices: An MDP Framework
by: Zhang, Kai, et al.
Published: (2024)

MDP Planning as Policy Inference
by: Tolpin, David
Published: (2026)

t-DGR: A Trajectory-Based Deep Generative Replay Method for Continual Learning in Decision Making
by: Yue, William, et al.
Published: (2024)

Proto Successor Measure: Representing the Behavior Space of an RL Agent
by: Agarwal, Siddhant, et al.
Published: (2024)

Will My Robot Achieve My Goals? Predicting the Probability that an MDP Policy Reaches a User-Specified Behavior Target
by: Guyer, Alexander, et al.
Published: (2022)

Track-MDP: Reinforcement Learning for Target Tracking with Controlled Sensing
by: Subramaniam, Adarsh M., et al.
Published: (2024)

The Trajectory Alignment Coefficient in Two Acts: From Reward Tuning to Reward Learning
by: Muslimani, Calarina, et al.
Published: (2026)

Geometric Re-Analysis of Classical MDP Solving Algorithms
by: Mustafin, Arsenii, et al.
Published: (2025)

Learning in Markov Decision Processes with Exogenous Dynamics
by: Maran, Davide, et al.
Published: (2026)

Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information
by: Zhang, Ziyi, et al.
Published: (2024)

MDP Geometry, Normalization and Reward Balancing Solvers
by: Mustafin, Arsenii, et al.
Published: (2024)

Reinforcement Learning with Exogenous States and Rewards
by: Trimponias, George, et al.
Published: (2023)

Out-of-Distribution Generalization with a SPARC: Racing 100 Unseen Vehicles with a Single Policy
by: Grooten, Bram, et al.
Published: (2025)

Online MDP with Transition Prototypes: A Robust Adaptive Approach
by: Sun, Shuo, et al.
Published: (2024)

None To Optima in Few Shots: Bayesian Optimization with MDP Priors
by: Li, Diantong, et al.
Published: (2025)

A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems
by: Chen, Xin, et al.
Published: (2025)

Causal Bayesian Optimization via Exogenous Distribution Learning
by: Ren, Shaogang, et al.
Published: (2024)

Using Forwards-Backwards Models to Approximate MDP Homomorphisms
by: Mavor-Parker, Augustine N., et al.
Published: (2022)

Single-Trajectory Distributionally Robust Reinforcement Learning
by: Liang, Zhipeng, et al.
Published: (2023)

Exogenous Isomorphism for Counterfactual Identifiability
by: Chen, Yikang, et al.
Published: (2025)

Exogenous Matching: Learning Good Proposals for Tractable Counterfactual Estimation
by: Chen, Yikang, et al.
Published: (2024)

RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes
by: Stachowicz, Kyle, et al.
Published: (2024)

Learning to Stabilize Unknown LTI Systems on a Single Trajectory under Stochastic Noise
by: Zhang, Ziyi, et al.
Published: (2024)

Exploiting Exogenous Structure for Sample-Efficient Reinforcement Learning
by: Wan, Jia, et al.
Published: (2024)

ICU-Sepsis: A Benchmark MDP Built from Real Medical Data
by: Choudhary, Kartik, et al.
Published: (2024)

Visual Pre-Training on Unlabeled Images using Reinforcement Learning
by: Ghosh, Dibya, et al.
Published: (2025)

Learning Memory Mechanisms for Decision Making through Demonstrations
by: Yue, William, et al.
Published: (2024)

AsyncVLA: An Asynchronous VLA for Fast and Robust Navigation on the Edge
by: Hirose, Noriaki, et al.
Published: (2026)

d3LLM: Ultra-Fast Diffusion LLM using Pseudo-Trajectory Distillation
by: Qian, Yu-Yang, et al.
Published: (2026)

A Finite-Sample Analysis of an Actor-Critic Algorithm for Mean-Variance Optimization in a Discounted MDP
by: Sangadi, Tejaram, et al.
Published: (2024)

Deep reinforcement learning for weakly coupled MDP's with continuous actions
by: Robledo, Francisco, et al.
Published: (2024)

MDP: Multidimensional Vision Model Pruning with Latency Constraint
by: Sun, Xinglong, et al.
Published: (2025)

MDP modeling for multi-stage stochastic programs
by: Morton, David P., et al.
Published: (2025)

MI-to-Mid Distilled Compression (M2M-DC): An Hybrid-Information-Guided-Block Pruning with Progressive Inner Slicing Approach to Model Compression
by: Levine, Lionel, et al.
Published: (2025)

Addressing Correlated Latent Exogenous Variables in Debiased Recommender Systems
by: Zhang, Shuqiang, et al.
Published: (2025)

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
by: Mukherjee, Subhojyoti, et al.
Published: (2024)

Factored Latent Action World Models
by: Wang, Zizhao, et al.
Published: (2026)

Exogenous Randomness Empowering Random Forests
by: Mei, Tianxing, et al.
Published: (2024)