:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Tolpin, David
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2602.17375
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Efficient Incremental Belief Updates Using Weighted Virtual Observations
by: Tolpin, David
Published: (2024)

Fast Neural Inverse Kinematics on Human Body Motions
by: Tolpin, David, et al.
Published: (2025)

Neural Human Pose Prior
by: Heker, Michal, et al.
Published: (2025)

SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
by: Mukherjee, Subhojyoti, et al.
Published: (2024)

Geometric Re-Analysis of Classical MDP Solving Algorithms
by: Mustafin, Arsenii, et al.
Published: (2025)

MDP Geometry, Normalization and Reward Balancing Solvers
by: Mustafin, Arsenii, et al.
Published: (2024)

MDP modeling for multi-stage stochastic programs
by: Morton, David P., et al.
Published: (2025)

Online MDP with Transition Prototypes: A Robust Adaptive Approach
by: Sun, Shuo, et al.
Published: (2024)

None To Optima in Few Shots: Bayesian Optimization with MDP Priors
by: Li, Diantong, et al.
Published: (2025)

Will My Robot Achieve My Goals? Predicting the Probability that an MDP Policy Reaches a User-Specified Behavior Target
by: Guyer, Alexander, et al.
Published: (2022)

Federated Learning With Energy Harvesting Devices: An MDP Framework
by: Zhang, Kai, et al.
Published: (2024)

Using Forwards-Backwards Models to Approximate MDP Homomorphisms
by: Mavor-Parker, Augustine N., et al.
Published: (2022)

Track-MDP: Reinforcement Learning for Target Tracking with Controlled Sensing
by: Subramaniam, Adarsh M., et al.
Published: (2024)

Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information
by: Zhang, Ziyi, et al.
Published: (2024)

ICU-Sepsis: A Benchmark MDP Built from Real Medical Data
by: Choudhary, Kartik, et al.
Published: (2024)

Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
by: Levine, Alexander, et al.
Published: (2024)

Deep reinforcement learning for weakly coupled MDP's with continuous actions
by: Robledo, Francisco, et al.
Published: (2024)

MDP: Multidimensional Vision Model Pruning with Latency Constraint
by: Sun, Xinglong, et al.
Published: (2025)

A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems
by: Chen, Xin, et al.
Published: (2025)

Tsallis Entropy Regularization for Linearly Solvable MDP and Linear Quadratic Regulator
by: Hashizume, Yota, et al.
Published: (2024)

User Response in Ad Auctions: An MDP Formulation of Long-Term Revenue Optimization
by: Cai, Yang, et al.
Published: (2023)

Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP
by: Pan, Yuxin, et al.
Published: (2025)

A Finite-Sample Analysis of an Actor-Critic Algorithm for Mean-Variance Optimization in a Discounted MDP
by: Sangadi, Tejaram, et al.
Published: (2024)

Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
by: Eysenbach, Benjamin, et al.
Published: (2024)

Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference
by: Kong, Deqian, et al.
Published: (2024)

A Factored MDP Approach To Moving Target Defense With Dynamic Threat Modeling and Cost Efficiency
by: Bose, Megha, et al.
Published: (2024)

Optimizing Predictive Maintenance in Intelligent Manufacturing: An Integrated FNO-DAE-GNN-PPO MDP Framework
by: Qiu, Shiqing
Published: (2025)

MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs
by: Sun, Hui, et al.
Published: (2025)

Online Policy Learning and Inference by Matrix Completion
by: Duan, Congyuan, et al.
Published: (2024)

Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies
by: DeWeese, Alex, et al.
Published: (2024)

Expected Free Energy-based Planning as Variational Inference
by: de Vries, Bert, et al.
Published: (2025)

Planning with a Learned Policy Basis to Optimally Solve Complex Tasks
by: Infante, Guillermo, et al.
Published: (2024)

Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood
by: Ouyang, Jiangrong, et al.
Published: (2026)

Lever: Inference-Time Policy Reuse under Support Constraints
by: Vitenko, Ihor, et al.
Published: (2026)

Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
by: Liu, Weidong, et al.
Published: (2023)

Hierarchical Policy Blending as Inference for Reactive Robot Control
by: Hansel, Kay, et al.
Published: (2022)

Hitting Time Isomorphism for Multi-Stage Planning with Foundation Policies
by: Boock, Magnus Victor, et al.
Published: (2026)

Inference Time Policy Optimization for Offline RL with Differentiable World Models
by: Deb, Rohan, et al.
Published: (2026)

Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation
by: Gan, Feichen, et al.
Published: (2025)

Beating the Winner's Curse via Inference-Aware Policy Optimization
by: Bastani, Hamsa, et al.
Published: (2025)