Saved in:
| Main Author: | Tolpin, David |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.17375 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Efficient Incremental Belief Updates Using Weighted Virtual Observations
by: Tolpin, David
Published: (2024)
by: Tolpin, David
Published: (2024)
Fast Neural Inverse Kinematics on Human Body Motions
by: Tolpin, David, et al.
Published: (2025)
by: Tolpin, David, et al.
Published: (2025)
Neural Human Pose Prior
by: Heker, Michal, et al.
Published: (2025)
by: Heker, Michal, et al.
Published: (2025)
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
by: Mukherjee, Subhojyoti, et al.
Published: (2024)
by: Mukherjee, Subhojyoti, et al.
Published: (2024)
Geometric Re-Analysis of Classical MDP Solving Algorithms
by: Mustafin, Arsenii, et al.
Published: (2025)
by: Mustafin, Arsenii, et al.
Published: (2025)
MDP Geometry, Normalization and Reward Balancing Solvers
by: Mustafin, Arsenii, et al.
Published: (2024)
by: Mustafin, Arsenii, et al.
Published: (2024)
MDP modeling for multi-stage stochastic programs
by: Morton, David P., et al.
Published: (2025)
by: Morton, David P., et al.
Published: (2025)
Online MDP with Transition Prototypes: A Robust Adaptive Approach
by: Sun, Shuo, et al.
Published: (2024)
by: Sun, Shuo, et al.
Published: (2024)
None To Optima in Few Shots: Bayesian Optimization with MDP Priors
by: Li, Diantong, et al.
Published: (2025)
by: Li, Diantong, et al.
Published: (2025)
Will My Robot Achieve My Goals? Predicting the Probability that an MDP Policy Reaches a User-Specified Behavior Target
by: Guyer, Alexander, et al.
Published: (2022)
by: Guyer, Alexander, et al.
Published: (2022)
Federated Learning With Energy Harvesting Devices: An MDP Framework
by: Zhang, Kai, et al.
Published: (2024)
by: Zhang, Kai, et al.
Published: (2024)
Using Forwards-Backwards Models to Approximate MDP Homomorphisms
by: Mavor-Parker, Augustine N., et al.
Published: (2022)
by: Mavor-Parker, Augustine N., et al.
Published: (2022)
Track-MDP: Reinforcement Learning for Target Tracking with Controlled Sensing
by: Subramaniam, Adarsh M., et al.
Published: (2024)
by: Subramaniam, Adarsh M., et al.
Published: (2024)
Predictive Control and Regret Analysis of Non-Stationary MDP with Look-ahead Information
by: Zhang, Ziyi, et al.
Published: (2024)
by: Zhang, Ziyi, et al.
Published: (2024)
ICU-Sepsis: A Benchmark MDP Built from Real Medical Data
by: Choudhary, Kartik, et al.
Published: (2024)
by: Choudhary, Kartik, et al.
Published: (2024)
Learning a Fast Mixing Exogenous Block MDP using a Single Trajectory
by: Levine, Alexander, et al.
Published: (2024)
by: Levine, Alexander, et al.
Published: (2024)
Deep reinforcement learning for weakly coupled MDP's with continuous actions
by: Robledo, Francisco, et al.
Published: (2024)
by: Robledo, Francisco, et al.
Published: (2024)
MDP: Multidimensional Vision Model Pruning with Latency Constraint
by: Sun, Xinglong, et al.
Published: (2025)
by: Sun, Xinglong, et al.
Published: (2025)
A Minimax-MDP Framework with Future-imposed Conditions for Learning-augmented Problems
by: Chen, Xin, et al.
Published: (2025)
by: Chen, Xin, et al.
Published: (2025)
Tsallis Entropy Regularization for Linearly Solvable MDP and Linear Quadratic Regulator
by: Hashizume, Yota, et al.
Published: (2024)
by: Hashizume, Yota, et al.
Published: (2024)
User Response in Ad Auctions: An MDP Formulation of Long-Term Revenue Optimization
by: Cai, Yang, et al.
Published: (2023)
by: Cai, Yang, et al.
Published: (2023)
Multi-Task Vehicle Routing Solver via Mixture of Specialized Experts under State-Decomposable MDP
by: Pan, Yuxin, et al.
Published: (2025)
by: Pan, Yuxin, et al.
Published: (2025)
A Finite-Sample Analysis of an Actor-Critic Algorithm for Mean-Variance Optimization in a Discounted MDP
by: Sangadi, Tejaram, et al.
Published: (2024)
by: Sangadi, Tejaram, et al.
Published: (2024)
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
by: Eysenbach, Benjamin, et al.
Published: (2024)
by: Eysenbach, Benjamin, et al.
Published: (2024)
Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference
by: Kong, Deqian, et al.
Published: (2024)
by: Kong, Deqian, et al.
Published: (2024)
A Factored MDP Approach To Moving Target Defense With Dynamic Threat Modeling and Cost Efficiency
by: Bose, Megha, et al.
Published: (2024)
by: Bose, Megha, et al.
Published: (2024)
Optimizing Predictive Maintenance in Intelligent Manufacturing: An Integrated FNO-DAE-GNN-PPO MDP Framework
by: Qiu, Shiqing
Published: (2025)
by: Qiu, Shiqing
Published: (2025)
MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs
by: Sun, Hui, et al.
Published: (2025)
by: Sun, Hui, et al.
Published: (2025)
Online Policy Learning and Inference by Matrix Completion
by: Duan, Congyuan, et al.
Published: (2024)
by: Duan, Congyuan, et al.
Published: (2024)
Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies
by: DeWeese, Alex, et al.
Published: (2024)
by: DeWeese, Alex, et al.
Published: (2024)
Expected Free Energy-based Planning as Variational Inference
by: de Vries, Bert, et al.
Published: (2025)
by: de Vries, Bert, et al.
Published: (2025)
Planning with a Learned Policy Basis to Optimally Solve Complex Tasks
by: Infante, Guillermo, et al.
Published: (2024)
by: Infante, Guillermo, et al.
Published: (2024)
Bayesian Inference of Contextual Bandit Policies via Empirical Likelihood
by: Ouyang, Jiangrong, et al.
Published: (2026)
by: Ouyang, Jiangrong, et al.
Published: (2026)
Lever: Inference-Time Policy Reuse under Support Constraints
by: Vitenko, Ihor, et al.
Published: (2026)
by: Vitenko, Ihor, et al.
Published: (2026)
Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning
by: Liu, Weidong, et al.
Published: (2023)
by: Liu, Weidong, et al.
Published: (2023)
Hierarchical Policy Blending as Inference for Reactive Robot Control
by: Hansel, Kay, et al.
Published: (2022)
by: Hansel, Kay, et al.
Published: (2022)
Hitting Time Isomorphism for Multi-Stage Planning with Foundation Policies
by: Boock, Magnus Victor, et al.
Published: (2026)
by: Boock, Magnus Victor, et al.
Published: (2026)
Inference Time Policy Optimization for Offline RL with Differentiable World Models
by: Deb, Rohan, et al.
Published: (2026)
by: Deb, Rohan, et al.
Published: (2026)
Conformal Prediction Beyond the Horizon: Distribution-Free Inference for Policy Evaluation
by: Gan, Feichen, et al.
Published: (2025)
by: Gan, Feichen, et al.
Published: (2025)
Beating the Winner's Curse via Inference-Aware Policy Optimization
by: Bastani, Hamsa, et al.
Published: (2025)
by: Bastani, Hamsa, et al.
Published: (2025)
Similar Items
-
Efficient Incremental Belief Updates Using Weighted Virtual Observations
by: Tolpin, David
Published: (2024) -
Fast Neural Inverse Kinematics on Human Body Motions
by: Tolpin, David, et al.
Published: (2025) -
Neural Human Pose Prior
by: Heker, Michal, et al.
Published: (2025) -
SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP
by: Mukherjee, Subhojyoti, et al.
Published: (2024) -
Geometric Re-Analysis of Classical MDP Solving Algorithms
by: Mustafin, Arsenii, et al.
Published: (2025)