| Main Authors: | Tao, Zhenyu; Xu, Wei; You, Xiaohu |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2512.17265 |
Similar Items
A Generalized Bisimulation Metric of State Similarity between Markov Decision Processes: From Theoretical Propositions to Applications
by: Tao, Zhenyu, et al.
Published: (2025)
Provable Performance Bounds for Digital Twin-driven Deep Reinforcement Learning in Wireless Networks: A Novel Digital-Twin Bisimulation Metric
by: Tao, Zhenyu, et al.
Published: (2025)
Large Vision Model-Enhanced Digital Twin with Deep Reinforcement Learning for User Association and Load Balancing in Dynamic Wireless Networks
by: Tao, Zhenyu, et al.
Published: (2024)
Digital Twin Assisted Deep Reinforcement Learning for Online Admission Control in Sliced Network
by: Tao, Zhenyu, et al.
Published: (2023)
Wireless Network Digital Twin for 6G: Generative AI as A Key Enabler
by: Tao, Zhenyu, et al.
Published: (2023)
Generalized Linear Markov Decision Process
by: Zhang, Sinian, et al.
Published: (2025)
Monitored Markov Decision Processes
by: Parisi, Simone, et al.
Published: (2024)
Topology-Aware State Abstraction with Tangle Cores for Markov Decision Processes
by: Shihab, Ibne Farabi, et al.
Published: (2026)
Learn A Flexible Exploration Model for Parameterized Action Markov Decision Processes
by: Wang, Zijian, et al.
Published: (2025)
Policy Gradient for Robust Markov Decision Processes
by: Wang, Qiuhao, et al.
Published: (2024)
Federated Control in Markov Decision Processes
by: Jin, Hao, et al.
Published: (2024)
A Cantor-Kantorovich Metric Between Markov Decision Processes with Application to Transfer Learning
by: Banse, Adrien, et al.
Published: (2024)
Linear Mixture Distributionally Robust Markov Decision Processes
by: Liu, Zhishuai, et al.
Published: (2025)
Homomorphic Mappings for Value-Preserving State Aggregation in Markov Decision Processes
by: Zhao, Shuo, et al.
Published: (2025)
Theoretical Analysis of Engression and Reverse Markov Engression
by: Huang, Jiaqi, et al.
Published: (2026)
Learning in Markov Decision Processes with Exogenous Dynamics
by: Maran, Davide, et al.
Published: (2026)
Optimal Decision Tree Policies for Markov Decision Processes
by: Vos, Daniël, et al.
Published: (2023)
Policy Testing in Markov Decision Processes
by: Ariu, Kaito, et al.
Published: (2025)
Quantum Speedups in Regret Analysis of Infinite Horizon Average-Reward Markov Decision Processes
by: Ganguly, Bhargav, et al.
Published: (2023)
Markov Decision Processes under External Temporal Processes
by: Ayyagari, Ranga Shaarad, et al.
Published: (2023)
A Markov Decision Process for Variable Selection in Branch & Bound
by: Strang, Paul, et al.
Published: (2025)
The regret lower bound for communicating Markov Decision Processes
by: Boone, Victor, et al.
Published: (2025)
An Orthogonal Learner for Individualized Outcomes in Markov Decision Processes
by: Javurek, Emil, et al.
Published: (2025)
Initial Distribution Sensitivity of Constrained Markov Decision Processes
by: Tercan, Alperen, et al.
Published: (2025)
Improving Controller Generalization with Dimensionless Markov Decision Processes
by: Charvet, Valentin, et al.
Published: (2025)
Model-Based Exploration in Monitored Markov Decision Processes
by: Kazemipour, Alireza, et al.
Published: (2025)
Horizon-Free Regret for Linear Markov Decision Processes
by: Zhang, Zihan, et al.
Published: (2024)
Learning Utilities from Demonstrations in Markov Decision Processes
by: Lazzati, Filippo, et al.
Published: (2024)
Achieving Constant Regret in Linear Markov Decision Processes
by: Zhang, Weitong, et al.
Published: (2024)
Concentration of Cumulative Reward in Markov Decision Processes
by: Sayedana, Borna, et al.
Published: (2024)
Corruption-Robust Algorithms with Uncertainty Weighting for Nonlinear Contextual Bandits and Markov Decision Processes
by: Ye, Chenlu, et al.
Published: (2022)
Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space
by: Adler, Saghar, et al.
Published: (2023)
Risk-sensitive Markov Decision Process and Learning under General Utility Functions
by: Wu, Zhengqi, et al.
Published: (2023)
No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes
by: Bayrooti, Jasmine, et al.
Published: (2025)
Transition Transfer $Q$-Learning for Composite Markov Decision Processes
by: Chai, Jinhang, et al.
Published: (2025)
Logarithmic Regret of Exploration in Average Reward Markov Decision Processes
by: Boone, Victor, et al.
Published: (2025)
Best-of-Both-Worlds for Heavy-Tailed Markov Decision Processes
by: Chen, Yu, et al.
Published: (2026)
Performance Improvement Bounds for Lipschitz Configurable Markov Decision Processes
by: Metelli, Alberto Maria
Published: (2024)
Transition Constrained Bayesian Optimization via Markov Decision Processes
by: Folch, Jose Pablo, et al.
Published: (2024)
Learning Markov Decision Processes under Fully Bandit Feedback
by: Zhuo, Zhengjia, et al.
Published: (2026)