Enregistré dans:
| Auteurs principaux: | Guo, Xin, Lyu, Zijiu |
|---|---|
| Format: | Preprint |
| Publié: |
2025
|
| Sujets: | |
| Accès en ligne: | https://arxiv.org/abs/2510.15165 |
| Tags: |
Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
|
Documents similaires
Deterministic Policy Gradient for Reinforcement Learning with Continuous Time and State
par: Cheng, Ziheng, et autres
Publié: (2025)
par: Cheng, Ziheng, et autres
Publié: (2025)
Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning
par: Qiu, Shuang, et autres
Publié: (2024)
par: Qiu, Shuang, et autres
Publié: (2024)
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning
par: Zhang, Chenyu, et autres
Publié: (2024)
par: Zhang, Chenyu, et autres
Publié: (2024)
Operator Models for Continuous-Time Offline Reinforcement Learning
par: Hoischen, Nicolas, et autres
Publié: (2025)
par: Hoischen, Nicolas, et autres
Publié: (2025)
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
par: Wiltzer, Harley, et autres
Publié: (2024)
par: Wiltzer, Harley, et autres
Publié: (2024)
Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control
par: Hua, Chengxiu, et autres
Publié: (2025)
par: Hua, Chengxiu, et autres
Publié: (2025)
A Minibatch-SGD-Based Learning Meta-Policy for Inventory Systems with Myopic Optimal Policy
par: Lyu, Jiameng, et autres
Publié: (2024)
par: Lyu, Jiameng, et autres
Publié: (2024)
Fast Policy Learning for Linear Quadratic Control with Entropy Regularization
par: Guo, Xin, et autres
Publié: (2023)
par: Guo, Xin, et autres
Publié: (2023)
Mean-Field Games with Constraints
par: Hu, Anran, et autres
Publié: (2025)
par: Hu, Anran, et autres
Publié: (2025)
Learning to Solve Optimization Problems Constrained with Partial Differential Equations
par: Guven, Yusuf, et autres
Publié: (2025)
par: Guven, Yusuf, et autres
Publié: (2025)
Control and optimization for Neural Partial Differential Equations in Supervised Learning
par: Bensoussan, Alain, et autres
Publié: (2025)
par: Bensoussan, Alain, et autres
Publié: (2025)
Accuracy of Discretely Sampled Stochastic Policies in Continuous-time Reinforcement Learning
par: Jia, Yanwei, et autres
Publié: (2025)
par: Jia, Yanwei, et autres
Publié: (2025)
Achieve Performatively Optimal Policy for Performative Reinforcement Learning
par: Chen, Ziyi, et autres
Publié: (2025)
par: Chen, Ziyi, et autres
Publié: (2025)
A Differential and Pointwise Control Approach to Reinforcement Learning
par: Nguyen, Minh, et autres
Publié: (2024)
par: Nguyen, Minh, et autres
Publié: (2024)
Deep Reinforcement Learning: A Convex Optimization Approach
par: Gattami, Ather
Publié: (2024)
par: Gattami, Ather
Publié: (2024)
Non-Parametric Learning of Stochastic Differential Equations with Non-asymptotic Fast Rates of Convergence
par: Bonalli, Riccardo, et autres
Publié: (2023)
par: Bonalli, Riccardo, et autres
Publié: (2023)
A Hessian-Aware Stochastic Differential Equation for Modelling SGD
par: Li, Xiang, et autres
Publié: (2024)
par: Li, Xiang, et autres
Publié: (2024)
Control Theoretic Approach to Fine-Tuning and Transfer Learning
par: Bayram, Erkan, et autres
Publié: (2024)
par: Bayram, Erkan, et autres
Publié: (2024)
Single- vs. Dual-Policy Reinforcement Learning for Dynamic Bike Rebalancing
par: Liang, Jiaqi, et autres
Publié: (2024)
par: Liang, Jiaqi, et autres
Publié: (2024)
A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies
par: Nanda, Phalguni, et autres
Publié: (2025)
par: Nanda, Phalguni, et autres
Publié: (2025)
A Graph-Partitioning Based Continuous Optimization Approach to Semi-supervised Clustering Problems
par: Liu, Wei, et autres
Publié: (2025)
par: Liu, Wei, et autres
Publié: (2025)
Learning When to Restart: Nonstationary Newsvendor from Uncensored to Censored Demand
par: Chen, Xin, et autres
Publié: (2025)
par: Chen, Xin, et autres
Publié: (2025)
Learning Dynamic Mechanisms in Unknown Environments: A Reinforcement Learning Approach
par: Qiu, Shuang, et autres
Publié: (2022)
par: Qiu, Shuang, et autres
Publié: (2022)
Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning
par: Li, Zihao, et autres
Publié: (2024)
par: Li, Zihao, et autres
Publié: (2024)
Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods
par: Carmona, René, et autres
Publié: (2019)
par: Carmona, René, et autres
Publié: (2019)
TreeDQN: Sample-Efficient Off-Policy Reinforcement Learning for Combinatorial Optimization
par: Sorokin, D., et autres
Publié: (2023)
par: Sorokin, D., et autres
Publié: (2023)
Sublinear Regret for a Class of Continuous-Time Linear-Quadratic Reinforcement Learning Problems
par: Huang, Yilie, et autres
Publié: (2024)
par: Huang, Yilie, et autres
Publié: (2024)
Signature Approach for Contextual Bandits with Nonlinear and Path-dependent Rewards
par: Guo, Xin, et autres
Publié: (2026)
par: Guo, Xin, et autres
Publié: (2026)
UVIP: Model-Free Approach to Evaluate Reinforcement Learning Algorithms
par: Belomestny, Denis, et autres
Publié: (2021)
par: Belomestny, Denis, et autres
Publié: (2021)
Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning
par: Zeng, Sihan, et autres
Publié: (2024)
par: Zeng, Sihan, et autres
Publié: (2024)
Control, Optimal Transport and Neural Differential Equations in Supervised Learning
par: Phung, Minh-Nhat, et autres
Publié: (2025)
par: Phung, Minh-Nhat, et autres
Publié: (2025)
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
par: Long, Kehan, et autres
Publié: (2025)
par: Long, Kehan, et autres
Publié: (2025)
Reinforcement Learning with Random Time Horizons
par: Borrell, Enric Ribera, et autres
Publié: (2025)
par: Borrell, Enric Ribera, et autres
Publié: (2025)
Deep Reinforcement Learning for Infinite Horizon Mean Field Problems in Continuous Spaces
par: Angiuli, Andrea, et autres
Publié: (2023)
par: Angiuli, Andrea, et autres
Publié: (2023)
Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control
par: Tabas, Sadegh Sadeghi, et autres
Publié: (2024)
par: Tabas, Sadegh Sadeghi, et autres
Publié: (2024)
Sequential Bayesian Optimal Experimental Design in Infinite Dimensions via Policy Gradient Reinforcement Learning
par: Shen, Kaichen, et autres
Publié: (2026)
par: Shen, Kaichen, et autres
Publié: (2026)
Reinforcement Learning Approaches for the Orienteering Problem with Stochastic and Dynamic Release Dates
par: Li, Yuanyuan, et autres
Publié: (2022)
par: Li, Yuanyuan, et autres
Publié: (2022)
Continuous-Time Reinforcement Learning for Asset-Liability Management
par: Huang, Yilie
Publié: (2025)
par: Huang, Yilie
Publié: (2025)
Data-Driven Exploration for a Class of Continuous-Time Indefinite Linear--Quadratic Reinforcement Learning Problems
par: Huang, Yilie, et autres
Publié: (2025)
par: Huang, Yilie, et autres
Publié: (2025)
A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning
par: Zeng, Sihan, et autres
Publié: (2021)
par: Zeng, Sihan, et autres
Publié: (2021)
Documents similaires
-
Deterministic Policy Gradient for Reinforcement Learning with Continuous Time and State
par: Cheng, Ziheng, et autres
Publié: (2025) -
Traversing Pareto Optimal Policies: Provably Efficient Multi-Objective Reinforcement Learning
par: Qiu, Shuang, et autres
Publié: (2024) -
Finite-Time Analysis of On-Policy Heterogeneous Federated Reinforcement Learning
par: Zhang, Chenyu, et autres
Publié: (2024) -
Operator Models for Continuous-Time Offline Reinforcement Learning
par: Hoischen, Nicolas, et autres
Publié: (2025) -
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
par: Wiltzer, Harley, et autres
Publié: (2024)