| Main authors: | Hua, Chengxiu; Gu, Jiawen; Tang, Yushun |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2510.17122 |
Similar documents
Score as Action: Fine-Tuning Diffusion Generative Models by Continuous-time Reinforcement Learning
by: Zhao, Hanyang, et al.
Published: (2025)
Operator Models for Continuous-Time Offline Reinforcement Learning
by: Hoischen, Nicolas, et al.
Published: (2025)
Risk-Sensitive Q-Learning in Continuous Time with Application to Dynamic Portfolio Selection
by: Xie, Chuhan
Published: (2025)
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
by: Wiltzer, Harley, et al.
Published: (2024)
Score Matching Diffusion Based Feedback Control and Planning of Nonlinear Systems
by: Elamvazhuthi, Karthik, et al.
Published: (2025)
Policy Transfer for Continuous-Time Reinforcement Learning: A (Rough) Differential Equation Approach
by: Guo, Xin, et al.
Published: (2025)
Q-Measure-Learning for Continuous State RL: Efficient Implementation and Convergence
by: Wang, Shengbo
Published: (2026)
Deterministic Policy Gradient for Reinforcement Learning with Continuous Time and State
by: Cheng, Ziheng, et al.
Published: (2025)
Unified Convergence Analysis for Score-Based Diffusion Models with Deterministic Samplers
by: Li, Runjia, et al.
Published: (2024)
Sublinear Regret for a Class of Continuous-Time Linear-Quadratic Reinforcement Learning Problems
by: Huang, Yilie, et al.
Published: (2024)
Continuous-Time Reinforcement Learning for Asset-Liability Management
by: Huang, Yilie
Published: (2025)
Convergence and stability of Q-learning in Hierarchical Reinforcement Learning
by: Manenti, Massimiliano, et al.
Published: (2025)
Deep Reinforcement Learning for Infinite Horizon Mean Field Problems in Continuous Spaces
by: Angiuli, Andrea, et al.
Published: (2023)
Continuous-time q-Learning for Jump-Diffusion Models under Tsallis Entropy
by: Bo, Lijun, et al.
Published: (2024)
Decoupled Continuous-Time Reinforcement Learning via Hamiltonian Flow
by: Nguyen, Minh
Published: (2026)
Data-Driven Exploration for a Class of Continuous-Time Indefinite Linear-Quadratic Reinforcement Learning Problems
by: Huang, Yilie, et al.
Published: (2025)
Analysis of Multiscale Reinforcement Q-Learning Algorithms for Mean Field Control Games
by: Angiuli, Andrea, et al.
Published: (2024)
Reinforcement Learning for a Discrete-Time Linear-Quadratic Control Problem with an Application
by: Li, Lucky
Published: (2024)
Towards Quantifying the Hessian Structure of Neural Networks
by: Dong, Zhaorui, et al.
Published: (2025)
ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule
by: Huang, Yilie, et al.
Published: (2026)
A Two-Time-Scale Stochastic Optimization Framework with Applications in Control and Reinforcement Learning
by: Zeng, Sihan, et al.
Published: (2021)
Accuracy of Discretely Sampled Stochastic Policies in Continuous-time Reinforcement Learning
by: Jia, Yanwei, et al.
Published: (2025)
Generalized Continuous-Time Models for Nesterov's Accelerated Gradient Methods
by: Park, Chanwoong, et al.
Published: (2024)
On the Interpolation Effect of Score Smoothing in Diffusion Models
by: Chen, Zhengdao
Published: (2025)
A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies
by: Nanda, Phalguni, et al.
Published: (2025)
Adjoint Matching: Fine-tuning Flow and Diffusion Generative Models with Memoryless Stochastic Optimal Control
by: Domingo-Enrich, Carles, et al.
Published: (2024)
Set Invariance with Probability One for Controlled Diffusion: Score-based Approach
by: Wang, Wenqing, et al.
Published: (2025)
Weight-Parameterization in Continuous Time Deep Neural Networks for Surrogate Modeling
by: Rosso, Haley, et al.
Published: (2025)
A General Continuous-Time Formulation of Stochastic ADMM and Its Variants
by: Li, Chris Junchi
Published: (2024)
Global Solutions to Master Equations for Continuous Time Heterogeneous Agent Macroeconomic Models
by: Gu, Zhouzhou, et al.
Published: (2024)
Mitigating Forgetting in Continual Learning with Selective Gradient Projection
by: Singh, Anika, et al.
Published: (2026)
Reinforcement Learning and Regret Bounds for Admission Control
by: Weber, Lucas, et al.
Published: (2024)
From Score Matching to Diffusion: A Fine-Grained Error Analysis in the Gaussian Setting
by: Hurault, Samuel, et al.
Published: (2025)
Leveraging Continuous Time to Understand Momentum When Training Diagonal Linear Networks
by: Papazov, Hristo, et al.
Published: (2024)
Dynamic Controlled Variables Based Dynamic Self-Optimizing Control
by: Zhou, Chenchen, et al.
Published: (2026)
Last Iterate Convergence of Incremental Methods and Applications in Continual Learning
by: Cai, Xufeng, et al.
Published: (2024)
Pessimistic Nonlinear Least-Squares Value Iteration for Offline Reinforcement Learning
by: Di, Qiwei, et al.
Published: (2023)
Convergence of Actor-Critic Learning for Mean Field Games and Mean Field Control in Continuous Spaces
by: Fouque, Jean-Pierre, et al.
Published: (2025)
A Nearly Optimal and Low-Switching Algorithm for Reinforcement Learning with General Function Approximation
by: Zhao, Heyang, et al.
Published: (2023)
Adam Converges Without Any Modification On Update Rules
by: Zhang, Yushun, et al.
Published: (2026)