Saved in:
| Main Authors: | Xu, Wenhan, Jiang, Jiashuo, Deng, Lei, Tsang, Danny Hin-Kwok |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.04291 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Real-Time Network Traffic Forecasting with Missing Data: A Generative Model Approach
by: Deng, Lei, et al.
Published: (2025)
by: Deng, Lei, et al.
Published: (2025)
Adaptive Resolving Methods for Reinforcement Learning with Function Approximations
by: Jiang, Jiashuo, et al.
Published: (2025)
by: Jiang, Jiashuo, et al.
Published: (2025)
Constrained Online Two-stage Stochastic Optimization: Near Optimal Algorithms via Adversarial Learning
by: Jiang, Jiashuo
Published: (2023)
by: Jiang, Jiashuo
Published: (2023)
Off Policy Lyapunov Stability in Reinforcement Learning
by: Gill, Sarvan, et al.
Published: (2025)
by: Gill, Sarvan, et al.
Published: (2025)
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
by: Shen, Han, et al.
Published: (2024)
by: Shen, Han, et al.
Published: (2024)
Finite-Time Minimax Bounds and an Optimal Lyapunov Policy in Queueing Control
by: Liu, Yujie, et al.
Published: (2025)
by: Liu, Yujie, et al.
Published: (2025)
Stability Enhancement in Reinforcement Learning via Adaptive Control Lyapunov Function
by: Chen, Donghe, et al.
Published: (2025)
by: Chen, Donghe, et al.
Published: (2025)
Solving Richly Constrained Reinforcement Learning through State Augmentation and Reward Penalties
by: Jiang, Hao, et al.
Published: (2023)
by: Jiang, Hao, et al.
Published: (2023)
One-Step Sampler for Boltzmann Distributions via Drifting
by: Cao, Wenhan, et al.
Published: (2026)
by: Cao, Wenhan, et al.
Published: (2026)
Adaptive KDE for Real-Time Thresholding: Prioritized Queues for Financial Crime Investigation
by: Butvinik, Danny, et al.
Published: (2026)
by: Butvinik, Danny, et al.
Published: (2026)
Queue-based Eco-Driving at Roundabouts with Reinforcement Learning
by: Schlamp, Anna-Lena, et al.
Published: (2024)
by: Schlamp, Anna-Lena, et al.
Published: (2024)
Enhanced Penalty-based Bidirectional Reinforcement Learning Algorithms
by: Pula, Sai Gana Sandeep, et al.
Published: (2025)
by: Pula, Sai Gana Sandeep, et al.
Published: (2025)
Lyapunov Stability Learning with Nonlinear Control via Inductive Biases
by: Lu, Yupu, et al.
Published: (2025)
by: Lu, Yupu, et al.
Published: (2025)
Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
by: Long, Kehan, et al.
Published: (2025)
by: Long, Kehan, et al.
Published: (2025)
Lyapunov-Stable Adaptive Control for Multimodal Concept Drift
by: Pan, Tianyu Bell, et al.
Published: (2025)
by: Pan, Tianyu Bell, et al.
Published: (2025)
Stable-Drift: A Patient-Aware Latent Drift Replay Method for Stabilizing Representations in Continual Learning
by: Theofilou, Paraskevi-Antonia, et al.
Published: (2025)
by: Theofilou, Paraskevi-Antonia, et al.
Published: (2025)
Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning
by: Xu, Yuanda, et al.
Published: (2026)
by: Xu, Yuanda, et al.
Published: (2026)
Lend a Hand: Semi Training-Free Cued Speech Recognition via MLLM-Driven Hand Modeling for Barrier-free Communication
by: Huang, Guanjie, et al.
Published: (2025)
by: Huang, Guanjie, et al.
Published: (2025)
Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems
by: Jali, Neharika, et al.
Published: (2024)
by: Jali, Neharika, et al.
Published: (2024)
Impact of Computation in Integral Reinforcement Learning for Continuous-Time Control
by: Cao, Wenhan, et al.
Published: (2024)
by: Cao, Wenhan, et al.
Published: (2024)
Online Semi-infinite Linear Programming: Efficient Algorithms via Function Approximation
by: Zong, Yiming, et al.
Published: (2026)
by: Zong, Yiming, et al.
Published: (2026)
Stochastic Penalty-Barrier Methods for Constrained Machine Learning
by: Bosák, Adam, et al.
Published: (2026)
by: Bosák, Adam, et al.
Published: (2026)
FedIN: Federated Intermediate Layers Learning for Model Heterogeneity
by: Chan, Yun-Hin, et al.
Published: (2023)
by: Chan, Yun-Hin, et al.
Published: (2023)
Learning-Based Pricing and Matching for Two-Sided Queues
by: Yang, Zixian, et al.
Published: (2024)
by: Yang, Zixian, et al.
Published: (2024)
Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control
by: Liu, Zifan, et al.
Published: (2024)
by: Liu, Zifan, et al.
Published: (2024)
Reinforcement Learning in Queue-Reactive Models: Application to Optimal Execution
by: Espana, Tomas, et al.
Published: (2025)
by: Espana, Tomas, et al.
Published: (2025)
Learning to Price with Resource Constraints: From Full Information to Machine-Learned Prices
by: Ao, Ruicheng, et al.
Published: (2025)
by: Ao, Ruicheng, et al.
Published: (2025)
Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers
by: Ma, Wenhan, et al.
Published: (2025)
by: Ma, Wenhan, et al.
Published: (2025)
Achieving Instance-dependent Sample Complexity for Constrained Markov Decision Process
by: Jiang, Jiashuo, et al.
Published: (2024)
by: Jiang, Jiashuo, et al.
Published: (2024)
FedPEAT: Convergence of Federated Learning, Parameter-Efficient Fine Tuning, and Emulator Assisted Tuning for Artificial Intelligence Foundation Models with Mobile Edge Computing
by: Chua, Terence Jie, et al.
Published: (2023)
by: Chua, Terence Jie, et al.
Published: (2023)
Reinforcement Learning Based Traffic Signal Design to Minimize Queue Lengths
by: Nandakumar, Anirud, et al.
Published: (2025)
by: Nandakumar, Anirud, et al.
Published: (2025)
Accelerated Gradient Methods for Sparse Statistical Learning with Nonconvex Penalties
by: Yang, Kai, et al.
Published: (2020)
by: Yang, Kai, et al.
Published: (2020)
Restless Bandits with Individual Penalty Constraints: Near-Optimal Indices and Deep Reinforcement Learning
by: Zamir, Nida, et al.
Published: (2026)
by: Zamir, Nida, et al.
Published: (2026)
Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
by: Lopez, Antonio, et al.
Published: (2024)
by: Lopez, Antonio, et al.
Published: (2024)
Admission Control of Quasi-Reversible Queueing Systems: Optimization and Reinforcement Learning
by: Comte, Céline, et al.
Published: (2025)
by: Comte, Céline, et al.
Published: (2025)
Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach
by: Chen, Xuyang, et al.
Published: (2025)
by: Chen, Xuyang, et al.
Published: (2025)
Understanding and Improving Model Averaging in Federated Learning on Heterogeneous Data
by: Zhou, Tailin, et al.
Published: (2023)
by: Zhou, Tailin, et al.
Published: (2023)
Degeneracy is OK: Logarithmic Regret for Network Revenue Management with Indiscrete Distributions
by: Jiang, Jiashuo, et al.
Published: (2022)
by: Jiang, Jiashuo, et al.
Published: (2022)
On Penalty-based Bilevel Gradient Descent Method
by: Shen, Han, et al.
Published: (2023)
by: Shen, Han, et al.
Published: (2023)
Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties Reinforcement Learning
by: Xiang, Violet, et al.
Published: (2025)
by: Xiang, Violet, et al.
Published: (2025)
Similar Items
-
Real-Time Network Traffic Forecasting with Missing Data: A Generative Model Approach
by: Deng, Lei, et al.
Published: (2025) -
Adaptive Resolving Methods for Reinforcement Learning with Function Approximations
by: Jiang, Jiashuo, et al.
Published: (2025) -
Constrained Online Two-stage Stochastic Optimization: Near Optimal Algorithms via Adversarial Learning
by: Jiang, Jiashuo
Published: (2023) -
Off Policy Lyapunov Stability in Reinforcement Learning
by: Gill, Sarvan, et al.
Published: (2025) -
Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
by: Shen, Han, et al.
Published: (2024)