:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Xu, Wenhan, Jiang, Jiashuo, Deng, Lei, Tsang, Danny Hin-Kwok
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2506.04291
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Real-Time Network Traffic Forecasting with Missing Data: A Generative Model Approach
by: Deng, Lei, et al.
Published: (2025)

Adaptive Resolving Methods for Reinforcement Learning with Function Approximations
by: Jiang, Jiashuo, et al.
Published: (2025)

Constrained Online Two-stage Stochastic Optimization: Near Optimal Algorithms via Adversarial Learning
by: Jiang, Jiashuo
Published: (2023)

Off Policy Lyapunov Stability in Reinforcement Learning
by: Gill, Sarvan, et al.
Published: (2025)

Principled Penalty-based Methods for Bilevel Reinforcement Learning and RLHF
by: Shen, Han, et al.
Published: (2024)

Finite-Time Minimax Bounds and an Optimal Lyapunov Policy in Queueing Control
by: Liu, Yujie, et al.
Published: (2025)

Stability Enhancement in Reinforcement Learning via Adaptive Control Lyapunov Function
by: Chen, Donghe, et al.
Published: (2025)

Solving Richly Constrained Reinforcement Learning through State Augmentation and Reward Penalties
by: Jiang, Hao, et al.
Published: (2023)

One-Step Sampler for Boltzmann Distributions via Drifting
by: Cao, Wenhan, et al.
Published: (2026)

Adaptive KDE for Real-Time Thresholding: Prioritized Queues for Financial Crime Investigation
by: Butvinik, Danny, et al.
Published: (2026)

Queue-based Eco-Driving at Roundabouts with Reinforcement Learning
by: Schlamp, Anna-Lena, et al.
Published: (2024)

Enhanced Penalty-based Bidirectional Reinforcement Learning Algorithms
by: Pula, Sai Gana Sandeep, et al.
Published: (2025)

Lyapunov Stability Learning with Nonlinear Control via Inductive Biases
by: Lu, Yupu, et al.
Published: (2025)

Certifying Stability of Reinforcement Learning Policies using Generalized Lyapunov Functions
by: Long, Kehan, et al.
Published: (2025)

Lyapunov-Stable Adaptive Control for Multimodal Concept Drift
by: Pan, Tianyu Bell, et al.
Published: (2025)

Stable-Drift: A Patient-Aware Latent Drift Replay Method for Stabilizing Representations in Continual Learning
by: Theofilou, Paraskevi-Antonia, et al.
Published: (2025)

Overconfident Errors Need Stronger Correction: Asymmetric Confidence Penalties for Reinforcement Learning
by: Xu, Yuanda, et al.
Published: (2026)

Lend a Hand: Semi Training-Free Cued Speech Recognition via MLLM-Driven Hand Modeling for Barrier-free Communication
by: Huang, Guanjie, et al.
Published: (2025)

Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems
by: Jali, Neharika, et al.
Published: (2024)

Impact of Computation in Integral Reinforcement Learning for Continuous-Time Control
by: Cao, Wenhan, et al.
Published: (2024)

Online Semi-infinite Linear Programming: Efficient Algorithms via Function Approximation
by: Zong, Yiming, et al.
Published: (2026)

Stochastic Penalty-Barrier Methods for Constrained Machine Learning
by: Bosák, Adam, et al.
Published: (2026)

FedIN: Federated Intermediate Layers Learning for Model Heterogeneity
by: Chan, Yun-Hin, et al.
Published: (2023)

Learning-Based Pricing and Matching for Two-Sided Queues
by: Yang, Zixian, et al.
Published: (2024)

Reinforcement Learning with Intrinsically Motivated Feedback Graph for Lost-sales Inventory Control
by: Liu, Zifan, et al.
Published: (2024)

Reinforcement Learning in Queue-Reactive Models: Application to Optimal Execution
by: Espana, Tomas, et al.
Published: (2025)

Learning to Price with Resource Constraints: From Full Information to Machine-Learned Prices
by: Ao, Ruicheng, et al.
Published: (2025)

Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers
by: Ma, Wenhan, et al.
Published: (2025)

Achieving Instance-dependent Sample Complexity for Constrained Markov Decision Process
by: Jiang, Jiashuo, et al.
Published: (2024)

FedPEAT: Convergence of Federated Learning, Parameter-Efficient Fine Tuning, and Emulator Assisted Tuning for Artificial Intelligence Foundation Models with Mobile Edge Computing
by: Chua, Terence Jie, et al.
Published: (2023)

Reinforcement Learning Based Traffic Signal Design to Minimize Queue Lengths
by: Nandakumar, Anirud, et al.
Published: (2025)

Accelerated Gradient Methods for Sparse Statistical Learning with Nonconvex Penalties
by: Yang, Kai, et al.
Published: (2020)

Restless Bandits with Individual Penalty Constraints: Near-Optimal Indices and Deep Reinforcement Learning
by: Zamir, Nida, et al.
Published: (2026)

Decomposing Control Lyapunov Functions for Efficient Reinforcement Learning
by: Lopez, Antonio, et al.
Published: (2024)

Admission Control of Quasi-Reversible Queueing Systems: Optimization and Reinforcement Learning
by: Comte, Céline, et al.
Published: (2025)

Taming OOD Actions for Offline Reinforcement Learning: An Advantage-Based Approach
by: Chen, Xuyang, et al.
Published: (2025)

Understanding and Improving Model Averaging in Federated Learning on Heterogeneous Data
by: Zhou, Tailin, et al.
Published: (2023)

Degeneracy is OK: Logarithmic Regret for Network Revenue Management with Indiscrete Distributions
by: Jiang, Jiashuo, et al.
Published: (2022)

On Penalty-based Bilevel Gradient Descent Method
by: Shen, Han, et al.
Published: (2023)

Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties Reinforcement Learning
by: Xiang, Violet, et al.
Published: (2025)