:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Chen, Zijun, Zhang, Zihan
Natura:	Preprint
Pubblicazione:	2026
Soggetti:	Machine Learning
Accesso online:	https://arxiv.org/abs/2605.15692
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

Improved Regret Bound for Safe Reinforcement Learning via Tighter Cost Pessimism and Reward Optimism
di: Yu, Kihyun, et al.
Pubblicazione: (2024)

Posterior Sampling-Based Bayesian Optimization with Tighter Bayesian Regret Bounds
di: Takeno, Shion, et al.
Pubblicazione: (2023)

Tighter Regret Lower Bound for Gaussian Process Bandits with Squared Exponential Kernel in Hypersphere
di: Iwazaki, Shogo
Pubblicazione: (2026)

Queue Length Regret Bounds for Contextual Queueing Bandits
di: Bae, Seoungbin, et al.
Pubblicazione: (2026)

Tighter Generalisation Bounds via Interpolation
di: Viallard, Paul, et al.
Pubblicazione: (2024)

Sharp Gap-Dependent Variance-Aware Regret Bounds for Tabular MDPs
di: Chen, Shulun, et al.
Pubblicazione: (2025)

Regret Bounds for Noise-Free Cascaded Kernelized Bandits
di: Li, Zihan, et al.
Pubblicazione: (2022)

Reinforcement Learning and Regret Bounds for Admission Control
di: Weber, Lucas, et al.
Pubblicazione: (2024)

Variance-Dependent Regret Lower Bounds for Contextual Bandits
di: He, Jiafan, et al.
Pubblicazione: (2025)

Tighter Risk Bounds for Mixtures of Experts
di: Akretche, Wissam, et al.
Pubblicazione: (2024)

Tighter Confidence Bounds for Sequential Kernel Regression
di: Flynn, Hamish, et al.
Pubblicazione: (2024)

Kernelized Reinforcement Learning with Order Optimal Regret Bounds
di: Vakili, Sattar, et al.
Pubblicazione: (2023)

Variance-Aware Regret Bounds for Stochastic Contextual Dueling Bandits
di: Di, Qiwei, et al.
Pubblicazione: (2023)

Improved Regret Bounds of (Multinomial) Logistic Bandits via Regret-to-Confidence-Set Conversion
di: Lee, Junghyun, et al.
Pubblicazione: (2023)

Transfer in Reinforcement Learning via Regret Bounds for Learning Agents
di: Tuynman, Adrienne, et al.
Pubblicazione: (2022)

Regret Bounds for Reinforcement Learning from Multi-Source Imperfect Preferences
di: Shi, Ming, et al.
Pubblicazione: (2026)

Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback
di: Levy, Orin, et al.
Pubblicazione: (2025)

Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning
di: Moradipari, Ahmadreza, et al.
Pubblicazione: (2023)

Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms
di: Xu, Mengfan, et al.
Pubblicazione: (2020)

Near-optimal Per-Action Regret Bounds for Sleeping Bandits
di: Nguyen, Quan, et al.
Pubblicazione: (2024)

Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learning
di: Vakili, Sattar
Pubblicazione: (2024)

Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds
di: Liang, Hao, et al.
Pubblicazione: (2022)

Information-Theoretic Minimax Regret Bounds for Reinforcement Learning based on Duality
di: Bongole, Raghav, et al.
Pubblicazione: (2024)

Finite and Corruption-Robust Regret Bounds in Online Inverse Linear Optimization under M-Convex Action Sets
di: Oki, Taihei, et al.
Pubblicazione: (2026)

Label-NTK Alignments and A Tighter Convergence Bound in the NTK Regime
di: Marreddy, Ruchirinkil, et al.
Pubblicazione: (2026)

How Does Variance Shape the Regret in Contextual Bandits?
di: Jia, Zeyu, et al.
Pubblicazione: (2024)

Horizon-Free Regret for Linear Markov Decision Processes
di: Zhang, Zihan, et al.
Pubblicazione: (2024)

Optimal Regret for Policy Optimization in Contextual Bandits
di: Levy, Orin, et al.
Pubblicazione: (2026)

Fast Best-in-Class Regret for Contextual Bandits
di: Girard, Samuel, et al.
Pubblicazione: (2025)

Eluder-based Regret for Stochastic Contextual MDPs
di: Levy, Orin, et al.
Pubblicazione: (2022)

Tighter CMI-Based Generalization Bounds via Stochastic Projection and Quantization
di: Sefidgaran, Milad, et al.
Pubblicazione: (2025)

Gradient-Variation Regret Bounds for Unconstrained Online Learning
di: Zhao, Yuheng, et al.
Pubblicazione: (2026)

Logarithmic Regret for Online KL-Regularized Reinforcement Learning
di: Zhao, Heyang, et al.
Pubblicazione: (2025)

Optimal Dynamic Regret by Transformers for Non-Stationary Reinforcement Learning
di: Chen, Baiyuan, et al.
Pubblicazione: (2025)

Regret Guarantees for Linear Contextual Stochastic Shortest Path
di: Polikar, Dor, et al.
Pubblicazione: (2025)

Stagewise Reinforcement Learning and the Geometry of the Regret Landscape
di: Elliott, Chris, et al.
Pubblicazione: (2026)

No-Regret Reinforcement Learning in Smooth MDPs
di: Maran, Davide, et al.
Pubblicazione: (2024)

Achieving Tractable Minimax Optimal Regret in Average Reward MDPs
di: Boone, Victor, et al.
Pubblicazione: (2024)

Posterior Sampling Reinforcement Learning with Gaussian Processes for Continuous Control: Sublinear Regret Bounds for Unbounded State Spaces
di: Flynn, Hamish, et al.
Pubblicazione: (2026)

Improved Kernel Alignment Regret Bound for Online Kernel Learning
di: Li, Junfan, et al.
Pubblicazione: (2022)