Saved in:
| Main Authors: | Birkbeck, John, Sobey, Adam, Cerutti, Federico, Flynn, Katherine Heseltine Hurley, Norman, Timothy J. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.03577 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
It's About Time: Temporal References in Emergent Communication
by: Lipinski, Olaf, et al.
Published: (2023)
by: Lipinski, Olaf, et al.
Published: (2023)
HAVA: Hybrid Approach to Value-Alignment through Reward Weighing for Reinforcement Learning
by: Varys, Kryspin, et al.
Published: (2025)
by: Varys, Kryspin, et al.
Published: (2025)
Speaking Your Language: Spatial Relationships in Interpretable Emergent Communication
by: Lipinski, Olaf, et al.
Published: (2024)
by: Lipinski, Olaf, et al.
Published: (2024)
Posterior Sampling Reinforcement Learning with Gaussian Processes for Continuous Control: Sublinear Regret Bounds for Unbounded State Spaces
by: Flynn, Hamish, et al.
Published: (2026)
by: Flynn, Hamish, et al.
Published: (2026)
Methodological Insights into Structural Causal Modelling and Uncertainty-Aware Forecasting for Economic Indicators
by: Cerutti, Federico
Published: (2025)
by: Cerutti, Federico
Published: (2025)
It's Not You, It's Clipping: A Soft Trust-Region via Probability Smoothing for LLM RL
by: Dwyer, Madeleine, et al.
Published: (2025)
by: Dwyer, Madeleine, et al.
Published: (2025)
Lifelong Reinforcement Learning via Neuromodulation
by: Lee, Sebastian, et al.
Published: (2024)
by: Lee, Sebastian, et al.
Published: (2024)
Confidence Sequences for Generalized Linear Models via Regret Analysis
by: Clerico, Eugenio, et al.
Published: (2025)
by: Clerico, Eugenio, et al.
Published: (2025)
Approaches to human activity recognition via passive radar
by: Bresciani, Christian, et al.
Published: (2024)
by: Bresciani, Christian, et al.
Published: (2024)
Stagewise Reinforcement Learning and the Geometry of the Regret Landscape
by: Elliott, Chris, et al.
Published: (2026)
by: Elliott, Chris, et al.
Published: (2026)
No-Regret Reinforcement Learning in Smooth MDPs
by: Maran, Davide, et al.
Published: (2024)
by: Maran, Davide, et al.
Published: (2024)
GenCircuit-RL: Reinforcement Learning from Hierarchical Verification for Genetic Circuit Design
by: Flynn, Noah
Published: (2026)
by: Flynn, Noah
Published: (2026)
Test-Time Regret Minimization in Meta Reinforcement Learning
by: Mutti, Mirco, et al.
Published: (2024)
by: Mutti, Mirco, et al.
Published: (2024)
Logarithmic Regret for Online KL-Regularized Reinforcement Learning
by: Zhao, Heyang, et al.
Published: (2025)
by: Zhao, Heyang, et al.
Published: (2025)
Efficient, Low-Regret, Online Reinforcement Learning for Linear MDPs
by: John, Philips George, et al.
Published: (2024)
by: John, Philips George, et al.
Published: (2024)
Statistical Context Detection for Deep Lifelong Reinforcement Learning
by: Dick, Jeffery, et al.
Published: (2024)
by: Dick, Jeffery, et al.
Published: (2024)
Regret-Free Reinforcement Learning for LTL Specifications
by: Majumdar, Rupak, et al.
Published: (2024)
by: Majumdar, Rupak, et al.
Published: (2024)
Reinforcement Learning and Regret Bounds for Admission Control
by: Weber, Lucas, et al.
Published: (2024)
by: Weber, Lucas, et al.
Published: (2024)
Tail Distribution of Regret in Optimistic Reinforcement Learning
by: Khodadadian, Sajad, et al.
Published: (2025)
by: Khodadadian, Sajad, et al.
Published: (2025)
Regret-Based Defense in Adversarial Reinforcement Learning
by: Belaire, Roman, et al.
Published: (2023)
by: Belaire, Roman, et al.
Published: (2023)
Fast Lifelong Adaptive Inverse Reinforcement Learning from Demonstrations
by: Chen, Letian, et al.
Published: (2022)
by: Chen, Letian, et al.
Published: (2022)
Reinforcement Learning From Imperfect Corrective Actions And Proxy Rewards
by: Jiang, Zhaohui, et al.
Published: (2024)
by: Jiang, Zhaohui, et al.
Published: (2024)
Disentangling Recognition and Decision Regrets in Image-Based Reinforcement Learning
by: Hüyük, Alihan, et al.
Published: (2024)
by: Hüyük, Alihan, et al.
Published: (2024)
Tighter Regret Bounds for Contextual Action-Set Reinforcement Learning
by: Chen, Zijun, et al.
Published: (2026)
by: Chen, Zijun, et al.
Published: (2026)
Optimal Dynamic Regret by Transformers for Non-Stationary Reinforcement Learning
by: Chen, Baiyuan, et al.
Published: (2025)
by: Chen, Baiyuan, et al.
Published: (2025)
Proxy Methods for Domain Adaptation
by: Tsai, Katherine, et al.
Published: (2024)
by: Tsai, Katherine, et al.
Published: (2024)
Transfer in Reinforcement Learning via Regret Bounds for Learning Agents
by: Tuynman, Adrienne, et al.
Published: (2022)
by: Tuynman, Adrienne, et al.
Published: (2022)
Kernelized Reinforcement Learning with Order Optimal Regret Bounds
by: Vakili, Sattar, et al.
Published: (2023)
by: Vakili, Sattar, et al.
Published: (2023)
Information-Theoretic Minimax Regret Bounds for Reinforcement Learning based on Duality
by: Bongole, Raghav, et al.
Published: (2024)
by: Bongole, Raghav, et al.
Published: (2024)
Lifelong Reinforcement Learning with Similarity-Driven Weighting by Large Models
by: Huang, Zhiyi, et al.
Published: (2025)
by: Huang, Zhiyi, et al.
Published: (2025)
Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning
by: Lee, Harin, et al.
Published: (2026)
by: Lee, Harin, et al.
Published: (2026)
Regret Bounds for Reinforcement Learning from Multi-Source Imperfect Preferences
by: Shi, Ming, et al.
Published: (2026)
by: Shi, Ming, et al.
Published: (2026)
Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement Learning
by: Muppidi, Aneesh, et al.
Published: (2024)
by: Muppidi, Aneesh, et al.
Published: (2024)
Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning
by: Moradipari, Ahmadreza, et al.
Published: (2023)
by: Moradipari, Ahmadreza, et al.
Published: (2023)
Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms
by: Xu, Mengfan, et al.
Published: (2020)
by: Xu, Mengfan, et al.
Published: (2020)
Privacy-Aware Lifelong Learning
by: Özdenizci, Ozan, et al.
Published: (2025)
by: Özdenizci, Ozan, et al.
Published: (2025)
Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning
by: Zhang, Haochen, et al.
Published: (2025)
by: Zhang, Haochen, et al.
Published: (2025)
Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learning
by: Vakili, Sattar
Published: (2024)
by: Vakili, Sattar
Published: (2024)
No-Regret Learning in Bilateral Trade via Global Budget Balance
by: Bernasconi, Martino, et al.
Published: (2023)
by: Bernasconi, Martino, et al.
Published: (2023)
Regret-Based Federated Causal Discovery with Unknown Interventions
by: Baldo, Federico, et al.
Published: (2025)
by: Baldo, Federico, et al.
Published: (2025)
Similar Items
-
It's About Time: Temporal References in Emergent Communication
by: Lipinski, Olaf, et al.
Published: (2023) -
HAVA: Hybrid Approach to Value-Alignment through Reward Weighing for Reinforcement Learning
by: Varys, Kryspin, et al.
Published: (2025) -
Speaking Your Language: Spatial Relationships in Interpretable Emergent Communication
by: Lipinski, Olaf, et al.
Published: (2024) -
Posterior Sampling Reinforcement Learning with Gaussian Processes for Continuous Control: Sublinear Regret Bounds for Unbounded State Spaces
by: Flynn, Hamish, et al.
Published: (2026) -
Methodological Insights into Structural Causal Modelling and Uncertainty-Aware Forecasting for Economic Indicators
by: Cerutti, Federico
Published: (2025)