Saved in:
| Main Authors: | Hüyük, Alihan, Koblitz, Arndt Ryo, Mohajeri, Atefeh, Andrews, Matthew |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.13108 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Strategically Linked Decisions in Long-Term Planning and Reinforcement Learning
by: Hüyük, Alihan, et al.
Published: (2025)
by: Hüyük, Alihan, et al.
Published: (2025)
Quantifying Potential Observation Missingness in Inverse Reinforcement Learning
by: Benac, Leo, et al.
Published: (2026)
by: Benac, Leo, et al.
Published: (2026)
Adaptive Experiment Design with Synthetic Controls
by: Hüyük, Alihan, et al.
Published: (2024)
by: Hüyük, Alihan, et al.
Published: (2024)
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
by: Sun, Hao, et al.
Published: (2023)
by: Sun, Hao, et al.
Published: (2023)
Defining Expertise: Applications to Treatment Effect Estimation
by: Hüyük, Alihan, et al.
Published: (2024)
by: Hüyük, Alihan, et al.
Published: (2024)
Towards Regulatory-Confirmed Adaptive Clinical Trials: Machine Learning Opportunities and Solutions
by: Klein, Omer Noy, et al.
Published: (2025)
by: Klein, Omer Noy, et al.
Published: (2025)
Transparent Trade-offs between Properties of Explanations
by: Tadesse, Hiwot Belay, et al.
Published: (2024)
by: Tadesse, Hiwot Belay, et al.
Published: (2024)
Reasoning Elicitation in Language Models via Counterfactual Feedback
by: Hüyük, Alihan, et al.
Published: (2024)
by: Hüyük, Alihan, et al.
Published: (2024)
When is Off-Policy Evaluation (Reward Modeling) Useful in Contextual Bandits? A Data-Centric Perspective
by: Sun, Hao, et al.
Published: (2023)
by: Sun, Hao, et al.
Published: (2023)
To RL or not to RL? An Algorithmic Cheat-Sheet for AI-Based Radio Resource Management
by: Maggi, Lorenzo, et al.
Published: (2024)
by: Maggi, Lorenzo, et al.
Published: (2024)
Compositional Causal Reasoning Evaluation in Language Models
by: Maasch, Jacqueline R. M. A., et al.
Published: (2025)
by: Maasch, Jacqueline R. M. A., et al.
Published: (2025)
Informed Decision-Making through Advancements in Open Set Recognition and Unknown Sample Detection
by: Mahdavi, Atefeh, et al.
Published: (2024)
by: Mahdavi, Atefeh, et al.
Published: (2024)
Stagewise Reinforcement Learning and the Geometry of the Regret Landscape
by: Elliott, Chris, et al.
Published: (2026)
by: Elliott, Chris, et al.
Published: (2026)
Koopman-Based Generalization of Deep Reinforcement Learning With Application to Wireless Communications
by: Termehchi, Atefeh, et al.
Published: (2025)
by: Termehchi, Atefeh, et al.
Published: (2025)
Regret-Based Defense in Adversarial Reinforcement Learning
by: Belaire, Roman, et al.
Published: (2023)
by: Belaire, Roman, et al.
Published: (2023)
No-Regret Reinforcement Learning in Smooth MDPs
by: Maran, Davide, et al.
Published: (2024)
by: Maran, Davide, et al.
Published: (2024)
Regretful Decisions under Label Noise
by: Nagaraj, Sujay, et al.
Published: (2025)
by: Nagaraj, Sujay, et al.
Published: (2025)
Open Problem: Order Optimal Regret Bounds for Kernel-Based Reinforcement Learning
by: Vakili, Sattar
Published: (2024)
by: Vakili, Sattar
Published: (2024)
Test-Time Regret Minimization in Meta Reinforcement Learning
by: Mutti, Mirco, et al.
Published: (2024)
by: Mutti, Mirco, et al.
Published: (2024)
Logarithmic Regret for Online KL-Regularized Reinforcement Learning
by: Zhao, Heyang, et al.
Published: (2025)
by: Zhao, Heyang, et al.
Published: (2025)
Regret-Free Reinforcement Learning for LTL Specifications
by: Majumdar, Rupak, et al.
Published: (2024)
by: Majumdar, Rupak, et al.
Published: (2024)
Reinforcement Learning and Regret Bounds for Admission Control
by: Weber, Lucas, et al.
Published: (2024)
by: Weber, Lucas, et al.
Published: (2024)
Tail Distribution of Regret in Optimistic Reinforcement Learning
by: Khodadadian, Sajad, et al.
Published: (2025)
by: Khodadadian, Sajad, et al.
Published: (2025)
Tighter Regret Bounds for Contextual Action-Set Reinforcement Learning
by: Chen, Zijun, et al.
Published: (2026)
by: Chen, Zijun, et al.
Published: (2026)
Optimal Dynamic Regret by Transformers for Non-Stationary Reinforcement Learning
by: Chen, Baiyuan, et al.
Published: (2025)
by: Chen, Baiyuan, et al.
Published: (2025)
Transfer in Reinforcement Learning via Regret Bounds for Learning Agents
by: Tuynman, Adrienne, et al.
Published: (2022)
by: Tuynman, Adrienne, et al.
Published: (2022)
Kernelized Reinforcement Learning with Order Optimal Regret Bounds
by: Vakili, Sattar, et al.
Published: (2023)
by: Vakili, Sattar, et al.
Published: (2023)
Horizon-Free Regret for Linear Markov Decision Processes
by: Zhang, Zihan, et al.
Published: (2024)
by: Zhang, Zihan, et al.
Published: (2024)
Achieving Constant Regret in Linear Markov Decision Processes
by: Zhang, Weitong, et al.
Published: (2024)
by: Zhang, Weitong, et al.
Published: (2024)
Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes
by: Moon, Sang Bin, et al.
Published: (2024)
by: Moon, Sang Bin, et al.
Published: (2024)
Kernel-Based Function Approximation for Average Reward Reinforcement Learning: An Optimist No-Regret Algorithm
by: Vakili, Sattar, et al.
Published: (2024)
by: Vakili, Sattar, et al.
Published: (2024)
CHIRPs: Change-Induced Regret Proxy metrics for Lifelong Reinforcement Learning
by: Birkbeck, John, et al.
Published: (2024)
by: Birkbeck, John, et al.
Published: (2024)
Unified Framework of Distributional Regret in Multi-Armed Bandits and Reinforcement Learning
by: Lee, Harin, et al.
Published: (2026)
by: Lee, Harin, et al.
Published: (2026)
Regret Bounds for Reinforcement Learning from Multi-Source Imperfect Preferences
by: Shi, Ming, et al.
Published: (2026)
by: Shi, Ming, et al.
Published: (2026)
Rethinking State Disentanglement in Causal Reinforcement Learning
by: Cao, Haiyao, et al.
Published: (2024)
by: Cao, Haiyao, et al.
Published: (2024)
Distorted Distributional Policy Evaluation for Offline Reinforcement Learning
by: Iwaki, Ryo, et al.
Published: (2026)
by: Iwaki, Ryo, et al.
Published: (2026)
Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning
by: Moradipari, Ahmadreza, et al.
Published: (2023)
by: Moradipari, Ahmadreza, et al.
Published: (2023)
Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms
by: Xu, Mengfan, et al.
Published: (2020)
by: Xu, Mengfan, et al.
Published: (2020)
Decomposing Observational Multiplicity in Decision Trees: Leaf and Structural Regret
by: Cavus, Mustafa
Published: (2026)
by: Cavus, Mustafa
Published: (2026)
Logarithmic Regret of Exploration in Average Reward Markov Decision Processes
by: Boone, Victor, et al.
Published: (2025)
by: Boone, Victor, et al.
Published: (2025)
Similar Items
-
Strategically Linked Decisions in Long-Term Planning and Reinforcement Learning
by: Hüyük, Alihan, et al.
Published: (2025) -
Quantifying Potential Observation Missingness in Inverse Reinforcement Learning
by: Benac, Leo, et al.
Published: (2026) -
Adaptive Experiment Design with Synthetic Controls
by: Hüyük, Alihan, et al.
Published: (2024) -
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
by: Sun, Hao, et al.
Published: (2023) -
Defining Expertise: Applications to Treatment Effect Estimation
by: Hüyük, Alihan, et al.
Published: (2024)