Saved in:
| Main Authors: | Moeini, Amir, Kwon, Minjae, Bozkurt, Alper Kamil, Motai, Yuichi, Chandra, Rohan, Feng, Lu, Zhang, Shangtong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.25582 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning
by: Bozkurt, Alper Kamil, et al.
Published: (2026)
by: Bozkurt, Alper Kamil, et al.
Published: (2026)
Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL
by: Khurram, Aleesha, et al.
Published: (2025)
by: Khurram, Aleesha, et al.
Published: (2025)
A Survey of In-Context Reinforcement Learning
by: Moeini, Amir, et al.
Published: (2025)
by: Moeini, Amir, et al.
Published: (2025)
Reward Is Enough: LLMs Are In-Context Reinforcement Learners
by: Song, Kefan, et al.
Published: (2025)
by: Song, Kefan, et al.
Published: (2025)
Towards Provable Emergence of In-Context Reinforcement Learning
by: Wang, Jiuqi, et al.
Published: (2025)
by: Wang, Jiuqi, et al.
Published: (2025)
Convergence and Emergence of In-Context Reinforcement Learning with Chain of Thought
by: Xie, Zixuan, et al.
Published: (2026)
by: Xie, Zixuan, et al.
Published: (2026)
Model-Free Learning of Safe yet Effective Controllers
by: Bozkurt, Alper Kamil, et al.
Published: (2021)
by: Bozkurt, Alper Kamil, et al.
Published: (2021)
Group Fairness in Multi-Task Reinforcement Learning
by: Song, Kefan, et al.
Published: (2025)
by: Song, Kefan, et al.
Published: (2025)
Adaptive Shielding for Safe Reinforcement Learning under Hidden-Parameter Dynamics Shifts
by: Kwon, Minjae, et al.
Published: (2025)
by: Kwon, Minjae, et al.
Published: (2025)
Beyond Linear Attention: Softmax Transformers Implement In-Context Reinforcement Learning
by: Xie, Zixuan, et al.
Published: (2026)
by: Xie, Zixuan, et al.
Published: (2026)
Experience Replay Addresses Loss of Plasticity in Continual Learning
by: Wang, Jiuqi, et al.
Published: (2025)
by: Wang, Jiuqi, et al.
Published: (2025)
Accelerated Learning with Linear Temporal Logic using Differentiable Simulation
by: Bozkurt, Alper Kamil, et al.
Published: (2025)
by: Bozkurt, Alper Kamil, et al.
Published: (2025)
Control Synthesis from Linear Temporal Logic Specifications using Model-Free Reinforcement Learning
by: Bozkurt, Alper Kamil, et al.
Published: (2019)
by: Bozkurt, Alper Kamil, et al.
Published: (2019)
Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features
by: Xie, Zixuan, et al.
Published: (2025)
by: Xie, Zixuan, et al.
Published: (2025)
Counterfactual Explanations for Continuous Action Reinforcement Learning
by: Dong, Shuyang, et al.
Published: (2025)
by: Dong, Shuyang, et al.
Published: (2025)
Towards Formalizing Reinforcement Learning Theory
by: Zhang, Shangtong
Published: (2025)
by: Zhang, Shangtong
Published: (2025)
Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning
by: Wang, Jiuqi, et al.
Published: (2024)
by: Wang, Jiuqi, et al.
Published: (2024)
Neuro-Logic Lifelong Learning
by: He, Bowen, et al.
Published: (2025)
by: He, Bowen, et al.
Published: (2025)
Learning Optimal Strategies for Temporal Tasks in Stochastic Games
by: Bozkurt, Alper Kamil, et al.
Published: (2021)
by: Bozkurt, Alper Kamil, et al.
Published: (2021)
Adaptive Reward Design for Reinforcement Learning
by: Kwon, Minjae, et al.
Published: (2024)
by: Kwon, Minjae, et al.
Published: (2024)
Doubly Optimal Policy Evaluation for Reinforcement Learning
by: Liu, Shuze Daniel, et al.
Published: (2024)
by: Liu, Shuze Daniel, et al.
Published: (2024)
Efficient Multi-Policy Evaluation for Reinforcement Learning
by: Liu, Shuze Daniel, et al.
Published: (2024)
by: Liu, Shuze Daniel, et al.
Published: (2024)
Towards Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models
by: Song, Kefan, et al.
Published: (2025)
by: Song, Kefan, et al.
Published: (2025)
Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
by: Chen, Claire, et al.
Published: (2024)
by: Chen, Claire, et al.
Published: (2024)
Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning
by: Liu, Xinyu, et al.
Published: (2025)
by: Liu, Xinyu, et al.
Published: (2025)
CRASH: Challenging Reinforcement-Learning Based Adversarial Scenarios For Safety Hardening
by: Kulkarni, Amar, et al.
Published: (2024)
by: Kulkarni, Amar, et al.
Published: (2024)
MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics
by: Liu, Xinyu, et al.
Published: (2026)
by: Liu, Xinyu, et al.
Published: (2026)
Learning to Remember: End-to-End Training of Memory Agents for Long-Context Reasoning
by: Zhang, Kehao, et al.
Published: (2026)
by: Zhang, Kehao, et al.
Published: (2026)
The ODE Method for Stochastic Approximation and Reinforcement Learning with Markovian Noise
by: Liu, Shuze Daniel, et al.
Published: (2024)
by: Liu, Shuze Daniel, et al.
Published: (2024)
Almost Sure Convergence Rates of Stochastic Approximation and Reinforcement Learning via a Poisson-Moreau Drift
by: Liu, Xinyu, et al.
Published: (2026)
by: Liu, Xinyu, et al.
Published: (2026)
Revisiting a Design Choice in Gradient Temporal Difference Learning
by: Qian, Xiaochi, et al.
Published: (2023)
by: Qian, Xiaochi, et al.
Published: (2023)
On the Divergence of Differential Temporal Difference Learning without Local Clocks
by: Antrobius, David, et al.
Published: (2026)
by: Antrobius, David, et al.
Published: (2026)
Convergence of Two-Timescale Markovian Stochastic Approximations with Applications in Reinforcement Learning
by: Mahadevan, Vagul, et al.
Published: (2026)
by: Mahadevan, Vagul, et al.
Published: (2026)
Almost Sure Convergence of Linear Temporal Difference Learning with Arbitrary Features
by: Wang, Jiuqi, et al.
Published: (2024)
by: Wang, Jiuqi, et al.
Published: (2024)
Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise
by: Qian, Xiaochi, et al.
Published: (2024)
by: Qian, Xiaochi, et al.
Published: (2024)
Correlative Information Maximization: A Biologically Plausible Approach to Supervised Deep Neural Networks without Weight Symmetry
by: Bozkurt, Bariscan, et al.
Published: (2023)
by: Bozkurt, Bariscan, et al.
Published: (2023)
GameChat: Multi-LLM Dialogue for Safe, Agile, and Socially Optimal Multi-Agent Navigation in Constrained Environments
by: Mahadevan, Vagul, et al.
Published: (2025)
by: Mahadevan, Vagul, et al.
Published: (2025)
Secure Planning Against Stealthy Attacks via Model-Free Reinforcement Learning
by: Bozkurt, Alper Kamil, et al.
Published: (2020)
by: Bozkurt, Alper Kamil, et al.
Published: (2020)
Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning
by: Cho, Minjae, et al.
Published: (2024)
by: Cho, Minjae, et al.
Published: (2024)
Hierarchical Meta-Reinforcement Learning via Automated Macro-Action Discovery
by: Cho, Minjae, et al.
Published: (2024)
by: Cho, Minjae, et al.
Published: (2024)
Similar Items
-
Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning
by: Bozkurt, Alper Kamil, et al.
Published: (2026) -
Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL
by: Khurram, Aleesha, et al.
Published: (2025) -
A Survey of In-Context Reinforcement Learning
by: Moeini, Amir, et al.
Published: (2025) -
Reward Is Enough: LLMs Are In-Context Reinforcement Learners
by: Song, Kefan, et al.
Published: (2025) -
Towards Provable Emergence of In-Context Reinforcement Learning
by: Wang, Jiuqi, et al.
Published: (2025)