:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Moeini, Amir, Kwon, Minjae, Bozkurt, Alper Kamil, Motai, Yuichi, Chandra, Rohan, Feng, Lu, Zhang, Shangtong
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2509.25582
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning
by: Bozkurt, Alper Kamil, et al.
Published: (2026)

Prompt-Driven Domain Adaptation for End-to-End Autonomous Driving via In-Context RL
by: Khurram, Aleesha, et al.
Published: (2025)

A Survey of In-Context Reinforcement Learning
by: Moeini, Amir, et al.
Published: (2025)

Reward Is Enough: LLMs Are In-Context Reinforcement Learners
by: Song, Kefan, et al.
Published: (2025)

Towards Provable Emergence of In-Context Reinforcement Learning
by: Wang, Jiuqi, et al.
Published: (2025)

Convergence and Emergence of In-Context Reinforcement Learning with Chain of Thought
by: Xie, Zixuan, et al.
Published: (2026)

Model-Free Learning of Safe yet Effective Controllers
by: Bozkurt, Alper Kamil, et al.
Published: (2021)

Group Fairness in Multi-Task Reinforcement Learning
by: Song, Kefan, et al.
Published: (2025)

Adaptive Shielding for Safe Reinforcement Learning under Hidden-Parameter Dynamics Shifts
by: Kwon, Minjae, et al.
Published: (2025)

Beyond Linear Attention: Softmax Transformers Implement In-Context Reinforcement Learning
by: Xie, Zixuan, et al.
Published: (2026)

Experience Replay Addresses Loss of Plasticity in Continual Learning
by: Wang, Jiuqi, et al.
Published: (2025)

Accelerated Learning with Linear Temporal Logic using Differentiable Simulation
by: Bozkurt, Alper Kamil, et al.
Published: (2025)

Control Synthesis from Linear Temporal Logic Specifications using Model-Free Reinforcement Learning
by: Bozkurt, Alper Kamil, et al.
Published: (2019)

Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features
by: Xie, Zixuan, et al.
Published: (2025)

Counterfactual Explanations for Continuous Action Reinforcement Learning
by: Dong, Shuyang, et al.
Published: (2025)

Towards Formalizing Reinforcement Learning Theory
by: Zhang, Shangtong
Published: (2025)

Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning
by: Wang, Jiuqi, et al.
Published: (2024)

Neuro-Logic Lifelong Learning
by: He, Bowen, et al.
Published: (2025)

Learning Optimal Strategies for Temporal Tasks in Stochastic Games
by: Bozkurt, Alper Kamil, et al.
Published: (2021)

Adaptive Reward Design for Reinforcement Learning
by: Kwon, Minjae, et al.
Published: (2024)

Doubly Optimal Policy Evaluation for Reinforcement Learning
by: Liu, Shuze Daniel, et al.
Published: (2024)

Efficient Multi-Policy Evaluation for Reinforcement Learning
by: Liu, Shuze Daniel, et al.
Published: (2024)

Towards Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models
by: Song, Kefan, et al.
Published: (2025)

Efficient Policy Evaluation with Safety Constraint for Reinforcement Learning
by: Chen, Claire, et al.
Published: (2024)

Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning
by: Liu, Xinyu, et al.
Published: (2025)

CRASH: Challenging Reinforcement-Learning Based Adversarial Scenarios For Safety Hardening
by: Kulkarni, Amar, et al.
Published: (2024)

MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics
by: Liu, Xinyu, et al.
Published: (2026)

Learning to Remember: End-to-End Training of Memory Agents for Long-Context Reasoning
by: Zhang, Kehao, et al.
Published: (2026)

The ODE Method for Stochastic Approximation and Reinforcement Learning with Markovian Noise
by: Liu, Shuze Daniel, et al.
Published: (2024)

Almost Sure Convergence Rates of Stochastic Approximation and Reinforcement Learning via a Poisson-Moreau Drift
by: Liu, Xinyu, et al.
Published: (2026)

Revisiting a Design Choice in Gradient Temporal Difference Learning
by: Qian, Xiaochi, et al.
Published: (2023)

On the Divergence of Differential Temporal Difference Learning without Local Clocks
by: Antrobius, David, et al.
Published: (2026)

Convergence of Two-Timescale Markovian Stochastic Approximations with Applications in Reinforcement Learning
by: Mahadevan, Vagul, et al.
Published: (2026)

Almost Sure Convergence of Linear Temporal Difference Learning with Arbitrary Features
by: Wang, Jiuqi, et al.
Published: (2024)

Almost Sure Convergence Rates and Concentration of Stochastic Approximation and Reinforcement Learning with Markovian Noise
by: Qian, Xiaochi, et al.
Published: (2024)

Correlative Information Maximization: A Biologically Plausible Approach to Supervised Deep Neural Networks without Weight Symmetry
by: Bozkurt, Bariscan, et al.
Published: (2023)

GameChat: Multi-LLM Dialogue for Safe, Agile, and Socially Optimal Multi-Agent Navigation in Constrained Environments
by: Mahadevan, Vagul, et al.
Published: (2025)

Secure Planning Against Stealthy Attacks via Model-Free Reinforcement Learning
by: Bozkurt, Alper Kamil, et al.
Published: (2020)

Sparsity-based Safety Conservatism for Constrained Offline Reinforcement Learning
by: Cho, Minjae, et al.
Published: (2024)

Hierarchical Meta-Reinforcement Learning via Automated Macro-Action Discovery
by: Cho, Minjae, et al.
Published: (2024)