Saved in:
| Main Authors: | Cheikhi, David, Russo, Daniel |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2301.13289 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
On the Limited Representational Power of Value Functions and its Links to Statistical (In)Efficiency
by: Cheikhi, David, et al.
Published: (2024)
by: Cheikhi, David, et al.
Published: (2024)
Statistical Inference for Temporal Difference Learning with Linear Function Approximation
by: Wu, Weichen, et al.
Published: (2024)
by: Wu, Weichen, et al.
Published: (2024)
Statistical Efficiency of Distributional Temporal Difference Learning and Freedman's Inequality in Hilbert Spaces
by: Peng, Yang, et al.
Published: (2024)
by: Peng, Yang, et al.
Published: (2024)
On the Divergence of Differential Temporal Difference Learning without Local Clocks
by: Antrobius, David, et al.
Published: (2026)
by: Antrobius, David, et al.
Published: (2026)
An Analysis of Quantile Temporal-Difference Learning
by: Rowland, Mark, et al.
Published: (2023)
by: Rowland, Mark, et al.
Published: (2023)
Simplifying Deep Temporal Difference Learning
by: Gallici, Matteo, et al.
Published: (2024)
by: Gallici, Matteo, et al.
Published: (2024)
Discerning Temporal Difference Learning
by: Ma, Jianfei
Published: (2023)
by: Ma, Jianfei
Published: (2023)
Backstepping Temporal Difference Learning
by: Lim, Han-Dong, et al.
Published: (2023)
by: Lim, Han-Dong, et al.
Published: (2023)
Explainable Reinforcement Learning via Temporal Policy Decomposition
by: Ruggeri, Franco, et al.
Published: (2025)
by: Ruggeri, Franco, et al.
Published: (2025)
Towards Parameter-Free Temporal Difference Learning
by: Li, Yunxiang, et al.
Published: (2026)
by: Li, Yunxiang, et al.
Published: (2026)
Temporal Difference Learning with Constrained Initial Representations
by: Lyu, Jiafei, et al.
Published: (2026)
by: Lyu, Jiafei, et al.
Published: (2026)
Reinforcement Learning From State and Temporal Differences
by: Weaver, Lex, et al.
Published: (2025)
by: Weaver, Lex, et al.
Published: (2025)
New Versions of Gradient Temporal Difference Learning
by: Lee, Donghwan, et al.
Published: (2021)
by: Lee, Donghwan, et al.
Published: (2021)
Revisiting Prefix-tuning: Statistical Benefits of Reparameterization among Prompts
by: Le, Minh, et al.
Published: (2024)
by: Le, Minh, et al.
Published: (2024)
The Benefits of Temporal Correlations: SGD Learns k-Juntas from Random Walks Efficiently
by: Cornacchia, Elisabetta, et al.
Published: (2026)
by: Cornacchia, Elisabetta, et al.
Published: (2026)
Temporal-Difference Variational Continual Learning
by: Melo, Luckeciano C., et al.
Published: (2024)
by: Melo, Luckeciano C., et al.
Published: (2024)
Gradient Iterated Temporal-Difference Learning
by: Vincent, Théo, et al.
Published: (2026)
by: Vincent, Théo, et al.
Published: (2026)
n-Step Temporal Difference Learning with Optimal n
by: Mandal, Lakshmi, et al.
Published: (2023)
by: Mandal, Lakshmi, et al.
Published: (2023)
Implicit Updates for Average-Reward Temporal Difference Learning
by: Kim, Hwanwoo, et al.
Published: (2025)
by: Kim, Hwanwoo, et al.
Published: (2025)
Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning
by: Wang, Jiuqi, et al.
Published: (2024)
by: Wang, Jiuqi, et al.
Published: (2024)
Demystifying the Recency Heuristic in Temporal-Difference Learning
by: Daley, Brett, et al.
Published: (2024)
by: Daley, Brett, et al.
Published: (2024)
Exponential Moving Average of Weights in Deep Learning: Dynamics and Benefits
by: Morales-Brotons, Daniel, et al.
Published: (2024)
by: Morales-Brotons, Daniel, et al.
Published: (2024)
Revisiting a Design Choice in Gradient Temporal Difference Learning
by: Qian, Xiaochi, et al.
Published: (2023)
by: Qian, Xiaochi, et al.
Published: (2023)
Accelerated Distributional Temporal Difference Learning with Linear Function Approximation
by: Jin, Kaicheng, et al.
Published: (2025)
by: Jin, Kaicheng, et al.
Published: (2025)
Temporal Difference Learning for High-Dimensional PIDEs with Jumps
by: Lu, Liwei, et al.
Published: (2023)
by: Lu, Liwei, et al.
Published: (2023)
Is Temporal Difference Learning the Gold Standard for Stitching in RL?
by: Bortkiewicz, Michał, et al.
Published: (2025)
by: Bortkiewicz, Michał, et al.
Published: (2025)
A Variance Minimization Approach to Temporal-Difference Learning
by: Chen, Xingguo, et al.
Published: (2024)
by: Chen, Xingguo, et al.
Published: (2024)
Success Conditioning as Policy Improvement: The Optimization Problem Solved by Imitating Success
by: Russo, Daniel
Published: (2026)
by: Russo, Daniel
Published: (2026)
Worst-Case Regret Bounds for Exploration via Randomized Value Functions
by: Russo, Daniel
Published: (2019)
by: Russo, Daniel
Published: (2019)
Temporal Difference Flows
by: Farebrother, Jesse, et al.
Published: (2025)
by: Farebrother, Jesse, et al.
Published: (2025)
On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating
by: Nguyen, Huy, et al.
Published: (2025)
by: Nguyen, Huy, et al.
Published: (2025)
Sparse Autoencoders Reveal Temporal Difference Learning in Large Language Models
by: Demircan, Can, et al.
Published: (2024)
by: Demircan, Can, et al.
Published: (2024)
Stabilizing Temporal Difference Learning via Implicit Stochastic Recursion
by: Kim, Hwanwoo, et al.
Published: (2025)
by: Kim, Hwanwoo, et al.
Published: (2025)
Collision Probability Distribution Estimation via Temporal Difference Learning
by: Steinecker, Thomas, et al.
Published: (2024)
by: Steinecker, Thomas, et al.
Published: (2024)
An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models
by: Pan, Yangchen, et al.
Published: (2024)
by: Pan, Yangchen, et al.
Published: (2024)
Optimizing Adaptive Experiments: A Unified Approach to Regret Minimization and Best-Arm Identification
by: Qin, Chao, et al.
Published: (2024)
by: Qin, Chao, et al.
Published: (2024)
Temporal-Difference Learning Using Distributed Error Signals
by: Guan, Jonas, et al.
Published: (2024)
by: Guan, Jonas, et al.
Published: (2024)
Model-Free Active Exploration in Reinforcement Learning
by: Russo, Alessio, et al.
Published: (2024)
by: Russo, Alessio, et al.
Published: (2024)
Gauss-Newton Temporal Difference Learning with Nonlinear Function Approximation
by: Ke, Zhifa, et al.
Published: (2023)
by: Ke, Zhifa, et al.
Published: (2023)
Finite-Time Analysis of Temporal Difference Learning with Experience Replay
by: Lim, Han-Dong, et al.
Published: (2023)
by: Lim, Han-Dong, et al.
Published: (2023)
Similar Items
-
On the Limited Representational Power of Value Functions and its Links to Statistical (In)Efficiency
by: Cheikhi, David, et al.
Published: (2024) -
Statistical Inference for Temporal Difference Learning with Linear Function Approximation
by: Wu, Weichen, et al.
Published: (2024) -
Statistical Efficiency of Distributional Temporal Difference Learning and Freedman's Inequality in Hilbert Spaces
by: Peng, Yang, et al.
Published: (2024) -
On the Divergence of Differential Temporal Difference Learning without Local Clocks
by: Antrobius, David, et al.
Published: (2026) -
An Analysis of Quantile Temporal-Difference Learning
by: Rowland, Mark, et al.
Published: (2023)