| Main Authors: | Moisescu-Pareja, Gabriela, McCracken, Gavin, Wiltzer, Harley, Létourneau, Vincent, Daniels, Colin, Precup, Doina, Love, Jonathan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.25060 |
Similar Items
Uncovering a Universal Abstract Algorithm for Modular Addition in Neural Networks
by: McCracken, Gavin, et al.
Published: (2025)
On the Privacy of Selection Mechanisms with Gaussian Noise
by: Lebensold, Jonathan, et al.
Published: (2024)
Tractable Representations for Convergent Approximation of Distributional HJB Equations
by: Alhosh, Julie, et al.
Published: (2025)
Conditions on Preference Relations that Guarantee the Existence of Optimal Policies
by: Carr, Jonathan Colaço, et al.
Published: (2023)
Diversity-Enriched Option-Critic
by: Kamat, Anand, et al.
Published: (2020)
Functional Acceleration for Policy Mirror Descent
by: Chelu, Veronica, et al.
Published: (2024)
A Look at Value-Based Decision-Time vs. Background Planning Methods Across Different Settings
by: Alver, Safa, et al.
Published: (2022)
Balancing Plasticity and Stability with Fast and Slow Successor Features
by: Chua, Raymond, et al.
Published: (2026)
Foundations of Multivariate Distributional Reinforcement Learning
by: Wiltzer, Harley, et al.
Published: (2024)
Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
by: Arnob, Samin Yeasar, et al.
Published: (2025)
Adaptive Exploration for Data-Efficient General Value Function Evaluations
by: Jain, Arushi, et al.
Published: (2024)
Partial Models for Building Adaptive Model-Based Reinforcement Learning Agents
by: Alver, Safa, et al.
Published: (2024)
Reinforcement Learning with Pairwise Preferences in Long-Term Decision Problems
by: Carr, Jonathan Colaço, et al.
Published: (2026)
Relative Trajectory Balance is equivalent to Trust-PCL
by: Deleu, Tristan, et al.
Published: (2025)
Convergence Theorems for Entropy-Regularized and Distributional Reinforcement Learning
by: Jhaveri, Yash, et al.
Published: (2025)
Fluid-Agent Reinforcement Learning
by: Sharma, Shishir, et al.
Published: (2026)
Parseval Regularization for Continual Reinforcement Learning
by: Chung, Wesley, et al.
Published: (2024)
Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
by: Ishfaq, Haque, et al.
Published: (2025)
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
by: Wiltzer, Harley, et al.
Published: (2024)
Incorporating Spatial Information into Goal-Conditioned Hierarchical Reinforcement Learning via Graph Representations
by: Zhang, Shuyuan, et al.
Published: (2025)
Discrete Probabilistic Inference as Control in Multi-path Environments
by: Deleu, Tristan, et al.
Published: (2024)
KerJEPA: Kernel Discrepancies for Euclidean Self-Supervised Learning
by: Zimmermann, Eric, et al.
Published: (2025)
Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
by: Rahn, Nate, et al.
Published: (2023)
Mitigating Downstream Model Risks via Model Provenance
by: Wang, Keyu, et al.
Published: (2024)
Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
by: Patil, Gandharv, et al.
Published: (2022)
Learning Successor Features the Simple Way
by: Chua, Raymond, et al.
Published: (2024)
QGFN: Controllable Greediness with Action Values
by: Lau, Elaine, et al.
Published: (2024)
Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
by: Jain, Arnav Kumar, et al.
Published: (2024)
School Library Media Specialists' Perceptions of Practice and Importance of Roles Described in "Information Power".
by: McCracken, Anne
Published: (2001)
Capacity-Constrained Continual Learning
by: Wen, Zheng, et al.
Published: (2025)
Fairness in Reinforcement Learning with Bisimulation Metrics
by: Rezaei-Shoshtari, Sahand, et al.
Published: (2024)
Policy Gradient Methods in the Presence of Symmetries and State Abstractions
by: Panangaden, Prakash, et al.
Published: (2023)
Understanding Behavioral Metric Learning: A Large-Scale Study on Distracting Reinforcement Learning Environments
by: Luo, Ziyan, et al.
Published: (2025)
Code as Reward: Empowering Reinforcement Learning with VLMs
by: Venuto, David, et al.
Published: (2024)
Effective Protein-Protein Interaction Exploration with PPIretrieval
by: Hua, Chenqing, et al.
Published: (2024)
Affordances Enable Partial World Modeling with LLMs
by: Khetarpal, Khimya, et al.
Published: (2026)
A Distributional Analogue to the Successor Representation
by: Wiltzer, Harley, et al.
Published: (2024)
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning
by: Zhao, Mingde, et al.
Published: (2023)
Rotation-Preserving Supervised Fine-Tuning
by: Jin, Hangzhan, et al.
Published: (2026)
MUDiff: Unified Diffusion for Complete Molecule Generation
by: Hua, Chenqing, et al.
Published: (2023)