:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Moisescu-Pareja, Gabriela, McCracken, Gavin, Wiltzer, Harley, Létourneau, Vincent, Daniels, Colin, Precup, Doina, Love, Jonathan
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2512.25060
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Uncovering a Universal Abstract Algorithm for Modular Addition in Neural Networks
by: McCracken, Gavin, et al.
Published: (2025)

On the Privacy of Selection Mechanisms with Gaussian Noise
by: Lebensold, Jonathan, et al.
Published: (2024)

Tractable Representations for Convergent Approximation of Distributional HJB Equations
by: Alhosh, Julie, et al.
Published: (2025)

Conditions on Preference Relations that Guarantee the Existence of Optimal Policies
by: Carr, Jonathan Colaço, et al.
Published: (2023)

Diversity-Enriched Option-Critic
by: Kamat, Anand, et al.
Published: (2020)

Functional Acceleration for Policy Mirror Descent
by: Chelu, Veronica, et al.
Published: (2024)

A Look at Value-Based Decision-Time vs. Background Planning Methods Across Different Settings
by: Alver, Safa, et al.
Published: (2022)

Balancing Plasticity and Stability with Fast and Slow Successor Features
by: Chua, Raymond, et al.
Published: (2026)

Foundations of Multivariate Distributional Reinforcement Learning
by: Wiltzer, Harley, et al.
Published: (2024)

Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity
by: Arnob, Samin Yeasar, et al.
Published: (2025)

Adaptive Exploration for Data-Efficient General Value Function Evaluations
by: Jain, Arushi, et al.
Published: (2024)

Partial Models for Building Adaptive Model-Based Reinforcement Learning Agents
by: Alver, Safa, et al.
Published: (2024)

Reinforcement Learning with Pairwise Preferences in Long-Term Decision Problems
by: Carr, Jonathan Colaço, et al.
Published: (2026)

Relative Trajectory Balance is equivalent to Trust-PCL
by: Deleu, Tristan, et al.
Published: (2025)

Convergence Theorems for Entropy-Regularized and Distributional Reinforcement Learning
by: Jhaveri, Yash, et al.
Published: (2025)

Fluid-Agent Reinforcement Learning
by: Sharma, Shishir, et al.
Published: (2026)

Parseval Regularization for Continual Reinforcement Learning
by: Chung, Wesley, et al.
Published: (2024)

Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
by: Ishfaq, Haque, et al.
Published: (2025)

Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning
by: Wiltzer, Harley, et al.
Published: (2024)

Incorporating Spatial Information into Goal-Conditioned Hierarchical Reinforcement Learning via Graph Representations
by: Zhang, Shuyuan, et al.
Published: (2025)

Discrete Probabilistic Inference as Control in Multi-path Environments
by: Deleu, Tristan, et al.
Published: (2024)

KerJEPA: Kernel Discrepancies for Euclidean Self-Supervised Learning
by: Zimmermann, Eric, et al.
Published: (2025)

Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
by: Rahn, Nate, et al.
Published: (2023)

Mitigating Downstream Model Risks via Model Provenance
by: Wang, Keyu, et al.
Published: (2024)

Finite time analysis of temporal difference learning with linear function approximation: Tail averaging and regularisation
by: Patil, Gandharv, et al.
Published: (2022)

Learning Successor Features the Simple Way
by: Chua, Raymond, et al.
Published: (2024)

QGFN: Controllable Greediness with Action Values
by: Lau, Elaine, et al.
Published: (2024)

Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching
by: Jain, Arnav Kumar, et al.
Published: (2024)

School Library Media Specialists' Perceptions of Practice and Importance of Roles Described in "Information Power".
by: McCracken, Anne
Published: (2001)

Capacity-Constrained Continual Learning
by: Wen, Zheng, et al.
Published: (2025)

Fairness in Reinforcement Learning with Bisimulation Metrics
by: Rezaei-Shoshtari, Sahand, et al.
Published: (2024)

Policy Gradient Methods in the Presence of Symmetries and State Abstractions
by: Panangaden, Prakash, et al.
Published: (2023)

Understanding Behavioral Metric Learning: A Large-Scale Study on Distracting Reinforcement Learning Environments
by: Luo, Ziyan, et al.
Published: (2025)

Code as Reward: Empowering Reinforcement Learning with VLMs
by: Venuto, David, et al.
Published: (2024)

Effective Protein-Protein Interaction Exploration with PPIretrieval
by: Hua, Chenqing, et al.
Published: (2024)

Affordances Enable Partial World Modeling with LLMs
by: Khetarpal, Khimya, et al.
Published: (2026)

A Distributional Analogue to the Successor Representation
by: Wiltzer, Harley, et al.
Published: (2024)

Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning
by: Zhao, Mingde, et al.
Published: (2023)

Rotation-Preserving Supervised Fine-Tuning
by: Jin, Hangzhan, et al.
Published: (2026)

MUDiff: Unified Diffusion for Complete Molecule Generation
by: Hua, Chenqing, et al.
Published: (2023)