:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Castro, Pablo Samuel
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.16175
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Survey of State Representation Learning for Deep Reinforcement Learning
by: Echchahed, Ayoub, et al.
Published: (2025)

Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning
by: Sokar, Ghada, et al.
Published: (2025)

The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks
by: Mayor, Walter, et al.
Published: (2025)

CALE: Continuous Arcade Learning Environment
by: Farebrother, Jesse, et al.
Published: (2024)

Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
by: Tang, Hongyao, et al.
Published: (2025)

Stable Deep Reinforcement Learning via Isotropic Gaussian Representations
by: Pasand, Ali Saheb, et al.
Published: (2026)

Multi-Task Reinforcement Learning Enables Parameter Scaling
by: McLean, Reginald, et al.
Published: (2025)

Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
by: Garcin, Samuel, et al.
Published: (2025)

ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning
by: Castanyer, Roger Creus, et al.
Published: (2025)

The Generalization Gap in Offline Reinforcement Learning
by: Mediratta, Ishita, et al.
Published: (2023)

GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving
by: Wang, Ruida, et al.
Published: (2025)

Achieving the Tightest Relaxation of Sigmoids for Formal Verification
by: Chevalier, Samuel, et al.
Published: (2024)

Transformers Provably Implement In-Context Reinforcement Learning with Policy Improvement
by: Liang, Haodong, et al.
Published: (2026)

Probabilistic Constrained Reinforcement Learning with Formal Interpretability
by: Wang, Yanran, et al.
Published: (2023)

Bridging Domain Gaps with Target-Aligned Generation for Offline Reinforcement Learning
by: Kim, Minung, et al.
Published: (2026)

In value-based deep reinforcement learning, a pruned network is a good network
by: Obando-Ceron, Johan, et al.
Published: (2024)

Empirical Design in Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2023)

Bridging the Performance Gap Between Target-Free and Target-Based Reinforcement Learning
by: Vincent, Théo, et al.
Published: (2025)

The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations
by: Lehmann, Matthias
Published: (2024)

Formally Verifying Deep Reinforcement Learning Controllers with Lyapunov Barrier Certificates
by: Mandal, Udayan, et al.
Published: (2024)

Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence
by: Lyu, Jiafei, et al.
Published: (2024)

Bridging Dynamics Gaps via Diffusion Schrödinger Bridge for Cross-Domain Reinforcement Learning
by: Zhang, Hanping, et al.
Published: (2026)

Bridging the Domain Gap in Equation Distillation with Reinforcement Feedback
by: Ying, Wangyang, et al.
Published: (2025)

Bridging the Sim-to-Real Gap in Reinforcement Learning-Based Industrial Dispatching through Execution Semantics
by: Hoss, Jonathan, et al.
Published: (2026)

AltNet: Addressing the Plasticity-Stability Dilemma in Reinforcement Learning
by: Maheshwari, Mansi, et al.
Published: (2025)

Learning to Generate Formally Verifiable Step-by-Step Logic Reasoning via Structured Formal Intermediaries
by: Chen, Luoxin, et al.
Published: (2026)

On the consistency of hyper-parameter selection in value-based deep reinforcement learning
by: Obando-Ceron, Johan, et al.
Published: (2024)

Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
by: Sokar, Ghada, et al.
Published: (2024)

Mixture of Experts in a Mixture of RL settings
by: Willi, Timon, et al.
Published: (2024)

Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents
by: Obando-Ceron, Johan, et al.
Published: (2025)

A Research Agenda for Usability and Generalisation in Reinforcement Learning
by: Soemers, Dennis J. N. J., et al.
Published: (2024)

Which Rewards Matter? Reward Selection for Reinforcement Learning under Limited Feedback
by: Chaudhari, Shreyas, et al.
Published: (2025)

CORE: Concept-Oriented Reinforcement for Bridging the Definition-Application Gap in Mathematical Reasoning
by: Gao, Zijun, et al.
Published: (2025)

Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
by: Fuhrer, Benjamin, et al.
Published: (2022)

Product Interaction: An Algebraic Formalism for Deep Learning Architectures
by: Dong, Haonan, et al.
Published: (2026)

Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
by: Yuan, Mingqi, et al.
Published: (2025)

Perception Learning: A Formal Separation of Sensory Representation Learning from Decision Learning
by: Sanyal, Suman
Published: (2025)

Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search
by: Sokota, Samuel, et al.
Published: (2025)

NAVIX: Scaling MiniGrid Environments with JAX
by: Pignatelli, Eduardo, et al.
Published: (2024)

Rank-1 Approximation of Inverse Fisher for Natural Policy Gradients in Deep Reinforcement Learning
by: Huo, Yingxiao, et al.
Published: (2026)