Saved in:
| Main Author: | Castro, Pablo Samuel |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.16175 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Survey of State Representation Learning for Deep Reinforcement Learning
by: Echchahed, Ayoub, et al.
Published: (2025)
by: Echchahed, Ayoub, et al.
Published: (2025)
Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning
by: Sokar, Ghada, et al.
Published: (2025)
by: Sokar, Ghada, et al.
Published: (2025)
The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks
by: Mayor, Walter, et al.
Published: (2025)
by: Mayor, Walter, et al.
Published: (2025)
CALE: Continuous Arcade Learning Environment
by: Farebrother, Jesse, et al.
Published: (2024)
by: Farebrother, Jesse, et al.
Published: (2024)
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
by: Tang, Hongyao, et al.
Published: (2025)
by: Tang, Hongyao, et al.
Published: (2025)
Stable Deep Reinforcement Learning via Isotropic Gaussian Representations
by: Pasand, Ali Saheb, et al.
Published: (2026)
by: Pasand, Ali Saheb, et al.
Published: (2026)
Multi-Task Reinforcement Learning Enables Parameter Scaling
by: McLean, Reginald, et al.
Published: (2025)
by: McLean, Reginald, et al.
Published: (2025)
Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning
by: Garcin, Samuel, et al.
Published: (2025)
by: Garcin, Samuel, et al.
Published: (2025)
ARM-FM: Automated Reward Machines via Foundation Models for Compositional Reinforcement Learning
by: Castanyer, Roger Creus, et al.
Published: (2025)
by: Castanyer, Roger Creus, et al.
Published: (2025)
The Generalization Gap in Offline Reinforcement Learning
by: Mediratta, Ishita, et al.
Published: (2023)
by: Mediratta, Ishita, et al.
Published: (2023)
GAR: Generative Adversarial Reinforcement Learning for Formal Theorem Proving
by: Wang, Ruida, et al.
Published: (2025)
by: Wang, Ruida, et al.
Published: (2025)
Achieving the Tightest Relaxation of Sigmoids for Formal Verification
by: Chevalier, Samuel, et al.
Published: (2024)
by: Chevalier, Samuel, et al.
Published: (2024)
Transformers Provably Implement In-Context Reinforcement Learning with Policy Improvement
by: Liang, Haodong, et al.
Published: (2026)
by: Liang, Haodong, et al.
Published: (2026)
Probabilistic Constrained Reinforcement Learning with Formal Interpretability
by: Wang, Yanran, et al.
Published: (2023)
by: Wang, Yanran, et al.
Published: (2023)
Bridging Domain Gaps with Target-Aligned Generation for Offline Reinforcement Learning
by: Kim, Minung, et al.
Published: (2026)
by: Kim, Minung, et al.
Published: (2026)
In value-based deep reinforcement learning, a pruned network is a good network
by: Obando-Ceron, Johan, et al.
Published: (2024)
by: Obando-Ceron, Johan, et al.
Published: (2024)
Empirical Design in Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2023)
by: Patterson, Andrew, et al.
Published: (2023)
Bridging the Performance Gap Between Target-Free and Target-Based Reinforcement Learning
by: Vincent, Théo, et al.
Published: (2025)
by: Vincent, Théo, et al.
Published: (2025)
The Definitive Guide to Policy Gradients in Deep Reinforcement Learning: Theory, Algorithms and Implementations
by: Lehmann, Matthias
Published: (2024)
by: Lehmann, Matthias
Published: (2024)
Formally Verifying Deep Reinforcement Learning Controllers with Lyapunov Barrier Certificates
by: Mandal, Udayan, et al.
Published: (2024)
by: Mandal, Udayan, et al.
Published: (2024)
Understanding What Affects the Generalization Gap in Visual Reinforcement Learning: Theory and Empirical Evidence
by: Lyu, Jiafei, et al.
Published: (2024)
by: Lyu, Jiafei, et al.
Published: (2024)
Bridging Dynamics Gaps via Diffusion Schrödinger Bridge for Cross-Domain Reinforcement Learning
by: Zhang, Hanping, et al.
Published: (2026)
by: Zhang, Hanping, et al.
Published: (2026)
Bridging the Domain Gap in Equation Distillation with Reinforcement Feedback
by: Ying, Wangyang, et al.
Published: (2025)
by: Ying, Wangyang, et al.
Published: (2025)
Bridging the Sim-to-Real Gap in Reinforcement Learning-Based Industrial Dispatching through Execution Semantics
by: Hoss, Jonathan, et al.
Published: (2026)
by: Hoss, Jonathan, et al.
Published: (2026)
AltNet: Addressing the Plasticity-Stability Dilemma in Reinforcement Learning
by: Maheshwari, Mansi, et al.
Published: (2025)
by: Maheshwari, Mansi, et al.
Published: (2025)
Learning to Generate Formally Verifiable Step-by-Step Logic Reasoning via Structured Formal Intermediaries
by: Chen, Luoxin, et al.
Published: (2026)
by: Chen, Luoxin, et al.
Published: (2026)
On the consistency of hyper-parameter selection in value-based deep reinforcement learning
by: Obando-Ceron, Johan, et al.
Published: (2024)
by: Obando-Ceron, Johan, et al.
Published: (2024)
Don't flatten, tokenize! Unlocking the key to SoftMoE's efficacy in deep RL
by: Sokar, Ghada, et al.
Published: (2024)
by: Sokar, Ghada, et al.
Published: (2024)
Mixture of Experts in a Mixture of RL settings
by: Willi, Timon, et al.
Published: (2024)
by: Willi, Timon, et al.
Published: (2024)
Simplicial Embeddings Improve Sample Efficiency in Actor-Critic Agents
by: Obando-Ceron, Johan, et al.
Published: (2025)
by: Obando-Ceron, Johan, et al.
Published: (2025)
A Research Agenda for Usability and Generalisation in Reinforcement Learning
by: Soemers, Dennis J. N. J., et al.
Published: (2024)
by: Soemers, Dennis J. N. J., et al.
Published: (2024)
Which Rewards Matter? Reward Selection for Reinforcement Learning under Limited Feedback
by: Chaudhari, Shreyas, et al.
Published: (2025)
by: Chaudhari, Shreyas, et al.
Published: (2025)
CORE: Concept-Oriented Reinforcement for Bridging the Definition-Application Gap in Mathematical Reasoning
by: Gao, Zijun, et al.
Published: (2025)
by: Gao, Zijun, et al.
Published: (2025)
Implementing Reinforcement Learning Datacenter Congestion Control in NVIDIA NICs
by: Fuhrer, Benjamin, et al.
Published: (2022)
by: Fuhrer, Benjamin, et al.
Published: (2022)
Product Interaction: An Algebraic Formalism for Deep Learning Architectures
by: Dong, Haonan, et al.
Published: (2026)
by: Dong, Haonan, et al.
Published: (2026)
Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
by: Yuan, Mingqi, et al.
Published: (2025)
by: Yuan, Mingqi, et al.
Published: (2025)
Perception Learning: A Formal Separation of Sensory Representation Learning from Decision Learning
by: Sanyal, Suman
Published: (2025)
by: Sanyal, Suman
Published: (2025)
Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search
by: Sokota, Samuel, et al.
Published: (2025)
by: Sokota, Samuel, et al.
Published: (2025)
NAVIX: Scaling MiniGrid Environments with JAX
by: Pignatelli, Eduardo, et al.
Published: (2024)
by: Pignatelli, Eduardo, et al.
Published: (2024)
Rank-1 Approximation of Inverse Fisher for Natural Policy Gradients in Deep Reinforcement Learning
by: Huo, Yingxiao, et al.
Published: (2026)
by: Huo, Yingxiao, et al.
Published: (2026)
Similar Items
-
A Survey of State Representation Learning for Deep Reinforcement Learning
by: Echchahed, Ayoub, et al.
Published: (2025) -
Mind the GAP! The Challenges of Scale in Pixel-based Deep Reinforcement Learning
by: Sokar, Ghada, et al.
Published: (2025) -
The Impact of On-Policy Parallelized Data Collection on Deep Reinforcement Learning Networks
by: Mayor, Walter, et al.
Published: (2025) -
CALE: Continuous Arcade Learning Environment
by: Farebrother, Jesse, et al.
Published: (2024) -
Mitigating Plasticity Loss in Continual Reinforcement Learning by Reducing Churn
by: Tang, Hongyao, et al.
Published: (2025)