Saved in:
| Main Authors: | Mesbahi, Golnaz, Panahi, Parham Mohammad, Mastikhina, Olya, Tang, Steven, White, Martha, White, Adam |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.02113 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Investigating the Interplay of Prioritized Replay and Generalization
by: Panahi, Parham Mohammad, et al.
Published: (2024)
by: Panahi, Parham Mohammad, et al.
Published: (2024)
Forager: a lightweight testbed for continual learning with partial observability in RL
by: Tang, Steven, et al.
Published: (2026)
by: Tang, Steven, et al.
Published: (2026)
A New View on Planning in Online Reinforcement Learning
by: Roice, Kevin, et al.
Published: (2024)
by: Roice, Kevin, et al.
Published: (2024)
Optimistic critics can empower small actors
by: Mastikhina, Olya, et al.
Published: (2025)
by: Mastikhina, Olya, et al.
Published: (2025)
Goal-Space Planning with Subgoal Models
by: Lo, Chunlok, et al.
Published: (2022)
by: Lo, Chunlok, et al.
Published: (2022)
Position: Benchmarking is Limited in Reinforcement Learning Research
by: Jordan, Scott M., et al.
Published: (2024)
by: Jordan, Scott M., et al.
Published: (2024)
Fine-Tuning without Performance Degradation
by: Wang, Han, et al.
Published: (2025)
by: Wang, Han, et al.
Published: (2025)
A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2021)
by: Patterson, Andrew, et al.
Published: (2021)
Investigating Action Encodings in Recurrent Neural Networks in Reinforcement Learning
by: Schlegel, Matthew, et al.
Published: (2026)
by: Schlegel, Matthew, et al.
Published: (2026)
Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
by: Elelimy, Esraa, et al.
Published: (2024)
by: Elelimy, Esraa, et al.
Published: (2024)
Empirical Design in Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2023)
by: Patterson, Andrew, et al.
Published: (2023)
Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning
by: Zhao, Hanyang, et al.
Published: (2024)
by: Zhao, Hanyang, et al.
Published: (2024)
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2024)
by: Patterson, Andrew, et al.
Published: (2024)
Revisiting Mixture Policies in Entropy-Regularized Actor-Critic
by: He, Jiamin, et al.
Published: (2026)
by: He, Jiamin, et al.
Published: (2026)
Deep Double Q-learning
by: Nagarajan, Prabhat, et al.
Published: (2025)
by: Nagarajan, Prabhat, et al.
Published: (2025)
What to Do When Your Discrete Optimization Is the Size of a Neural Network?
by: Silva, Hugo, et al.
Published: (2024)
by: Silva, Hugo, et al.
Published: (2024)
Deep Reinforcement Learning with Gradient Eligibility Traces
by: Elelimy, Esraa, et al.
Published: (2025)
by: Elelimy, Esraa, et al.
Published: (2025)
Quantum reinforcement learning in continuous action space
by: Wu, Shaojun, et al.
Published: (2020)
by: Wu, Shaojun, et al.
Published: (2020)
Avoiding mode collapse in diffusion models fine-tuned with reinforcement learning
by: Barceló, Roberto, et al.
Published: (2024)
by: Barceló, Roberto, et al.
Published: (2024)
SAFE setup for generative molecular design
by: Mesbahi, Yassir El, et al.
Published: (2024)
by: Mesbahi, Yassir El, et al.
Published: (2024)
Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning
by: Wahab, Abdul, et al.
Published: (2026)
by: Wahab, Abdul, et al.
Published: (2026)
Averaging $n$-step Returns Reduces Variance in Reinforcement Learning
by: Daley, Brett, et al.
Published: (2024)
by: Daley, Brett, et al.
Published: (2024)
A Gaussian Comparison Theorem for Training Dynamics in Machine Learning
by: Panahi, Ashkan
Published: (2026)
by: Panahi, Ashkan
Published: (2026)
Gradient Iterated Temporal-Difference Learning
by: Vincent, Théo, et al.
Published: (2026)
by: Vincent, Théo, et al.
Published: (2026)
Demystifying the Recency Heuristic in Temporal-Difference Learning
by: Daley, Brett, et al.
Published: (2024)
by: Daley, Brett, et al.
Published: (2024)
Distributions as Actions: A Unified Framework for Diverse Action Spaces
by: He, Jiamin, et al.
Published: (2025)
by: He, Jiamin, et al.
Published: (2025)
Towards Interpretability in Audio and Visual Affective Machine Learning: A Review
by: Johnson, David S., et al.
Published: (2023)
by: Johnson, David S., et al.
Published: (2023)
Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems
by: Holder, Joshua, et al.
Published: (2024)
by: Holder, Joshua, et al.
Published: (2024)
Feature-driven reinforcement learning for photovoltaic in continuous intraday trading
by: Abate, Arega Getaneh, et al.
Published: (2025)
by: Abate, Arega Getaneh, et al.
Published: (2025)
Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces
by: Kamoutsi, Angeliki, et al.
Published: (2024)
by: Kamoutsi, Angeliki, et al.
Published: (2024)
Task diversity produces systematic transfer but inhibits continual reinforcement learning
by: Seth, Purab, et al.
Published: (2026)
by: Seth, Purab, et al.
Published: (2026)
Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
by: Daley, Brett, et al.
Published: (2023)
by: Daley, Brett, et al.
Published: (2023)
Small Models Are (Still) Effective Cross-Domain Argument Extractors
by: Gantt, William, et al.
Published: (2024)
by: Gantt, William, et al.
Published: (2024)
Deep reinforcement learning for weakly coupled MDP's with continuous actions
by: Robledo, Francisco, et al.
Published: (2024)
by: Robledo, Francisco, et al.
Published: (2024)
Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence
by: Zhu, Lingwei, et al.
Published: (2023)
by: Zhu, Lingwei, et al.
Published: (2023)
Rethinking the Foundations for Continual Reinforcement Learning
by: Elelimy, Esraa, et al.
Published: (2025)
by: Elelimy, Esraa, et al.
Published: (2025)
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
by: Liu, Vincent, et al.
Published: (2023)
by: Liu, Vincent, et al.
Published: (2023)
A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning
by: Adkins, Jacob, et al.
Published: (2024)
by: Adkins, Jacob, et al.
Published: (2024)
BAPR: Bayesian amnesic piecewise-robust reinforcement learning for non-stationary continuous control
by: Zhang, Yifan, et al.
Published: (2026)
by: Zhang, Yifan, et al.
Published: (2026)
q-exponential family for policy optimization
by: Zhu, Lingwei, et al.
Published: (2024)
by: Zhu, Lingwei, et al.
Published: (2024)
Similar Items
-
Investigating the Interplay of Prioritized Replay and Generalization
by: Panahi, Parham Mohammad, et al.
Published: (2024) -
Forager: a lightweight testbed for continual learning with partial observability in RL
by: Tang, Steven, et al.
Published: (2026) -
A New View on Planning in Online Reinforcement Learning
by: Roice, Kevin, et al.
Published: (2024) -
Optimistic critics can empower small actors
by: Mastikhina, Olya, et al.
Published: (2025) -
Goal-Space Planning with Subgoal Models
by: Lo, Chunlok, et al.
Published: (2022)