:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Mesbahi, Golnaz, Panahi, Parham Mohammad, Mastikhina, Olya, Tang, Steven, White, Martha, White, Adam
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2404.02113
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Investigating the Interplay of Prioritized Replay and Generalization
by: Panahi, Parham Mohammad, et al.
Published: (2024)

Forager: a lightweight testbed for continual learning with partial observability in RL
by: Tang, Steven, et al.
Published: (2026)

A New View on Planning in Online Reinforcement Learning
by: Roice, Kevin, et al.
Published: (2024)

Optimistic critics can empower small actors
by: Mastikhina, Olya, et al.
Published: (2025)

Goal-Space Planning with Subgoal Models
by: Lo, Chunlok, et al.
Published: (2022)

Position: Benchmarking is Limited in Reinforcement Learning Research
by: Jordan, Scott M., et al.
Published: (2024)

Fine-Tuning without Performance Degradation
by: Wang, Han, et al.
Published: (2025)

A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2021)

Investigating Action Encodings in Recurrent Neural Networks in Reinforcement Learning
by: Schlegel, Matthew, et al.
Published: (2026)

Real-Time Recurrent Learning using Trace Units in Reinforcement Learning
by: Elelimy, Esraa, et al.
Published: (2024)

Empirical Design in Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2023)

Scores as Actions: a framework of fine-tuning diffusion models by continuous-time reinforcement learning
by: Zhao, Hanyang, et al.
Published: (2024)

The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning
by: Patterson, Andrew, et al.
Published: (2024)

Revisiting Mixture Policies in Entropy-Regularized Actor-Critic
by: He, Jiamin, et al.
Published: (2026)

Deep Double Q-learning
by: Nagarajan, Prabhat, et al.
Published: (2025)

What to Do When Your Discrete Optimization Is the Size of a Neural Network?
by: Silva, Hugo, et al.
Published: (2024)

Deep Reinforcement Learning with Gradient Eligibility Traces
by: Elelimy, Esraa, et al.
Published: (2025)

Quantum reinforcement learning in continuous action space
by: Wu, Shaojun, et al.
Published: (2020)

Avoiding mode collapse in diffusion models fine-tuned with reinforcement learning
by: Barceló, Roberto, et al.
Published: (2024)

SAFE setup for generative molecular design
by: Mesbahi, Yassir El, et al.
Published: (2024)

Value Bonuses using Ensemble Errors for Exploration in Reinforcement Learning
by: Wahab, Abdul, et al.
Published: (2026)

Averaging $n$-step Returns Reduces Variance in Reinforcement Learning
by: Daley, Brett, et al.
Published: (2024)

A Gaussian Comparison Theorem for Training Dynamics in Machine Learning
by: Panahi, Ashkan
Published: (2026)

Gradient Iterated Temporal-Difference Learning
by: Vincent, Théo, et al.
Published: (2026)

Demystifying the Recency Heuristic in Temporal-Difference Learning
by: Daley, Brett, et al.
Published: (2024)

Distributions as Actions: A Unified Framework for Diverse Action Spaces
by: He, Jiamin, et al.
Published: (2025)

Towards Interpretability in Audio and Visual Affective Machine Learning: A Review
by: Johnson, David S., et al.
Published: (2023)

Multi Agent Reinforcement Learning for Sequential Satellite Assignment Problems
by: Holder, Joshua, et al.
Published: (2024)

Feature-driven reinforcement learning for photovoltaic in continuous intraday trading
by: Abate, Arega Getaneh, et al.
Published: (2025)

Randomized algorithms and PAC bounds for inverse reinforcement learning in continuous spaces
by: Kamoutsi, Angeliki, et al.
Published: (2024)

Task diversity produces systematic transfer but inhibits continual reinforcement learning
by: Seth, Purab, et al.
Published: (2026)

Trajectory-Aware Eligibility Traces for Off-Policy Reinforcement Learning
by: Daley, Brett, et al.
Published: (2023)

Small Models Are (Still) Effective Cross-Domain Argument Extractors
by: Gantt, William, et al.
Published: (2024)

Deep reinforcement learning for weakly coupled MDP's with continuous actions
by: Robledo, Francisco, et al.
Published: (2024)

Generalized Munchausen Reinforcement Learning using Tsallis KL Divergence
by: Zhu, Lingwei, et al.
Published: (2023)

Rethinking the Foundations for Continual Reinforcement Learning
by: Elelimy, Esraa, et al.
Published: (2025)

When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
by: Liu, Vincent, et al.
Published: (2023)

A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning
by: Adkins, Jacob, et al.
Published: (2024)

BAPR: Bayesian amnesic piecewise-robust reinforcement learning for non-stationary continuous control
by: Zhang, Yifan, et al.
Published: (2026)

q-exponential family for policy optimization
by: Zhu, Lingwei, et al.
Published: (2024)