:: Library Catalog

Bibliographic Details
Main Authors:	Voelcker, Claas A; Hussing, Marcel; Eaton, Eric; Farahmand, Amir-massoud; Gilitschenski, Igor
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2410.08896

Similar Items

Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
by: Hussing, Marcel, et al.
Published: (2024)

When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning
by: Voelcker, Claas, et al.
Published: (2024)

Relative Entropy Pathwise Policy Optimization
by: Voelcker, Claas, et al.
Published: (2025)

$λ$-models: Effective Decision-Aware Reinforcement Learning with Latent Models
by: Voelcker, Claas A, et al.
Published: (2023)

Calibrated Value-Aware Model Learning with Probabilistic Environment Models
by: Voelcker, Claas, et al.
Published: (2025)

Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
by: Voelcker, Claas A, et al.
Published: (2024)

Behavior-Consistent Deep Reinforcement Learning
by: Hussing, Marcel, et al.
Published: (2026)

Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
by: Opryshko, Evgenii, et al.
Published: (2025)

Distributed Continual Learning
by: Le, Long, et al.
Published: (2024)

PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
by: Ma, Avery, et al.
Published: (2025)

PID Accelerated Temporal Difference Algorithms
by: Bedaywi, Mark, et al.
Published: (2024)

Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate Gradients
by: Kemertas, Mete, et al.
Published: (2023)

Press Start to Charge: Videogaming the Online Centralized Charging Scheduling Problem
by: Ghahtarani, Alireza, et al.
Published: (2026)

Deflated Dynamics Value Iteration
by: Lee, Jongmin, et al.
Published: (2024)

A Truncated Newton Method for Optimal Transport
by: Kemertas, Mete, et al.
Published: (2025)

Improving Adversarial Transferability via Model Alignment
by: Ma, Avery, et al.
Published: (2023)

Majority of the Bests: Improving Best-of-N via Bootstrapping
by: Rakhsha, Amin, et al.
Published: (2025)

Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning
by: Hussing, Marcel, et al.
Published: (2023)

Iterative Compositional Data Generation for Robot Control
by: Pham, Anh-Quan, et al.
Published: (2025)

Replicable Reinforcement Learning with Linear Function Approximation
by: Eaton, Eric, et al.
Published: (2025)

Model Agreement via Anchoring
by: Eaton, Eric, et al.
Published: (2026)

Intersectional Fairness in Reinforcement Learning with Large State and Constraint Spaces
by: Eaton, Eric, et al.
Published: (2025)

Update-Free On-Policy Steering via Verifiers
by: Attarian, Maria, et al.
Published: (2026)

TESPEC: Temporally-Enhanced Self-Supervised Pretraining for Event Cameras
by: Mohammadi, Mohammad, et al.
Published: (2025)

Temporal-Difference Learning Using Distributed Error Signals
by: Guan, Jonas, et al.
Published: (2024)

Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling
by: Zheng, Shuhong, et al.
Published: (2025)

Sorrel: A simple and flexible framework for multi-agent reinforcement learning
by: Gelpí, Rebekah A., et al.
Published: (2025)

Realistic Evaluation of Model Merging for Compositional Generalization
by: Tam, Derek, et al.
Published: (2024)

Augmenting Offline RL with Unlabeled Data
by: Wang, Zhao, et al.
Published: (2024)

Oracle-Efficient Reinforcement Learning for Max Value Ensembles
by: Hussing, Marcel, et al.
Published: (2024)

IBCL: Zero-shot Model Generation under Stability-Plasticity Trade-offs
by: Lu, Pengyuan, et al.
Published: (2023)

SPEQ: Offline Stabilization Phases for Efficient Q-Learning in High Update-To-Data Ratio Reinforcement Learning
by: Romeo, Carlo, et al.
Published: (2025)

Producing and Leveraging Online Map Uncertainty in Trajectory Prediction
by: Gu, Xunjiang, et al.
Published: (2024)

Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention
by: Gu, Xunjiang, et al.
Published: (2024)

Data Augmentation for Continual RL via Adversarial Gradient Episodic Memory
by: Wu, Sihao, et al.
Published: (2024)

Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates
by: Corrado, Nicholas E., et al.
Published: (2023)

Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis
by: Huang, Ruiquan, et al.
Published: (2025)

Recovery Guarantees for Continual Learning of Dependent Tasks: Memory, Data-Dependent Regularization, and Data-Dependent Weights
by: Peng, Liangzu, et al.
Published: (2026)

Data Augmentations for Improved (Large) Language Model Generalization
by: Feder, Amir, et al.
Published: (2023)

ReAugment: Model Zoo-Guided RL for Few-Shot Time Series Augmentation and Forecasting
by: Yuan, Haochen, et al.
Published: (2024)