Saved in:
| Main Authors: | Voelcker, Claas A, Hussing, Marcel, Eaton, Eric, Farahmand, Amir-massoud, Gilitschenski, Igor |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.08896 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
by: Hussing, Marcel, et al.
Published: (2024)
by: Hussing, Marcel, et al.
Published: (2024)
When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning
by: Voelcker, Claas, et al.
Published: (2024)
by: Voelcker, Claas, et al.
Published: (2024)
Relative Entropy Pathwise Policy Optimization
by: Voelcker, Claas, et al.
Published: (2025)
by: Voelcker, Claas, et al.
Published: (2025)
$λ$-models: Effective Decision-Aware Reinforcement Learning with Latent Models
by: Voelcker, Claas A, et al.
Published: (2023)
by: Voelcker, Claas A, et al.
Published: (2023)
Calibrated Value-Aware Model Learning with Probabilistic Environment Models
by: Voelcker, Claas, et al.
Published: (2025)
by: Voelcker, Claas, et al.
Published: (2025)
Can we hop in general? A discussion of benchmark selection and design using the Hopper environment
by: Voelcker, Claas A, et al.
Published: (2024)
by: Voelcker, Claas A, et al.
Published: (2024)
Behavior-Consistent Deep Reinforcement Learning
by: Hussing, Marcel, et al.
Published: (2026)
by: Hussing, Marcel, et al.
Published: (2026)
Test-Time Graph Search for Goal-Conditioned Reinforcement Learning
by: Opryshko, Evgenii, et al.
Published: (2025)
by: Opryshko, Evgenii, et al.
Published: (2025)
Distributed Continual Learning
by: Le, Long, et al.
Published: (2024)
by: Le, Long, et al.
Published: (2024)
PANDAS: Improving Many-shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling
by: Ma, Avery, et al.
Published: (2025)
by: Ma, Avery, et al.
Published: (2025)
PID Accelerated Temporal Difference Algorithms
by: Bedaywi, Mark, et al.
Published: (2024)
by: Bedaywi, Mark, et al.
Published: (2024)
Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate Gradients
by: Kemertas, Mete, et al.
Published: (2023)
by: Kemertas, Mete, et al.
Published: (2023)
Press Start to Charge: Videogaming the Online Centralized Charging Scheduling Problem
by: Ghahtarani, Alireza, et al.
Published: (2026)
by: Ghahtarani, Alireza, et al.
Published: (2026)
Deflated Dynamics Value Iteration
by: Lee, Jongmin, et al.
Published: (2024)
by: Lee, Jongmin, et al.
Published: (2024)
A Truncated Newton Method for Optimal Transport
by: Kemertas, Mete, et al.
Published: (2025)
by: Kemertas, Mete, et al.
Published: (2025)
Improving Adversarial Transferability via Model Alignment
by: Ma, Avery, et al.
Published: (2023)
by: Ma, Avery, et al.
Published: (2023)
Majority of the Bests: Improving Best-of-N via Bootstrapping
by: Rakhsha, Amin, et al.
Published: (2025)
by: Rakhsha, Amin, et al.
Published: (2025)
Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning
by: Hussing, Marcel, et al.
Published: (2023)
by: Hussing, Marcel, et al.
Published: (2023)
Iterative Compositional Data Generation for Robot Control
by: Pham, Anh-Quan, et al.
Published: (2025)
by: Pham, Anh-Quan, et al.
Published: (2025)
Replicable Reinforcement Learning with Linear Function Approximation
by: Eaton, Eric, et al.
Published: (2025)
by: Eaton, Eric, et al.
Published: (2025)
Model Agreement via Anchoring
by: Eaton, Eric, et al.
Published: (2026)
by: Eaton, Eric, et al.
Published: (2026)
Intersectional Fairness in Reinforcement Learning with Large State and Constraint Spaces
by: Eaton, Eric, et al.
Published: (2025)
by: Eaton, Eric, et al.
Published: (2025)
Update-Free On-Policy Steering via Verifiers
by: Attarian, Maria, et al.
Published: (2026)
by: Attarian, Maria, et al.
Published: (2026)
TESPEC: Temporally-Enhanced Self-Supervised Pretraining for Event Cameras
by: Mohammadi, Mohammad, et al.
Published: (2025)
by: Mohammadi, Mohammad, et al.
Published: (2025)
Temporal-Difference Learning Using Distributed Error Signals
by: Guan, Jonas, et al.
Published: (2024)
by: Guan, Jonas, et al.
Published: (2024)
Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling
by: Zheng, Shuhong, et al.
Published: (2025)
by: Zheng, Shuhong, et al.
Published: (2025)
Sorrel: A simple and flexible framework for multi-agent reinforcement learning
by: Gelpí, Rebekah A., et al.
Published: (2025)
by: Gelpí, Rebekah A., et al.
Published: (2025)
Realistic Evaluation of Model Merging for Compositional Generalization
by: Tam, Derek, et al.
Published: (2024)
by: Tam, Derek, et al.
Published: (2024)
Augmenting Offline RL with Unlabeled Data
by: Wang, Zhao, et al.
Published: (2024)
by: Wang, Zhao, et al.
Published: (2024)
Oracle-Efficient Reinforcement Learning for Max Value Ensembles
by: Hussing, Marcel, et al.
Published: (2024)
by: Hussing, Marcel, et al.
Published: (2024)
IBCL: Zero-shot Model Generation under Stability-Plasticity Trade-offs
by: Lu, Pengyuan, et al.
Published: (2023)
by: Lu, Pengyuan, et al.
Published: (2023)
SPEQ: Offline Stabilization Phases for Efficient Q-Learning in High Update-To-Data Ratio Reinforcement Learning
by: Romeo, Carlo, et al.
Published: (2025)
by: Romeo, Carlo, et al.
Published: (2025)
Producing and Leveraging Online Map Uncertainty in Trajectory Prediction
by: Gu, Xunjiang, et al.
Published: (2024)
by: Gu, Xunjiang, et al.
Published: (2024)
Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention
by: Gu, Xunjiang, et al.
Published: (2024)
by: Gu, Xunjiang, et al.
Published: (2024)
Data Augmentation for Continual RL via Adversarial Gradient Episodic Memory
by: Wu, Sihao, et al.
Published: (2024)
by: Wu, Sihao, et al.
Published: (2024)
Understanding when Dynamics-Invariant Data Augmentations Benefit Model-Free Reinforcement Learning Updates
by: Corrado, Nicholas E., et al.
Published: (2023)
by: Corrado, Nicholas E., et al.
Published: (2023)
Augmenting Online RL with Offline Data is All You Need: A Unified Hybrid RL Algorithm Design and Analysis
by: Huang, Ruiquan, et al.
Published: (2025)
by: Huang, Ruiquan, et al.
Published: (2025)
Recovery Guarantees for Continual Learning of Dependent Tasks: Memory, Data-Dependent Regularization, and Data-Dependent Weights
by: Peng, Liangzu, et al.
Published: (2026)
by: Peng, Liangzu, et al.
Published: (2026)
Data Augmentations for Improved (Large) Language Model Generalization
by: Feder, Amir, et al.
Published: (2023)
by: Feder, Amir, et al.
Published: (2023)
ReAugment: Model Zoo-Guided RL for Few-Shot Time Series Augmentation and Forecasting
by: Yuan, Haochen, et al.
Published: (2024)
by: Yuan, Haochen, et al.
Published: (2024)
Similar Items
-
Dissecting Deep RL with High Update Ratios: Combatting Value Divergence
by: Hussing, Marcel, et al.
Published: (2024) -
When does Self-Prediction help? Understanding Auxiliary Tasks in Reinforcement Learning
by: Voelcker, Claas, et al.
Published: (2024) -
Relative Entropy Pathwise Policy Optimization
by: Voelcker, Claas, et al.
Published: (2025) -
$λ$-models: Effective Decision-Aware Reinforcement Learning with Latent Models
by: Voelcker, Claas A, et al.
Published: (2023) -
Calibrated Value-Aware Model Learning with Probabilistic Environment Models
by: Voelcker, Claas, et al.
Published: (2025)